(Oops, it looks like I pasted the wrong code in my previous post. That Rust code should have been f32s, not i32s. I won’t edit it, since editing seems to do weird things to timestamps here.)
Absolutely.
As you suggested, that one was easy to produce.
This next one, which I’m not surprised didn’t work, does use SSE instructions, but it’s four scalar mulss instructions, not a single mulps:
pub fn my_dot_direct(a: f32x4, b: f32x4) -> f32 {
    let a: [f32; 4] = a.into();
    let b: [f32; 4] = b.into();
    a[0] * b[0] +
    a[1] * b[1] +
    a[2] * b[2] +
    a[3] * b[3]
}
And I guess the inliner is one of the passes that runs before the SLP vectorizer. I was hoping this would inline the vector fmul from my_mul and keep it intact, but nope: no fmul <4 x float> in the LLVM IR, no mulps in the assembly.
pub fn my_dot_usingmul(a: f32x4, b: f32x4) -> f32 {
    let c: [f32; 4] = my_mul(a, b).into();
    c[0] + c[1] + c[2] + c[3]
}
But then a tiny change makes it happy again. This one actually generates rather nice-looking vector operations even for the additions:
pub fn my_dot_usingmul_balanced(a: f32x4, b: f32x4) -> f32 {
    let c: [f32; 4] = my_mul(a, b).into();
    (c[0] + c[1]) + (c[2] + c[3])
}
(Well, except for the fact that apparently it didn’t notice that %2 and %4 are exactly the same thing, which I thought SSA was really good at.)
%2 = fmul <4 x float> %0, %1
%3 = shufflevector <4 x float> %2, <4 x float> undef, <2 x i32> <i32 0, i32 2>
%4 = fmul <4 x float> %0, %1
%5 = shufflevector <4 x float> %4, <4 x float> undef, <2 x i32> <i32 1, i32 3>
%6 = fadd <2 x float> %3, %5
%7 = extractelement <2 x float> %6, i32 0
%8 = extractelement <2 x float> %6, i32 1
%9 = fadd float %7, %8
ret float %9
Yay, non-associative floating point math!
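To make the non-associativity concrete: here’s a small stable-Rust demonstration (plain scalars, no f32x4 needed) that the two associations the compiler is choosing between really can give different answers, which is exactly why LLVM won’t re-associate the adds without fast-math flags:

```rust
// Strict source order: a left-to-right chain of fadds, which is what
// `c[0] + c[1] + c[2] + c[3]` means and what the vectorizer must preserve.
fn sum_chain(c: [f32; 4]) -> f32 {
    ((c[0] + c[1]) + c[2]) + c[3]
}

// Balanced tree, as in the "happy" version: pairs first, then the pair sums.
// This shape maps nicely onto shuffles plus vector fadds.
fn sum_tree(c: [f32; 4]) -> f32 {
    (c[0] + c[1]) + (c[2] + c[3])
}

fn main() {
    // 1.0e8 + 1.0 rounds back to 1.0e8 in f32 (the ulp at 1e8 is 8),
    // so where the 1.0s get absorbed depends on the association.
    let c = [1.0e8f32, 1.0, -1.0e8, 1.0];
    println!("chain = {}, tree = {}", sum_chain(c), sum_tree(c));
    // chain = 1, tree = 0: same inputs, different results.
}
```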
Makes me want a bunch of sum_unspecified_order() methods (on simd, slices, …), but that’s definitely a post-step-4 bikeshed.
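For the bikeshed’s sake, here’s one possible shape such a method could take — the name and signature are entirely made up, nothing like this exists in std. A pairwise (tree) reduction both gives the compiler license to use whatever association vectorizes best and happens to have better worst-case rounding error than a left-to-right fold:

```rust
// Hypothetical sketch of a sum_unspecified_order() for slices: recursive
// pairwise reduction. The association is a balanced tree rather than a
// chain, so no particular evaluation order is promised to the caller.
fn sum_unspecified_order(xs: &[f32]) -> f32 {
    match xs.len() {
        0 => 0.0,
        1 => xs[0],
        n => {
            let (lo, hi) = xs.split_at(n / 2);
            sum_unspecified_order(lo) + sum_unspecified_order(hi)
        }
    }
}

fn main() {
    println!("{}", sum_unspecified_order(&[1.0, 2.0, 3.0, 4.0]));
}
```

A real version would presumably bottom out into SIMD chunks rather than recurse all the way to single elements, but the point is just that the signature promises a sum without promising an order.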
(Hmm, looking at LLVM’s fast-math flags makes me want to go make an nnan_f32 type that lowers like that all the way down to LLVM, and would actually be Ord & Eq. But that’s a distraction for the future…)