Missing fneg and fcmp fast float LLVM intrinsics

Hello everybody.

I've noticed LLVM has two fast float intrinsics - fneg and fcmp (LLVM Language Reference Manual — LLVM 13 documentation) which are missing in Rust nightly (std::intrinsics - Rust). Is there any reason they are missing? How complicated would it be to add them?

Thank you

They'd be easy to add, but "fast" floats are pretty much in limbo right now. They're not exposed to stable, and there's no plan to do so. So there's not really a reason for the compiler team to take the cost of having intrinsics for them.

Here's one of the many other IRLO threads discussing what to do with them: Suggestion: FFastMath intrinsics in stable - #60 by scottmcm

1 Like

Thanks for your reply.

Would the compiler team accept PR adding them even if they remain in nightly only or do you prefer to not spend time on it currently?

Bonus and most likely stupid question: Why some of the intrinsics are kept always in nightly and never get into stable Rust?

All intrinsics are unstable and implementation details. Many intrinsics have a stable wrapper function, but this doesn't imply that the wrapper will forever be implemented by calling an intrinsic. Many can be rewritten to avoid any intrinsics.

Intrinsics come and go over time. Sometimes the language itself gains a new feature that allows implementing the wrapper without intrinsics. For example std::mem::forget used to be implemented using an intrinsic, but is now implemented using ManuallyDrop. Other intrinsics become unused for other reasons and may be removed. Yet other intrinsics are introduced as implementation detail for a new feature. There are also intrinsics that can only be meaningfully implemented by a single codegen backend. While LLVM is currently the only stable backend, there are several others like cg_clif (created by me), cg_gcc (a GCC backend) and cg_spirv (part of the rust-gpu project). As intrinsics may be added and removed at any time, they won't ever be stabilized and thus can't be used on stable.

5 Likes

Understood - thanks for the detailed explanation!

I'm not on the compiler team (nor the compiler contributers team) and am not familiar with their thinking on such things.

But generally from a software engineering perspective, if there's not a need to have it for an approved initiative, then it's unclear that adding it is necessary.

One can always use a local fork for experimentation, or for fneg and fcmp there are reasonable proxies that should allow experimentation just fine -- fast_sub(0, x) probably works great for fast_fneg, and you could try x.partial_cmp(y).unwrap_unchecked() for fast_fcmp.

1 Like

The fast_* intrinsics are unfortunately mostly useless as it is now, since there are many different fast flags and these just enable all of them. So it would be a service to Rust to further this area somehow. I'm not working on it, unfortunately, even though the most interesting direction to explore (to me) would be the safest subset of flags that would still improve autovectorization for certain loops.

5 Likes

Okay, so it sounds like we need to wait for some higher level facade on top of those intrinsics. Having those optimizations working would be really great not only for gaming but also for cpu intensive numerical code. I know there's SIMD but that's a bit different story. Thanks again - will keep track of this issue

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.