They'd be easy to add, but "fast" floats are pretty much in limbo right now. They're not exposed on stable, and there's no plan to do so. So there's not really a reason for the compiler team to take on the cost of maintaining intrinsics for them.
All intrinsics are unstable and implementation details. Many intrinsics have a stable wrapper function, but this doesn't imply that the wrapper will forever be implemented by calling an intrinsic. Many can be rewritten to avoid any intrinsics.
Intrinsics come and go over time. Sometimes the language itself gains a new feature that allows implementing the wrapper without intrinsics. For example, std::mem::forget used to be implemented using an intrinsic, but is now implemented using ManuallyDrop. Other intrinsics become unused for other reasons and may be removed. Yet other intrinsics are introduced as an implementation detail of a new feature.

There are also intrinsics that can only be meaningfully implemented by a single codegen backend. While LLVM is currently the only stable backend, there are several others, like cg_clif (created by me), cg_gcc (a GCC backend) and cg_spirv (part of the rust-gpu project). Because intrinsics may be added and removed at any time, they won't ever be stabilized and thus can't be used on stable.
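As a concrete illustration of the std::mem::forget point above, here is a minimal sketch of a forget-like function written purely with ManuallyDrop and no intrinsic (the real std::mem::forget is essentially this one-liner; forget_like is just an illustrative name):

```rust
use std::mem::ManuallyDrop;

// Suppresses the destructor of `value` without any intrinsic:
// wrapping it in ManuallyDrop means the inner value is never dropped.
fn forget_like<T>(value: T) {
    let _ = ManuallyDrop::new(value);
}

fn main() {
    let v = vec![1, 2, 3];
    forget_like(v); // the Vec's heap allocation is leaked, not freed
}
```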
I'm not on the compiler team (nor the compiler contributors team) and am not familiar with their thinking on such things.
But generally from a software engineering perspective, if there's not a need to have it for an approved initiative, then it's unclear that adding it is necessary.
One can always use a local fork for experimentation, or for fneg and fcmp there are reasonable proxies that should allow experimentation just fine -- fast_sub(0, x) probably works great for fast_fneg, and you could try x.partial_cmp(y).unwrap_unchecked() for fast_fcmp.
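For concreteness, here's roughly what those proxies could look like (a minimal sketch, assuming a recent nightly with the unstable core_intrinsics feature; fast_fneg and fast_fcmp are just illustrative wrapper names, not real library functions, and fsub_fast is the existing fast-subtraction intrinsic):

```rust
#![feature(core_intrinsics)]

use std::cmp::Ordering;
use std::intrinsics::fsub_fast;

// Hypothetical "fast" negate: 0.0 - x through the fast-subtract intrinsic.
// Safety: fsub_fast requires that operands and result are finite.
unsafe fn fast_fneg(x: f32) -> f32 {
    unsafe { fsub_fast(0.0, x) }
}

// Hypothetical "fast" compare: the caller promises neither operand is NaN,
// so partial_cmp never returns None.
unsafe fn fast_fcmp(x: f32, y: f32) -> Ordering {
    unsafe { x.partial_cmp(&y).unwrap_unchecked() }
}

fn main() {
    unsafe {
        assert_eq!(fast_fneg(1.5), -1.5);
        assert_eq!(fast_fcmp(1.0, 2.0), Ordering::Less);
    }
}
```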
The fast_* intrinsics are unfortunately mostly useless as they are now, since there are many different fast-math flags and these intrinsics just enable all of them. So it would be a service to Rust to push this area forward somehow. I'm not working on it myself, unfortunately, even though the most interesting direction to explore (to me) would be finding the safest subset of flags that still improves autovectorization for certain loops.
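For example, a reduction like the one below (a sketch, assuming nightly and the unstable core_intrinsics feature) typically only autovectorizes when the compiler may reassociate the additions, yet fadd_fast currently enables every fast-math flag rather than just reassociation:

```rust
#![feature(core_intrinsics)]

use std::intrinsics::fadd_fast;

// Sums a slice with fast-math addition so LLVM may reassociate
// (and therefore vectorize) the reduction.
// Safety: every operand and intermediate sum must be finite;
// a NaN or infinity here is undefined behavior with fadd_fast.
fn sum_fast(xs: &[f32]) -> f32 {
    let mut acc = 0.0f32;
    for &x in xs {
        acc = unsafe { fadd_fast(acc, x) };
    }
    acc
}

fn main() {
    let data: Vec<f32> = (0..1024).map(|i| i as f32).collect();
    println!("{}", sum_fast(&data));
}
```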
Okay, so it sounds like we need to wait for some higher-level facade on top of those intrinsics. Having those optimizations working would be really great not only for gaming but also for CPU-intensive numerical code. I know there's SIMD, but that's a bit of a different story. Thanks again - will keep track of this issue.