Inline assembly constant zero optimizations

parasyte · December 3, 2024, 10:31pm

This question came up on URLO the other day, and I haven't been able to find any discussions that address the need. The ask is for a peephole optimization that selects the constant-zero register on supported architectures when the register input expression evaluates to zero.

The best I was able to come up with was a declarative macro that can specialize when constant zero is its input. This has a shortcoming that expressions are not evaluated, so you get the less optimal instruction pair that assigns zero to a GPR ^[1]:

GCC is capable of evaluating the input expression for inline assembly:

Clang does not have support for this feature, AFAICT.

LLVM itself does constant zero optimizations, but that's the full extent of my knowledge in this area. I have no clue how the MIR -> LLVMIR pipeline handles inline assembly or what this kind of feature would look like from an implementation perspective. It appears to be desirable, though.

A procedural macro could hypothetically do the expression evaluation itself. That's a lot more work that duplicates functionality that already exists in the compiler. ↩︎

Topic		Replies	Views
[Pre-RFC]: Inline assembly language design	70	14077	March 25, 2019
Stabilization path for asm!()? language design	11	3314	March 25, 2019
[Pre-RFC #2]: Inline assembly language design	161	10693	March 15, 2020
Register attribute language design	29	2400	March 28, 2023
Inline LLVM IR ideas (deprecated)	5	5040	March 25, 2019

Inline assembly constant zero optimizations

Related topics