Slower code with "-C target-cpu=native"

burjui · September 2, 2022, 4:05pm

Thanks! Turns out, the culprit is AVX: when compiled with -C target-cpu=native -C target-feature=-avx, programs perform as good as without any flags or with -C target-cpu=generic, and they also run slower when compiled with -C target-feature=+avx. I was quite surprised to learn this. So either AVX is slow on my CPU (unlikely) or, more likely, LLVM has decided to use AVX in places where it doesn't bring any benefits, like for unrolling small loops.

Topic		Replies	Views
Policies around default CPU architecture compiler	20	5880	January 14, 2019
Suggestion for a low-effort way to take advantage of SIMD and other architecture specific tricks LLVM knows about internals	18	3236	June 8, 2017
Pre-RFC: Cargo Target Features cargo	20	8249	April 7, 2018
Getting explicit SIMD on stable Rust	335	47413	November 1, 2017
Better codegen for CPU feature detection	44	3471	January 12, 2026

Slower code with "-C target-cpu=native"

Related topics