Providing the compiler with semantic reasoning about code paths

Proposal: #[likely] / #[unlikely] Attributes for Branch and Layout Optimization

Summary

Rust currently provides a few coarse mechanisms to help the compiler reason about branch frequency — #[cold] on functions, core::intrinsics::{likely, unlikely} (nightly), and profile-guided optimization (PGO). These are useful, but they don’t scale elegantly to common patterns such as multi-arm match statements or nested if chains, where the programmer knows which paths are overwhelmingly dominant (“happy path”) and which are rare (“sad path”).
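
For concreteness, here is a minimal sketch of what those existing mechanisms look like in use today, assuming a nightly toolchain with the core_intrinsics feature gate (the function names are illustrative, not from the proposal):

#![feature(core_intrinsics)]

#[cold] // the whole function is treated as rarely executed
fn report_failure(code: u32) {
    eprintln!("operation failed with code {code}");
}

fn step(counter: u32) -> u32 {
    // The intrinsic returns its argument unchanged; it only attaches an
    // expectation (lowered to llvm.expect) to the branch condition.
    if core::intrinsics::unlikely(counter == u32::MAX) {
        report_failure(1);
        0
    } else {
        counter + 1
    }
}

fn main() {
    assert_eq!(step(41), 42);
    assert_eq!(step(u32::MAX), 0);
}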

I’d like to explore adding lightweight, first-class attributes such as:

if #[likely] cond { ... } else #[unlikely] { ... }

match result {
    Ok(v)      #[likely(0)]    => process(v),
    Retry      #[likely(1)]    => retry(),
    Timeout    #[likely(0.05)] => backoff(),
    Err(e)     #[unlikely]     => handle(e),
}

The goal is to let programmers express expected execution likelihood in a way the compiler can translate into existing LLVM/Cranelift metadata, improving branch prediction, code layout, cache locality, and pipeline efficiency, without affecting semantics.

Motivation

Programmers often have domain knowledge that static analysis and profiling can’t capture easily — for example:

  • Error or failure paths that occur <1% of the time.
  • Status enums where one or two arms dominate under normal operation.
  • Control loops where the “continue” case is typical and the “break” case is exceptional.

Today we can use intrinsics or code re-ordering tricks, but these are less expressive and inconsistent across if/match constructs.

Explicit hints would help:

  • Shape code layout (hot fall-through first, cold blocks out-of-line).
  • Drive branch-weight metadata (llvm.expect, !prof branch_weights).
  • Influence backend layout (ARM branch hint bits, x86 fall-through, etc.).
  • Improve I-cache and pipeline utilization by keeping hot paths contiguous.

Semantics (sketch)

  • Purely advisory; no effect on program logic.
  • Accepted forms:
      • #[likely] → hot path.
      • #[unlikely] → cold path.
      • #[likely(p)] → numeric hint (p as ordinal or probability).
  • Unannotated arms/branches are neutral.
  • Compiler lowers hints to MIR metadata → backend branch weights → target-specific layout and hints.
  • PGO or JIT data override static hints when available.

Implementation considerations

  • On LLVM backends, this maps cleanly to llvm.expect or !prof branch_weights.
  • On Cranelift, similar metadata could be attached to control-flow edges.
  • On x86, this affects block ordering and alignment; on architectures that encode static branch hints (e.g. PowerPC), it could also set the “taken/not-taken” hint bits.
  • Can interact naturally with existing #[cold] and inlining heuristics.
  • #[likely]/#[unlikely] on match arms could influence both ordering and outlining decisions.
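
For comparison, a common workaround on stable Rust today is to route the rare branch through an empty #[cold] function so the optimizer treats that path as cold. A minimal sketch, with illustrative names:

// A stable-Rust approximation of #[unlikely]: calling an empty #[cold]
// #[inline(never)] function from the rare branch marks that path cold for
// the optimizer.
#[cold]
#[inline(never)]
fn cold() {}

fn parse_digit(byte: u8) -> Option<u8> {
    if byte.is_ascii_digit() {
        Some(byte - b'0') // hot fall-through
    } else {
        cold(); // hint: this branch is expected to be rare
        None
    }
}

fn main() {
    assert_eq!(parse_digit(b'7'), Some(7));
    assert_eq!(parse_digit(b'x'), None);
}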

Example

#[cold]
fn handle_error(e: Error) { log::error!("{:?}", e); }

fn process(x: i32) -> Result<i32, Error> {
    if #[likely] x >= 0 {
        Ok(x + 1) // hot fall-through
    } else #[unlikely] {
        handle_error(Error::BadArg);
        Err(Error::BadArg)
    }
}

Even on x86 (which has no explicit static hint bits), this would encourage contiguous layout of the hot path and push the error handler into a cold section. On targets whose ISAs do provide static branch-hint encodings, the corresponding hint bits could be emitted as well.

Open questions

  1. Should numeric weights (#[likely(0.9)]) be supported, or only binary hints?
  2. How would this interact with #[cold] and inlining thresholds?
  3. Should unannotated arms remain neutral, or implicitly less likely?
  4. What’s the best MIR representation (block metadata vs. edge attributes)?
  5. Is there a feasible path to make this work uniformly across LLVM, Cranelift, and GCC backends?

Why discuss now

LLVM and Cranelift already support branch-weight metadata. Rust developers frequently reach for core::intrinsics::likely/unlikely, but a first-class attribute syntax could make this idiomatic, stable, and portable — especially valuable for embedded and high-performance workloads where layout matters as much as prediction.

Would the language or compiler teams be open to exploring this direction, perhaps starting as an experimental -Z likely-attributes feature?

Links / background

  • #[cold] attribute docs
  • core::intrinsics::likely, unlikely
  • LLVM llvm.expect intrinsic
  • LLVM PGO and BOLT documentation

Closing

This proposal isn’t about micromanaging the predictor; it’s about giving the compiler better priors so it can produce straighter, more cache-friendly instruction streams. Feedback from the compiler and language teams on feasibility, naming, and potential pitfalls would be greatly appreciated.

Note: this markdown was generated by ChatGPT based on our discussions.

I don't see any mention of core::hint::{likely, unlikely}. They're obviously only applicable to if conditions (not match arms, etc.), but they're also not intrinsics.

Also cold_path
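
A minimal sketch of how those hint functions read in use, assuming a nightly toolchain; the feature-gate names below are my best recollection and may be unnecessary on toolchains where these hints have been stabilized:

#![feature(likely_unlikely)]
#![feature(cold_path)]

use std::hint::{cold_path, unlikely};

fn checked_increment(x: u32) -> Option<u32> {
    // unlikely() wraps the condition of an `if`; it cannot annotate match arms.
    if unlikely(x == u32::MAX) {
        // cold_path() marks the enclosing path as cold, much like calling an
        // empty #[cold] function.
        cold_path();
        None
    } else {
        Some(x + 1)
    }
}

fn main() {
    assert_eq!(checked_increment(1), Some(2));
    assert_eq!(checked_increment(u32::MAX), None);
}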

On match, I'd prefer something that doesn't require annotating every arm.

In practice there are a few strategies to implement match: a lookup table (if you're lucky), a linear scan that favors early items in the existing order, or bisection that assumes equal probability of all items. And they could be mixed for subsets of the range.

So match just needs a choice of the strategy or an approximate probability distribution function.
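
To make those strategies concrete, here is a purely illustrative, hand-written version of each for a four-way dispatch (not what rustc literally emits; all names are made up):

fn handle0() -> u32 { 10 }
fn handle1() -> u32 { 11 }
fn handle2() -> u32 { 12 }
fn handle3() -> u32 { 13 }

// 1. Linear scan: earlier cases are cheaper, so ordering arms most-likely-first helps.
fn dispatch_linear(x: u8) -> u32 {
    if x == 0 { handle0() }
    else if x == 1 { handle1() }
    else if x == 2 { handle2() }
    else { handle3() }
}

// 2. Lookup table: every case costs one indexed (indirect) call, regardless of likelihood.
fn dispatch_table(x: u8) -> u32 {
    const TABLE: [fn() -> u32; 4] = [handle0, handle1, handle2, handle3];
    TABLE[(x & 3) as usize]()
}

// 3. Bisection: O(log n) comparisons, but no single arm is favored.
fn dispatch_bisect(x: u8) -> u32 {
    if x < 2 {
        if x == 0 { handle0() } else { handle1() }
    } else if x == 2 {
        handle2()
    } else {
        handle3()
    }
}

fn main() {
    for x in 0..4u8 {
        assert_eq!(dispatch_linear(x), dispatch_table(x));
        assert_eq!(dispatch_linear(x), dispatch_bisect(x));
    }
}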


This idea does not require annotating every arm, only as many as you feel are useful. If you don't annotate an arm, it stays neutral. In practice, it might make sense to annotate only one arm, depending on the context.

Can you suggest other strategies with match for indicating likelihood?

What does neutral mean?

I think in the current implementation, the compiler emits the equivalent of a linear if/else-if/else chain for all arms, which allows you to assume that the first match arm will be as fast as possible (implicitly assuming a power-law distribution of likelihoods). It may optimize to a table anyway, but if not, it will still check the first arm first, so you can order arms most-likely-first. That's O(1) to O(n).

The problem is that the compiler could try to use something fancier instead, like bisection (check whether the value is in the first or second half of the arms, recursively, which may require sorting the arms). It might be faster overall, at O(log n), but the first couple of arms would no longer be guaranteed to be the fastest.

I presume that in practice, probabilities of match arms matching will be somewhere between a power law (where there's a most common value dominating) and an even distribution (random, unpredictable).

So I'm not sure how I would annotate all arms as equally likely, given that the current implementation doesn't treat them as equally likely. If I annotate one arm as likely, does it make all other arms equally less likely?


Neutral means unweighted.

Yes, the implication is that if you annotate one arm as likely, it makes all other arms equally less likely. That would be my natural intuition.

I want to stress these are not directives to the compiler; they are 'hints'.

We are saying, "as a programmer, this is what I believe is the most likely path(s) the code will take, but you, the compiler, may know more than I do. Hopefully, the CPU Branch Prediction Logic will not undermine both of us."

If Rust ever implements this, it would be interesting to perform a performance test to see if it makes a difference.

TBH, I don't know how much it would actually matter, because of all the other things that already exist and mostly cover it:

  • #[cold] so that any path leading to a panic (for example) is automatically unlikely without needing any annotation on the branch
  • std::hint::select_unpredictable for the places where it's genuinely neither likely nor unlikely, and you want to avoid branchiness (see the sketch after this list)
  • PGO for having better information than the programmer's intuition anyway about which way the branches usually go
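
For the select_unpredictable point above, a minimal sketch, assuming a toolchain recent enough that it is stable:

use std::hint::select_unpredictable;

// select_unpredictable evaluates both values eagerly and encourages the
// compiler to emit a branchless select (e.g. a conditional move) instead of
// a branch the predictor would keep mispredicting.
fn abs_branchless(x: i32) -> i32 {
    // The sign of x is assumed to be essentially random here, so neither
    // #[likely] nor #[unlikely] would apply; avoiding the branch is the
    // better tool.
    select_unpredictable(x < 0, x.wrapping_neg(), x)
}

fn main() {
    assert_eq!(abs_branchless(-5), 5);
    assert_eq!(abs_branchless(7), 7);
}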

TBH, most of the likely/unlikely annotations I see are cargo-culted in ways where it's not obvious at all whether they helped; they often hurt things like inlining and other compiler optimizations, so they can just make things worse.
