Include racy reads in Rust memory model with `MaybeInvalid<T>`

RalfJung · May 31, 2026, 6:26pm

You are describing something more akin to symbolic execution than angelic choice. Symbolic execution is indeed a way to approximate or even implement angelic choice. But having that in an actual program run (rather than in a program analyzer) requires explicitly adding symbolic state to the program state, which in turn affects everything else that reasons about the program state.

For ptr2int casts specifically, the PNVI-ae-udi model in C does pretty much that: the "udi" mechanism introduces a symbolic form of provenance that approximates the angelic semantics. Sadly it does not easily work with more complicated models such as Stacked/Tree Borrows, or even with restrict in C.

ProgramCrafter · May 31, 2026, 8:55pm

When the check is moved inside, then there is effectively equivalent Rust code which explains what happens:

use std::hint::black_box;
use std::sync::RwLock;

struct SeqlockStory<T>(RwLock<T>);

impl<T: Clone + Default> SeqlockStory<T> {
    fn load(&self) -> T {
        if let Ok(v) = self.0.try_read() { return v.clone(); }
        return black_box(T::default());  // outer code can't rely on
                                         // the exact returned value
    }
    fn store(&self, value: T) {
        *self.0.write().unwrap() = value;
    }
}

RalfJung · May 31, 2026, 9:10pm

black_box is a NOP opsem-wise, so it doesn't ever make sense in a story. It's either unnecessary in your story or the story doesn't quite work.

Also what's try_read? You can't just access MiniRust here. As mentioned above, the correctness argument for the "story" approach relies on the story being expressible in regular Rust code.

programmerjake · May 31, 2026, 10:04pm

RalfJung · June 1, 2026, 8:08am

Oh sorry, I missed the RwLock. That has a different size than T so this doesn't literally work, but I guess we could imagine that the RwLock state is stored out-of-band.

The return T::default() is wrong though since the asm code may return a different value.

And generally that story is pretty far from what the asm actually does so arguing that the asm is a refinement of this code is far from trivial. For instance, the story code performs a bunch of synchronization (the try_read internally does an "acquire"), while the asm code doesn't do the same synchronization -- but for the story to be valid the asm code has to perform at least as much synchronization as the story.

ProgramCrafter · June 1, 2026, 2:07pm

True. There should be some source of nondet (demonic permitted)... well for any T that can be constructed from byte array (like std::vec::Vec<u8>, impl bytemuck::AnyBitPattern or impl arbitrary::Arbitrary) there is one: allocate a few blocks and use the pointer addresses to generate the instance.

I believe the asm code must do similar synchronization (including the Release fence between data read and check if someone wrote in parallel, which happens here on automatic unlock), and will leave that question to whoever writes the actual implementation.

Notably, SeqLock is also T plus a counter integer so it should work out.

RalfJung · June 1, 2026, 3:00pm

Oh you were imagining the inline asm to contain the entire seqlock, not just the racy read and write? Okay that changes things, I didn't think that variant through yet.

ProgramCrafter · June 2, 2026, 2:58pm

More precisely, I should've written the equivalent as follows: Rust Playground.

Though I do like the bytewise-atomic memcpy more than this justification. (If only it was defined behavior on hardware specification side...)

RalfJung · June 2, 2026, 3:28pm

Atomic bytewise memcpy (in its most restricted form, that is still sufficient for SeqLocks) does not require mixed-size accesses, so there's no problem from the hardware side.

ais523 · June 2, 2026, 6:47pm

It's worth noting that (in a previous program that I'm no longer working on) I ended up with a use case for after-the-fact-verfieid racy reads, seqlock style, for which the atomic-bytewise approach wouldn't work: I was writing an async-signal-safe memory allocator (i.e. it can allocate from a signal handler, even if the signal interrupted a use of the allocator), which stored its freelist in the free allocations themselves. Allocating would work like this:

Read the pointer to the head of the freelist;
Read the memory it points to in order to discover the next entry of the freelist – the pointer found there will be the new head of the freelist if step 3 succeeds;
Atomically swap the head of the freelist with the pointer that was read in step 2, only if the head of the freelist has not changed since step 1 (not even changed and changed back, to avoid the ABA problem).

Because the allocator is async-signal-safe, it is possible that, between step 1 and 2, a signal handler (or another thread) will allocate the first block of memory on the freelist. In this situation, step 2 will read from memory that has already been allocated and is already in use by the signal handler or other thread. The result will then be discarded in step 3 (due to the head of the freelist having changed) and will not be used for anything, but it was still actually read at the hardware level. Additionally, you can't rely on the writing code to use only atomic writes to the memory, because in this case the memory has been returned as an apparently normal allocation.

I no longer actually need to use this style of memory allocator for anything (I changed other parts of my design so that the ability to do this sort of thing wasn't necessary), but maybe someone else does. As such, I would if possible prefer a model which makes this sort of code legal, not just seqlocks.

Topic		Replies	Views
Loads and stores to/from outside the memory model language design	35	3980	June 29, 2019
Bit-wise reasoning for atomic accesses Unsafe Code Guidelines	38	4148	February 24, 2019
`unordered` as a solution to "Bit-wise reasoning for atomic accesses" Unsafe Code Guidelines	23	3992	October 15, 2019
Add volatile operations to core::arch::x86_64 language design	20	2448	July 28, 2019
How does inline assembly and the physical machine fit into the abstract machine model, or exist outside of it? language design	39	2580	May 9, 2025

Include racy reads in Rust memory model with `MaybeInvalid<T>`

Related topics