How to generate MIR into a basic block to read-increment-write to a constant address?

Acciente717 · July 21, 2024, 7:36pm

I am trying to interpose the drop shim code so that my runtime can know if a thread is currently executing inside a drop handler. My plan is to let each drop handler perform a read-increment-write to a fixed memory address before executing the drop handler body, and similarly a read-decrement-write after the body finishes execution. A thread is inside a drop handler if the counter is positive.

I believe the build_drop_shim function in the MIR stage is where I should make the modification. I am experimenting with adding additional basic blocks and generating MIR instructions. The following is my attempt to generate the read-increment-write operation. However, I encountered errors when trying to make a fixed address (0x2000_0000) a Place that I can load from and store to. My target is a 32-bit ARM microcontroller.

let addr = 0x2000_0000u32;
let addr_constant = ConstOperand {
    span,
    user_ty: None,
    const_: Const::Val(ConstValue::Scalar(Scalar::from_u32(0x2000_0000u32)), tcx.types.u32),
};
let addr_operand = Operand::Constant(Box::new(addr_constant));
let addr_place = Place::from(addr_operand); // <- Error

// Make a local variable.
let local_place = Place::from(Local::from_u32(1));

// Read from 0x2000_0000.
let load_rvalue = Rvalue::Use(Operand::Copy(addr_place));
body[prologue_block].statements.push(Statement {
    source_info,
    kind: StatementKind::Assign(Box::new((local_place, load_rvalue))),
});

// Increment the local variable.
let increment = Rvalue::BinaryOp(
    BinOp::Add,
    Box::new((
        Operand::Copy(local_place),
        Operand::Constant(Box::new(ConstOperand {
            span,
            user_ty: None,
            const_: Const::Val(ConstValue::Scalar(Scalar::from_u32(1)), tcx.types.u32),
        })),
    )),
);
body[prologue_block].statements.push(Statement {
    source_info,
    kind: StatementKind::Assign(Box::new((local_place, increment))),
});

// Write back to 0x2000_0000.
let store = Rvalue::Use(Operand::Copy(local_place));
body[prologue_block].statements.push(Statement {
    source_info,
    kind: StatementKind::Assign(Box::new((addr_place, store))),
});

I hope someone could enlighten me about how to represent the fixed address 0x2000_0000 as a Place.

CAD97 · July 21, 2024, 9:34pm

The best way to structure this is probably to have a #[lang] item static which you can utilize and use a linker script to ensure the static's address is what you intend. Getting the place should then just be a place mention of the lang item static.

Otherwise, you probably want to look at PlaceBuilder.

pitaj · July 21, 2024, 9:35pm

Also - I don't know MIR but you probably want this to be handled by volatile memory operations.

programmerjake · July 22, 2024, 9:34am

if it's static but not thread-local, you'd need atomic operations, not just volatile. too bad rust doesn't yet have atomic volatile operations...

pitaj · July 22, 2024, 2:27pm

This sounds like a single-threaded case to me

Acciente717 · July 22, 2024, 8:17pm

Thank you for your suggestion. I am working with the compiler version 1.75.0, and it looks like #[lang = "static"] is not available.

Nevertheless, now it seems like declaring an external global variable __VAR and perform the read-increment-write from and to __VAR would be the easiest.

However, I am still struggling to create a Place that represents the extern global variable. I hope you could shed some lights. Many thanks!

Acciente717 · July 22, 2024, 8:23pm

Thanks and I should definitely use volatile operations. But just to get a working (non-crashing) version I will start with normal read/write first. Seems like I need to call intrinsic functions to perform volatile read/write which will complicate the code.

Acciente717 · July 22, 2024, 8:23pm

Theoretically the counter is thread-local. When the runtime system performs context switch, it will also switch the counter value. There is also only one core on the microcontroller. For my use case it is fine to not use atomic load/store.

CAD97 · July 22, 2024, 8:31pm

To be clear, I'm suggesting adding a new language item^[1]. You're writing a bespoke MIR transform, so you're modifying the compiler toolchain already, and adding the language item is the cleanest way to get a known-to-the-compiler location.

If that's impossible for some reason, a "pretend" lang item which is used by unmangled name instead of the #[lang] glue is also doable, but at that point isn't all that different from a hardcoded magic address.

Thinking about it once again, though, it actually might be simpler to emit calls to two functions e.g. rust_enter_drop_glue and rust_exit_drop_glue instead of codegenning the prolog/epilog directly. Then any tracking is just (almost^[1:1]) straightforward Rust code, and you can set up a count in the same way manner that the panic count is tracked.

It would need to be careful to be free of any nontrivial destructors to avoid endless recursion. ↩︎ ↩︎

Acciente717 · July 22, 2024, 8:54pm

Emitting two functions calls looks like a simpler way to go. But I have a follow up question: How can I let the compiler assume that there are two externally defined function available? To make a function call, the compiler requires a DefId which I don't know where to get.

system · October 20, 2024, 8:54pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to interpose drop handlers? compiler	5	549	February 8, 2024
Custom MIR -- howto add a PlaceAlias instruction compiler	3	332	October 7, 2024
[MIR] constant evaluation compiler	74	15215	March 25, 2019
Making the compiler interface on-demand driven compiler	6	2015	April 15, 2019
Generating custom MIR(`std::intrinsics::mir`) from MIR compiler	3	485	September 28, 2023

How to generate MIR into a basic block to read-increment-write to a constant address?

Related topics