Annotate unsafe code in LLVM IR

priyasiddharth · February 3, 2021, 1:19am

Hello,

I am building a static analysis tool which gets LLVM IR as input. I want to know which Basic blocks in the IR map to an unsafe scope in the source program.

AFAICT this information is not present in LLVM IR and doesn't seem to be present in MIR also. I am looking for

Is there a known method to retain this information in LLVM IR?
If nothing exists, how might I might go about mapping an unsafe code block/function to it equivalent LLVM IR?

thanks Siddharth

gbutler · February 3, 2021, 1:33am

Seems like you could get this from debug info by mapping from LLVM IR back to source code lines.

bjorn3 · February 3, 2021, 6:57am

It is available at mir.source_scopes[source_scope].local_data.safety. Unsafeck has run on MIR instead of HIR or AST for a while now. There is a proposal to move it to THIR though.

priyasiddharth · February 3, 2021, 10:30pm

thank you both for the info!

DebugInfo should work. Specifically, following DILocation -> DILexicalBlock entries will get me the enclosing "unsafe" scope. Ideally, I only want to read the LLVM IR file and not the accompanying source. The solution here would be to add an "unsafe" attribute to a LexicalBlock debug info when it is generated by the compiler. Something similar will be needed for unsafe functions.

I see some properties of C++/FORTRAN functions can be marked with spFlags.

Can unsafe functions or unsafe code blocks be marked similarly? Thoughts?

scottmcm · February 3, 2021, 11:57pm

That sounds like a good idea to me. I have no idea what the right way to put this in DWARF would be, though

(Maybe Rust should request a DW_AT_unsafe flag, like Fortran has things like DW_AT_recursive? Or since it's not just the one of the function would it be DW_TAG_unsafe? Or until then maybe it would make sense to set the DW_AT_name to "unsafe" on the lexical block? But today's the first time I've looked at the standard, so I could likely be completely wrong...)

priyasiddharth · February 4, 2021, 2:47pm

The above looks like a good interim solution to me. Do we need agreement before code is written? What is the right forum/person/team to discuss this further?

scottmcm · February 4, 2021, 5:41pm

I don't know the appropriate people on compiler to ask for this. Maybe post in https://rust-lang.zulipchat.com/#narrow/stream/182449-t-compiler.2Fhelp if you don't get a response here.

system · May 5, 2021, 5:42pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Marking Unsafe Rust Blocks for LLVM-IR compiler	1	808	November 2, 2021
Unsafe code annotations Unsafe Code Guidelines	16	1999	April 21, 2019
Rust Unsafe Information in MIR Code compiler	2	703	November 24, 2022
Extending the unsafe keyword language design	45	2384	September 13, 2024
Threading rust attributes to LLVM annotations compiler	3	1247	May 9, 2019

Annotate unsafe code in LLVM IR

Related topics