Exploit the padding?

Soni · June 27, 2021, 6:27pm

Not everything needs to be fast. For example if your priority is embedded systems with kilobytes of RAM.

Not particularly. It would break unsafe code that relies on padding. That code should be audited before using the relevant compile-time flags.

SkiFire13 · June 27, 2021, 7:30pm

Like anything that uses std::ptr::copy and similar functions? It claims it "Copies count * size_of::<T>() bytes from src to dst", and that includes padding bytes. This means that under your proposal the current implementations of std::mem::replace & co are unsound.

InfernoDeity · June 27, 2021, 8:23pm

I work with the SNES, which has 128kiB of WRAM normally (more if you exploit the fact the cartidge has control over most of the address space, and add on-cartridge DRAM). In this case, the alignment of u32 is 2, and the ((u32,u8),u8) has a total of two padding bytes. To move the inner (u32,u8) costs 3 16-bit moves, or 2 16-bit moves and 1 8-bit move. Changing the size of moves costs 3 cycles and saves 1 per access (thus, if m=0, so memory accesses are 16-bit, the former is two cycles faster, and if m=1, so memory accesses are 8-bit, the latter is two cycles faster iff the 8-bit move occurs first; note: this is an oversimplification). I'd gladly trade those two bytes for those two cycles. I can certainly understand when that may not be the case, though.

Soni · June 27, 2021, 9:01pm

std::ptr::copy is generic and can thus be monomorphized to take padding into account.

it's fine.

SkiFire13 · June 27, 2021, 9:11pm

That would make it do something different than what it claims to do. It doesn't say it will copy count Ts, it says it will copy count * size_of::<T>() bytes, which is different because it implies it will copy padding bytes.

Soni · June 27, 2021, 11:01pm

Sometimes breaking changes are necessary.

Thankfully this one is meant to be opt-in, which makes it not really breaking.

... Altho maybe a better place to make this change would be a target 'triplet'. x86_64-pc-linux-gnu-packed ABI?

Btw: (((u32, u8), u16), u8) is not covered by size+stride, but is covered by padding.

Aaron1011 · June 27, 2021, 11:41pm

It cannot, because there is no requirement that the type used with std::ptr::copy be the same as the type that was originally used.

This code runs successfully under Miri:

use std::mem::MaybeUninit;

fn bytewise_copy<T>(src: &mut T, dst: &mut T) {
    let src_ptr: *mut MaybeUninit<u8> = src as *mut T as *mut _;
    let dst_ptr: *mut MaybeUninit<u8> = dst as *mut T as *mut _;
    unsafe { std::ptr::copy(src_ptr, dst_ptr, std::mem::size_of::<T>()); }
}

fn main() {
    let mut src: (u32, u8) = (10, 20);
    let mut dst: (u32, u8) = (30, 40);
    bytewise_copy(&mut src, &mut dst);
    println!("Dest: {:?}", dst);
}

If dst was a pointer into a ((u32, u8), u16), then bytewise_copy would end up overwriting the value of the u16, since it will overwrite the padding bytes used to store the u16. The only type that std::ptr::copy is told about is MaybeUninit<u8> - so even if we wanted to change its behavior, we couldn't, since we don't have the necessary information available when the function is monomorphized.

Yes, but only to fix soundness bugs, or as part of specifying previously underspecified areas of the compiler (e.g. std::mem::uninitialized with uninhabited types and non-zero-initializeable types). This case is neither of those.

Soni · June 28, 2021, 12:36am

Wait isn't padding UB today?

Anyway, what's wrong with forking the stdlib for this one feature? A critical feature that every web browser written in Rust should use.

Aaron1011 · June 28, 2021, 1:40am

I'm not sure what you're asking. The existence of padding is exposed via std::mem::size_of, and it's safe to read uninitialized bytes as MaybeUninit.

If you mean literally forking the rust-lang/rust repo and modifying the standard library (and trying to distribute that in some way), nothing is stopping you.

This is a pretty sweeping claim - do you have any benchmarks showng that this feature will give the performance gains that you think it will?

Soni · June 28, 2021, 2:29am

No we mean having two slightly different stdlibs, and picking between them based on target triplets.

In particular, having ptr::read be aware of padding in one of them.

It should lower RAM usage by half. The performance hit is more than worth it. It'll be more than made up for by not hitting swap.

(would definitely be interesting if gcc-rust had a GNU extension for packed structs. the -ffast-math of Rust layout optimizations.)

Aaron1011 · June 28, 2021, 3:32am

I posted a concrete example of code that copies data one byte at a time, with no knowledge of the 'actual' type. How can that existing code be compliled to make your proposal work?

Where are you getting 'half' from?

I would be very surprised if any browser frequently required swap due to exhausting RAM. Do you have any concrete data about how often this happens?

Soni · June 28, 2021, 3:41am

You break it. Opt-in, ofc.

Yeah we don't have much RAM. The swap is almost always at least 25% full. At least the system isn't entirely unusable because of it, after a lot of tuning. But still not great.

notriddle · June 28, 2021, 5:08am

Do you have profiles showing that a large fraction of real-world web browser RAM usage is going to struct padding?

SkiFire13 · June 28, 2021, 6:59am

Letting aside the fact that if you end up with a type like that you probably have bigger problems, can't you rewrite your program so that it uses an equivalent type but without padding? A bit of tuning on the most used structs should take much less effort than rewriting the stdlib and any other crate to be compatible with your proposal. Not to mention the ecosystem split and the double effort people will have to make to support both ways.

atagunov · June 28, 2021, 11:33am

Would I be right to summarize thinking as

Option8 - group discriminants together
Zero padding - use niches
size/stride - remove padding

?

E.g. three very different but related ideas?

Soni · June 28, 2021, 11:44am

This one is different from all of those tbh. None of those handle (((u32, u8), u16), u8) the way we want.

mathstuf · June 28, 2021, 1:49pm

That's fine, but why must you spell your type (((u32, u8), u16), u8)? Is there really no other spelling for that type that makes sense?

InfernoDeity · June 28, 2021, 5:19pm

rust already has repr(C,packed) that has that behaviour. I'd hope that gcc-rs would support that, and not some random extension to achieve the same.

InfernoDeity · June 28, 2021, 5:27pm

Also, if this applies to builtin types like tuples, this would also affect impls that provide stable abi guarantees, such as GitHub - LightningCreations/lccc: Lightning Creations Compiler Frontend for various languages. Either this has to become the default (which is a breaking change as mentioned above), or the abi changes with compiler flags/options outside of the two abi-control options -Z repr-rust-layout and -Z build-abi. Both of these are well beyond what I would want to implement, especially since it would likely apply to things like TokenStream that crosses the boundary between the compiler and compiled rust code, potentially breaking the knowledge I otherwise posses about the guaranteed layout of the type.

Soni · June 28, 2021, 5:56pm

Because that's how you'd usually lay out your types, unless you happen to only ever use integers/floats directly, or objects with guaranteed no padding.

You wouldn't write them literally like that, but you would build them up like that, with generics, etc.

Topic		Replies	Views
Reordering of writes via differently-typed pointers Unsafe Code Guidelines	14	2019	March 25, 2019
Pre-RFC: mem::trailing_padding!	13	307	October 10, 2024
Writing down binary data... with padding bytes	69	6440	February 21, 2020
Make a way to avoid UB when accessing byte representation of type with padding	6	1062	November 15, 2020
Pre-RFC: Allow array stride != size language design	34	3007	January 16, 2023

Exploit the padding?

Related topics