Why don't ZSTs have unique addresses?

Pr0methean · May 29, 2024, 5:36pm

If every ZST had a different address, then fat pointers could be checked for equality as though they were thin pointers, which would be faster. (The only exception would be where one pointer might be a partial borrow of the other -- either a slice prefix or a struct field with offset zero -- but the pointer types would rule that out in most cases.) So why doesn't Rust do this?

pitaj · May 29, 2024, 5:46pm

How are fat pointers related to ZSTs? Are you talking about each ZST instance, or each different ZST type? How would you give each ZST a unique address?

Counterexample:

let x = vec![(); usize::MAX];
let y = vec![(); usize::MAX];
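
A minimal sketch of why this is a counterexample, using a much smaller length so it does not depend on how `vec!` handles huge counts. Elements of a `Vec<()>` are `size_of::<()>() == 0` bytes apart, so they all sit at one address and there is nowhere to put a unique address per element. The helper name `all_same_address` is made up for illustration.

```rust
use std::mem::size_of;

/// Returns true if every element of the slice has the same address.
/// (Hypothetical helper for this sketch, not a std API.)
fn all_same_address(items: &[()]) -> bool {
    let first = items.as_ptr();
    items.iter().all(|item| std::ptr::eq(item, first))
}

fn main() {
    let x = vec![(); 1_000];

    // `()` occupies no bytes, so consecutive elements are 0 bytes apart.
    assert_eq!(size_of::<()>(), 0);

    // 1,000 separate values, one shared address.
    assert!(all_same_address(&x));
    println!("{} unit values share address {:p}", x.len(), x.as_ptr());
}
```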

kornel · May 29, 2024, 5:57pm

That would make them non-zero-size:

struct OST {
   a: (),  b: (),
}
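
To illustrate the point, here is a small sketch (the names `TwoUnits` and `fields_share_address` are invented for this example). With current rustc the struct is zero-sized, so both fields necessarily share one address; giving each field its own address would force the struct to occupy at least one byte.

```rust
use std::mem::size_of;

struct TwoUnits {
    a: (),
    b: (),
}

/// Hypothetical helper: do the two fields live at the same address?
fn fields_share_address(v: &TwoUnits) -> bool {
    std::ptr::eq(&v.a, &v.b)
}

fn main() {
    let v = TwoUnits { a: (), b: () };

    // A struct made only of zero-sized fields is itself zero-sized...
    assert_eq!(size_of::<TwoUnits>(), 0);

    // ...so there is only one address available for both fields.
    assert!(fields_share_address(&v));
}
```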

C++ guaranteed address identity, and to get ZSTs it had to add an attribute to opt out of that:

https://en.cppreference.com/w/cpp/language/attributes/no_unique_address

Additionally, many LLVM optimizations require that a value's address is not exposed, so pointer comparisons can be expensive and inhibit optimizations.

Rust's niche optimizations for things like Result<()> would be much more difficult if that () had to have an identity.
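
As a rough illustration of the niche point (a sketch, with `result_unit_size` being a made-up helper): because `()` carries neither data nor identity, `Ok(())` can be encoded in a bit pattern the error type can never hold, so the `Result` is no larger than the error alone.

```rust
use std::mem::size_of;
use std::num::NonZeroU32;

/// Hypothetical helper: size in bytes of `Result<(), E>`.
fn result_unit_size<E>() -> usize {
    size_of::<Result<(), E>>()
}

fn main() {
    // `Ok(())` is stored as the all-zero pattern that `NonZeroU32` forbids,
    // so no extra tag byte and no storage for `()` is needed.
    assert_eq!(result_unit_size::<NonZeroU32>(), size_of::<u32>());

    // Same idea with the null pointer that `Box` can never be.
    assert_eq!(result_unit_size::<Box<u8>>(), size_of::<usize>());
}
```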

BTW, today I've stumbled into a rabbit hole of stack pointer comparisons in LLVM. LLVM has two contradictory features:

1. Assumes that every stack variable has a unique address
2. Has an optimization pass that eliminates stack copies and reuses stack space, breaking assumption 1.

github.com/rust-lang/rust

Miscompilation: Equal pointers comparing as unequal

opened 01:37AM - 13 Feb 23 UTC

JakobDegen

A-LLVM P-high T-compiler I-unsound C-bug WG-llvm

I [tried this code](https://play.rust-lang.org/?version=stable&mode=release&edit…ion=2021&gist=9078cfb7b98e213bb02373f54e8b19b2):

```rust
pub fn foo() -> (usize, usize, bool) {
    let a: *const u8;
    let b: *const u8;
    {
        let v: [u8; 16] = [std::hint::black_box(4); 16];
        a = &(v[0]);
    }
    {
        let v: [u8; 16] = [std::hint::black_box(4); 16];
        b = &(v[0]);
    }
    (a as usize, b as usize, a == b)
}

fn main() {
    println!("{:?}", foo());
}
```

I expected to see this happen: Either the pointers (when cast to integers) are the same and the comparison is `true`, or they are not the same and the comparison is `false`.

Instead, this happened: It printed: `(140728325198984, 140728325198984, false)`

[Upstream LLVM issue](https://github.com/llvm/llvm-project/issues/45725)

### Meta

Reproduced via `rustc +nightly -Copt-level=3 test.rs && ./test`.

`rustc --version --verbose`:
```
rustc 1.69.0-nightly (5b8f28453 2023-02-12)
binary: rustc
commit-hash: 5b8f284536d00ba649ca968584bedab4820d8527
commit-date: 2023-02-12
host: x86_64-unknown-linux-gnu
release: 1.69.0-nightly
LLVM version: 15.0.7
```

Also reproduces on master.

@rustbot label +I-unsound +T-compiler +A-llvm

Pr0methean · May 29, 2024, 6:09pm

I'm talking about dyn pointers - they might point to ZST singletons which have the same address because they have the same alignment, but then have different vtable pointers.

pitaj · May 29, 2024, 6:33pm

Might be helpful to give an example. Are you talking about something like this?

struct Foo;
struct Bar;

trait Run {}

impl Run for Foo {}
impl Run for Bar {}

let v: Vec<Box<dyn Run>> = vec![Box::new(Foo), Box::new(Bar), Box::new(Foo)];

let a: &dyn Run = &*v[0];
let b: &dyn Run = &*v[1];
let c: &dyn Run = &*v[2];

assert!(!ptr_addr_eq(a, b));
assert!(ptr_addr_eq(a, c)); // or maybe even these should be different

quinedot · May 29, 2024, 6:46pm

You can't count on vtable pointer equality or inequality either.

note that comparing trait object pointers (*const dyn Trait) is unreliable: pointers to values of the same underlying type can compare inequal (because vtables are duplicated in multiple codegen units), and pointers to values of different underlying type can compare equal (since identical vtables can be deduplicated within a codegen unit).
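
A hedged sketch of the situation under discussion, assuming Rust 1.76 or later for `std::ptr::addr_eq`. The `Pair` struct, the `name` method, and the `same_address` helper are invented for the example. Two zero-sized values of different types end up at the same data address, yet behave differently through their vtables, so an address-only comparison cannot tell them apart, and the quote above explains why the vtable half of the pointer cannot be relied on either.

```rust
struct Foo;
struct Bar;

trait Run {
    fn name(&self) -> &'static str;
}

impl Run for Foo {
    fn name(&self) -> &'static str {
        "Foo"
    }
}

impl Run for Bar {
    fn name(&self) -> &'static str {
        "Bar"
    }
}

// Both fields are zero-sized, so they sit at the same address inside `Pair`.
struct Pair {
    foo: Foo,
    bar: Bar,
}

/// Compares only the data addresses and ignores the vtable pointers.
fn same_address(a: &dyn Run, b: &dyn Run) -> bool {
    std::ptr::addr_eq(a, b)
}

fn main() {
    let pair = Pair { foo: Foo, bar: Bar };
    let a: &dyn Run = &pair.foo;
    let b: &dyn Run = &pair.bar;

    // Same data address...
    assert!(same_address(a, b));
    // ...but different dynamic types and different behavior.
    assert_ne!(a.name(), b.name());
}
```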

Pr0methean · May 29, 2024, 11:45pm

@quinedot That's fine, because the deduped vtables would have to code for the same behavior. For my purposes, equivalent singletons are equal.

Pr0methean · May 29, 2024, 11:46pm

Yes, that's what I'm talking about, except that the trait would have a polymorphic method.

kpreid · May 30, 2024, 3:02pm

If it were to be required that all zero-sized types have unique addresses for their instances, then:

The compiler and linker would have to somehow assign these unique addresses, when the type is monomorphized if it is generic, even though that happens in separate compiler invocations when compiling multiple crates.
There would have to be a special case that whenever a ZST pointer is produced from a struct, slice, stack variable, etc. its pointer value is replaced with the unique one.

(I suppose this is not strictly necessary, because one could say that only literals of the ZST have this unique address, but then it's much less useful as a guarantee. As I see it, an important factor of how ZSTs work is that they are just like non-zero-sized values except that you can do more things with them. In particular, they are movable just like other values.)
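
The second point can be sketched as follows (`Marker`, `Wrapper`, and `marker_offset` are hypothetical names, and `#[repr(C)]` is used only to make the offset predictable). Today a pointer to a ZST field is just the containing struct's address plus the field offset, like any other field; a per-type unique address would require overriding that computed value.

```rust
struct Marker; // zero-sized

#[repr(C)] // fixed field order so the offset is predictable
struct Wrapper {
    n: u32,
    z: Marker,
}

/// Byte offset of the `z` field's address from the start of the wrapper.
/// (Hypothetical helper for this sketch.)
fn marker_offset(w: &Wrapper) -> usize {
    let base = w as *const Wrapper as usize;
    let field = &w.z as *const Marker as usize;
    field - base
}

fn main() {
    let w = Wrapper { n: 7, z: Marker };

    // The ZST field's address is derived from where the struct lives,
    // not from any per-type singleton address.
    assert_eq!(marker_offset(&w), 4);
    println!("n = {}, marker offset = {}", w.n, marker_offset(&w));
}
```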

system · August 28, 2024, 3:03pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.
