Re: why Rust references are not pointers

kornel · April 4, 2018, 8:09pm

Continuing the discussion from Bikeshed: Rename `catch` blocks to `fallible` blocks:

Bikeshed: Rename `catch` blocks to `fallible` blocks

Bikeshed: Rename `catch` blocks to `fallible` blocks

“References” on the other hand are suuuper confusing to C++ programmers who expect them to be usable like regular pointers. I’d prefer &/&mut to be called something completely different, like “read permission”/“exclusive read/write permission”.

Interesting. At the risk of creating a tangent, I’m usually biased toward the C++ model of programming reality, and I always found “references” to be an extremely intuitive and broadly accurate term. To me the biggest difference between C++ references and Rust references is simply that Rust tries a lot harder to provide some basic semantic guarantees for them, e.g. that they even have a referent so it’s safe to dereference them.

In C/~~C++~~ there's clear distinction in the syntax between passing things by reference (pointer) and by value, but there's no distinction between owned and borrowed values.

In Rust it's the opposite. The syntax exists to separate borrowed vs owned, but the distinction between values and pointers is vague, and handled entirely differently (& may be a 2-usize struct, but Box is just a raw pointer, usually).

C is super explicit about dereferencing, but the concept of lifetimes is left entirely up to programmer's imagination.
OTOH Rust is explicit about borrows and lifetimes, but dereferencing is often hidden.

So given all that if you think Rust reference == C pointer, you'll think of them from a completely wrong perspective.

For example, in C it's common to return objects by pointer, because that's how malloc works, and there's no other way to have any private type than via an opaque struct pointer. But in Rust if you try to allocate something and return a reference to it the borrow checker will tell you it doesn't make any sense.

In C it's usual to put pointers in structs. In Rust references in structs is are a special case of limited usefulness.

So I think it's better to think of Rust's references as temporary read/write locks that are implemented at compile time. That fixes thinking from "I want to return this by pointer" to "why would I give a temporary read-only lock to the object, rather than the object itself?"

Ixrec · April 4, 2018, 8:30pm

I don’t have time for a detailed response atm, but I do want to emphasize that I was comparing Rust references to C++ references, not to C pointers (or C++/Rust raw pointers). For me at least, most of what you’re saying is also an argument “why C++ references are not pointers”.

But totally independently of that, I think the terminology point you appear to be arguing for has some merit, and I’ve even see people like niko express similar views. I just think that the terminology arguments to have are, say, whether a “shared reference” should be called a “readonly reference”, or whether a “mutable reference” should be called an “exclusive reference”; I don’t see any compelling alternatives to the “reference” part.

ExpHP · April 4, 2018, 8:32pm

I wouldn't group C++ in with C here.

C++ distinguishes between owned (T) and borrowed (T & or const T&) values.
It is not at all common to return objects by pointer in modern C++. Easiest way to return something is by value (copy constructor, usually with the hope of the compiler performing copy elision); preferred way in modern code is through move constructors.
Putting raw pointers in structs in C++ is a special case of limited usefulness. (edit: well, perhaps not exactly, since in C++ there's a lot you can do with a pointer. But most of it is dangerous, and that's why ref-counted pointers are often preferred where possible)

steveklabnik · April 4, 2018, 8:50pm

The way we explain these concepts in the book is that “pointer” is the most general idea, and “references” are a specific kind of pointer, one with more guarantees.

mark-i-m · April 4, 2018, 10:53pm

That’s how I tend to think of them, and how I have always understood them (and I come from a C/Java background)…

kornel · April 4, 2018, 10:54pm

The thing I struggled with was that Box is a pointer, too. Because it doesn’t have a pointer-like syntax sigil it was hard for me to internalise that, and I severely overused references in the beginning.

mark-i-m · April 5, 2018, 1:49am

Out of curiosity, do you come from a C++ background? C++ has smart pointers too (e.g. std::shared and std::unique), though I’m not sure it has anything like Box. I ask because I imagine that people coming from different backgrounds would struggle with different aspects of references…

Ixrec · April 5, 2018, 2:30am

I was always under the impression that Box<T> is analogous to std::unique_ptr<T>. They both express unique ownership of a T in the heap, and they both support move semantics and RAII cleanup. The biggest difference I know of is that Box<T> is not quite implementable without magic, but that’s only because of the “DerefMove” problem.

kornel · April 5, 2018, 11:33am

I’ve written much more plain C than C++, so to me Box is a fancy malloc(), and I’m still baffled that my structs and the rest of the program cares whether something came from “malloc” or not.

mbrubeck · April 5, 2018, 1:39pm

&T and Box<T> and *mut T have exactly the same in-memory representations. Box<str> and Box<[u8]> and Box<Trait> are fat pointers, just like &str and &[u8] and &Trait.

The real difference is the ownership and borrowing semantics:

Box<T>: unique owning pointer
Rc<T>: shared owning pointer
&mut T: unique borrowed pointer
&T: shared borrowed pointer

gbutler · April 5, 2018, 2:10pm

Nice way to put it. I'd think this statement alone would clarify a lot for most people.

dzamlo · April 5, 2018, 6:57pm

I like the term borrow. Event if it is quite specific to Rust.

system · March 25, 2019, 8:30am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Readable unsafe is safe unsafe ideas (deprecated)	13	2894	March 25, 2019
Pass `&` references by value ideas (deprecated)	26	5416	March 25, 2019
How much more tricky is `unsafe` code than C code? (Aliases and `const_cast`) internals	5	1840	March 25, 2019
Comparing dangling pointers language design	43	10939	March 25, 2019
A possibly more erognomic syntax for borrow language design	39	2403	March 25, 2019

Re: why Rust references are not pointers

Related topics