[lang-team-minutes] feature status report: placement in and box

vadimcn · February 14, 2017, 6:09am

Sure, but I've seen similar arguments used against direct usage of closures for emplacement.

This sounds like an advantage of the closure form, really: vec.place_back(|| run_code()?) will give you a type error, whereas vec <- run_code()? will compile, but won't do the intended optimization.

vec.place_back() <- x is not even shorter than vec.place_back(|| x)

To win on the number of characters typed, we'd need to figure out auto-ref'ing, so one could write vec <- x. Which opens a wholly new can of worms, it seems.

hanna-kruppe · February 14, 2017, 3:24pm

Without auto-ref we could still have &mut vec <- x, or plain vec <- x if vec is actually a &mut Vec (not at all rare). But yes, auto-ref is a big win. I don't see what can of worms it opens, other than the current implementation (HIR expansion) being unsuitable — but at least for box, the implementation may already change for other reasons (type inference) as outlined in the OP.

And it's not just, and not primarily, about the number of characters typed. In my and other people's vision, <- would be the default, the most simple and aesthetic way of inserting in a collection: the <- syntax is visually evocative, it's the same for different collections (Vec::push vs. HashSet::insert), and it has less visual noise than the alternatives (no parentheses, no || prefix).

While reading this, it occurred to me that the closure form actually isn't equivalent in the number of moves! In the following example (after whatever temporaries and moves are involved in try! or ?), the payload of the Ok variant is moved into the local arg, then moved into the closure object, and then finally moved into the Vec's backing buffer.

let arg = try!(compute_argument());
vec.emplace(|| arg);

Contrast this with the <- equivalent (vec <- try!(arg);), which does indeed need a temporary for the Result just like the above code, but in the Ok case directly (again, after whatever happens in the try!) copies the payload into the Vec's memory. And despite the closure being quite inlinable, the move from the Result temporary into the local occurs before the memory allocation, so we can't be sure it gets reordered. (Not to mention that the closure isn't 100% guaranteed to be inlined, the fact that LLVM has been and still is less-than-stellar about optimizing memcpys, and a million other minor wrinkles.)

So there's that. But even if it were fully equivalent, I don't think we'd want the error that the closure implies. That would assume people only use placement (in whatever form) when they absolutely positively need the optimization and can not tolerate anything less. This is antithetical to the vision of placement becoming the default syntax — it doesn't always have to be always faster (the ? case needs a temporary regardless of how you approach it), it just has to be always at least as fast as the alternatives (Vec::push and your hypothetical Vec::emplace).

Besides, Rust makes a point of being relatively explicit about costs, if you know what to look for, but it doesn't go out of its way to actively penalize slight inefficencies. If someone does not know, or does not care, that a ? expression involves an extra temporary, forcing them to move the ? out of the closure doesn't help them, it just force them to write uglier code and prolong the "fight the compiler" phase. Furthermore, even if we wanted to highlight these situations, a lint could do the job just as well, but one could silence it.

canndrew · February 16, 2017, 7:25am

I just want to mention again that i don’t think it makes any sense to settle on a design for this until we’ve figured out a design for &in / &place pointers (ie. the opposite of &move pointers).

eddyb · February 16, 2017, 12:38pm

Neither &move nor “out-pointers” are really ergonomic without being parametrized on the allocation and owning it.

IOW, the Place types in the RFC are one of the few realistic versions of out-pointers for Rust (you could also imagine having a single type with a generic parameter, but the two are mostly equivalent), there just is no Place type provided for stack slots, only everything else. That makes p <- x what would otherwise be *p = x.

Storyyeller · February 16, 2017, 11:52pm

I really don't like vec <- x because it seems confusing and ambiguous to me. For example, with a vec, you usually want to push to the back, but for a deque, you can efficiently append to either end. And it's not like you can't insert into the beginning of a vec either, it's just slower than inserting to the back. So it's not immediately clear what vec <-x even means.

Apart from that, Go uses <- for an unrelated operation (sending and receiving on a channel), so that would increase confusion as well.

devyn · March 27, 2017, 8:33pm

Placement could be conceivably implemented for channels as well, since they're basically collections purpose-built for communication and synchronization. It makes enough sense to me that one could do

tx <- MyMessage(42)

instead of

tx.send(MyMessage(42))

with all of the benefits of in-place allocation that this could potentially get you.

oln · June 1, 2017, 4:28pm

Are there any updates on this? Is box syntax dependent on the rest of the functionality here, or would it be possible to stabilise box as it works in nightly before having the rest of the details sorted?

pnkfelix · June 8, 2017, 10:02am

One problem with stabilizing box as it works in nightly: It might prematurely stabilize a strong connection between box <expr> and Box<T> (for an expr: T), which in turn might make it difficult to generalize box in the future to other container types in a backward-compatible fashion.

Basically, we’ve been holding off stabilizing box until we make more progress on generalizing box (or, alternatively, deciding that generalizing box is not worth the effort/cost…)

oln · June 17, 2017, 1:48pm

Would it be an idea to special case Box::new(x) to simply act the same as the current unstable box x while this is not stabilised? That’s what the Box::new function contains anyhow, but it doesn’t seem to always inline properly. I don’t think there are any ways of guaranteeing heap allocations without temporaries on stable at the moment without delving into unsafe, especially on lower optimisation levels.

withoutboats · June 18, 2017, 3:06am

How would that special case work?

gulbanana · June 18, 2017, 4:26am

Perhaps by using call-by-name semantics.

bjorn3 · June 18, 2017, 9:26am

#[inline(always)]?

oln · June 20, 2017, 8:31pm

Consider MIR output (current rustc doesn’t really do any optimisation on the mir level at the moment except cleaning up &*, so debug and nightly is similar) of this code: https://is.gd/WvMCrx

My suggestion is that the compiler could emit MIR similar to the one emitted by box x when it encounters Box::new(x) rather than emitting the normal function call mir (ie. treat it as if it encountered box x. This lets LLVM optimise the function much better, if you look at the assembly output of the same functions. If you look at the MIR for the Box::new version it creates a temporary array on the stack (an possibly another one inside the function call). I don’t know the compiler internals well enough to say the exact details of how this could be implemented.

Another temporary alternative could maybe be a boxed! macro similar to vec!.

pnkfelix · March 25, 2019, 8:27am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Removal of all unstable placement features announcements	21	6017	March 25, 2019
Pre-RFC: placement box with Placer trait ideas (deprecated)	10	5543	March 25, 2019
Placement NWBI<- FAQ (New/Box/In/Left Arrow) language design	27	10572	March 25, 2019
Feature Idea: Add a macro to "de-magic" box syntax language design	9	1219	April 10, 2021
Downstream crates may implement `Copy` for `Box<_>`	18	1872	July 12, 2020

[lang-team-minutes] feature status report: placement in and box

Related topics