Pre-Pre-RFC: A Box reinitialization interface

197g · September 4, 2019, 10:38pm

In reference to this comment, it would sometimes be more efficient to reuse a Box than to reallocate a new one. While the compiler may at some point become smart enough to figure this out, a safe interface that guarantees it can already be written.

Possible interface 1

At least for N: Sized I would have expected a signature that allows access to the replaced value like in mem::replace. Otherwise it can not retrieve the current item by value and so doesn't fully parallel *x. Since the pre-conditions can be checked outside in user code it might be sensible to panic similar to mem::transmute if the layout of the replacement differs from the current value. An interface that accomplishes this purely with Box<T> is:

impl Box<T> {
    fn reuse<N>(b: Self, val: N)-> (Box<N>, T) {
        use std::alloc::Layout;
        let layout = Layout::for_value(&*self);
        assert!(layout == Layout::new::<N>());

        let reboxed;
        unsafe {
            let ptr = Box::into_raw(self);
            let previous = ptr::read(ptr);

            let ptr = ptr as *mut N;
            ptr::write(ptr as *mut N, val);
            reboxed = Box::from_raw(ptr);
        }
        (reboxed, previous)
    }
}

Possible interface 2

Another way would be to be broken into two operations. This could allow both unsized values and the computation of N to involve a value of T instead of only a reference.

struct RawBox(Unique<u8>, Layout);

impl<T: ?Sized> Box<T> {
    /// Drop the value in the box but keep the allocation.
    /// This is slightly more efficient than `take` and works on
    /// unsized values.
    pub fn drop(own: Self) -> RawBox {
        let layout = Layout::for_value(&*own);
        let unique = Box::into_unique(own);
        unsafe {
            ptr::drop_in_place(unique.as_mut());
        }
        RawBox(inner.cast(), layout)
    }

    /// Take a sized value from the box.
    pub fn take(own: Self) -> (RawBox, T) where T: Sized {
        let unique = Box::into_unique(own);
        let val = unsafe {
            ptr::read(unique.as_ptr())
        };
        let raw_box = RawBox(unique.cast(), Layout::new::<T>());
        (raw_box, val)
    }
}

impl RawBox {
    pub fn layout(&self) -> Layout {
        self.1
    }

    pub fn init<N>(self, val: N) -> Box<N> {
        assert!(self.layout() == Layout::new::<N>());
        let unique = RawBox::into_unique(self);
        unsafe {
            ptr::write(unique.as_mut(), val);
        }
        Box::from(unique)
    }

    pub fn into_unique<T>(b: RawBox) -> Unique<T> {
        let mut unique = b.0;
        mem::forget(b);
        unsafe { Unique::new_unchecked(unique.as_mut() as *mut T) }
    }
}

Unresolved questions

Choose an interface with RawBox or not?

Handling initialization for unsized values?

RustyYato · September 4, 2019, 10:46pm

I like the RawBox interface, when I reimplemented Box for learning purposes a while back I used a similar interface and it works rather nicely. reuse can then be written as,

impl<T: ?Sized> Box<T> {
    fn reuse<N>(b: Self, new: N)-> (Box<N>, T) {
        let (raw, old) = Box::take(b);
        (raw.init(new), old)
    }
}

RustyYato · September 5, 2019, 2:16am

On the same note, can we map over Vec in place? i.e.

impl<T> Vec<T> {
    fn map<U, F: FnMut(T) -> U>(self, f: F) -> Vec<U> {
        ...
    }
}

This way we can reuse the allocation of the Vec<_> while still using owned values in the map.

Here's how to do that on playground

We could do something similar in the case of VecDeque to get in place mapping.

mcy · September 5, 2019, 2:42am

This is one of those situations where it would be really nice to have compile-time expressions in where clauses... though I'm pretty sure that invites EXCITING compatibility concerns? It's somewhat unfortunate that the layout of a type is nominally API-private information, even though it's ABI-crucial.

RustyYato · September 5, 2019, 5:23am

This seems related

bill_myers · September 5, 2019, 11:07am

I'd suggest to change RawBox into RawBox<T> and add a method that can convert it to a RawBox<U> with no allocation if the layouts are compatibile, and reallocating otherwise, and also adding a way to create a RawBox directly.

This can also be used to do a "placement box" by creating a RawBox and then filling it with an inline function.

Maybe it should be called UninitBox or BoxPlace instead of RawBox though.

dhm · September 5, 2019, 12:59pm

Possible Interface 3

impl Box<T> {
    fn map<U, F> (this: Box<T>, map: F)
        -> Box<U>
    where
        F : FnOnce(T) -> U,
    {
        use ::std::alloc;
        let layout = alloc::Layout::new::<T>();
        assert_eq!(
            layout,
            alloc::Layout::new::<U>(),
        );
        let ptr: NonNull<T> = this.into_raw_nonnull();
        let value: T = unsafe { ptr.as_ptr().read() };
        let guard = ::scopeguard::guard_on_unwind(
            ptr,
            // if the code unwinds, do:
            move |ptr| unsafe {
                alloc::dealloc(ptr.cast::<u8>().as_ptr(), layout);
            },
        );
        let value: U = map(value);
        let ptr: NonNull<U> = guard.into_inner().cast::<U>();
        ptr.as_ptr().write(value);
        Box::from_raw(ptr.as_ptr())
    }
}

(where the helper NonNull could be upgraded to Unique)

mjbshaw · September 5, 2019, 5:07pm

I like the idea of map, but I wouldn’t assert that T and U have the same layout. Just reallocate if necessary. It also makes me think that we could do something similar for Vec (allowing Vec.into_iter().map().collect() where the final collect() reuses the original memory if possible).

RustyYato · September 5, 2019, 5:39pm

Yes, that is a good idea.

Here is the updated Vec::map and Vec::try_map, this one won't panic on it's own.

edit: now added try_map for fallible operations!

CAD97 · September 6, 2019, 12:51pm

FWIW, I think ideally:

RawBox should carry the Layout its allocation was allocated with statically as a const generic, and have safe methods both to guarantee reuse of the allocation (with transmute-like magic checking or const generic magic) and to relocate if necessary (but reuse if possible). The reuse would be allowed to lower the guaranteed alignment but not otherwise change the layout.
Safe methods interchanging RawBox and Box<MaybeUninit<_>>. Maybe even just spell RawBox<T> as Box<MaybeUninit<T>>?
RawBox is implemented in std and Box is implemented in terms of RawBox. Box is no longer special and is purely a library type, RawBox inherits only some of today's Box's specialness. (I.e. they're no longer a completely unique kind of type.)
All of this is carefully done to still work with ?Sized types the whole way through.

Big unknowns:

box syntax.
Raw forms of other alloc types.
"DerefMove" (i.e. why Box is still special).
- Along with that but partial.

RustyYato · September 6, 2019, 1:52pm

Layout can't be a const generic due to DSTs not having a specified layout till runtime, and RawBox can't be Box<MaybeUninit<T>> due to DSTs being incompatible with unions. Box will have to stay special in order to support deref moves out of a box.

Other alloc types, like Vec may benefit from a similar api. But HashMap and other types that need other properties of T won't.

197g · September 6, 2019, 6:45pm

That is definitely not safe. The owner must remember the layout passed to alloc and pass it dealloc. Somewhat tragically realloc also does not allow changing the alignment, even though I imagine that many allocators would permit lowering the alignment of the layout without additional costs by calling realloc, or at least require actual reallocation in only few cases.

RustyYato · September 7, 2019, 4:41am

I have done it again! Now, I have added try_zip_with and zip_with. These take an additional vec and try and use the capacity of either one! I have also make the code more robust, now it will clean up it's mess even if you panic in the drop.

https://play.rust-lang.org/?version=nightly&mode=debug&edition=2018&gist=8655604d47b5a128ab306b1a5c87589a

In the interest of not polluting this thread even more with these updates, I have published this as a minimal crate so that you can track it's progress there and contribute if you would like to!

https://crates.io/crates/vec-utils

system · December 6, 2019, 4:42am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pre-RFC: placement box with Placer trait ideas (deprecated)	10	5576	March 25, 2019
[lang-team-minutes] feature status report: placement in and box language design	33	7523	March 25, 2019
[Pre-RFC] Allocators, take II ideas (deprecated)	10	4057	September 17, 2014
More efficient boxed slice creation language design	10	1238	August 1, 2020
mRFC: iterable Box<[T]> language design	10	1105	February 2, 2020

Pre-Pre-RFC: A Box reinitialization interface

Possible interface 1

Possible interface 2

Unresolved questions

Possible Interface 3

Related topics