GlobalAlloc::dealloc
states that the provided memory layout must be the same as the layout used to allocate the memory block. This implies that the alignment of these two layouts must be identical.
However, doesn't the deallocation method rely only on:
- the size of the requested and provided memory, and
- the alignment of the address it provided?
In other words, an allocator may return memory aligned to [N+1..]
, even when an N
-aligned layout was requested.
If the deallocation method relies on the alignment of the pointer it returned, why not provide a wrapper deallocation method without an alignment requirement? Let's call it deallocate_unaligned
:
(Edit 2): As user SkiFire13 stated, this implementation is undefined behavior (UB) because some allocators rely on the alignment of the provided layout to function correctly. Additionally, it depends on the Allocator::deallocate
method, which requires the same layout that was used during allocation.
trait Allocator {
// Existing methods ..
// This method should also be added to `GlobalAlloc`
/// The provided `layout.size()` must fall within the range `min ..= max`, where:
///
/// * `min` is the size of the layout most recently used to allocate the block, and
/// * `max` is the latest actual size returned from `allocate`, `grow`, or `shrink`.
fn deallocate_unaligned(&self, ptr: NonNull<u8>, layout: Layout) {
let address = ptr.as_ptr().addr();
// What if the address is 0?
let align = address & (!address + 1);
let layout_with_recovered_alignment =
unsafe { Layout::from_size_align_unchecked(layout.size(), align) };
self.deallocate(ptr, layout_with_recovered_alignment);
}
Note that allocator implementations that are insensitive to layout alignment during deallocation (e.g., C's free
) may simply override this method to call the inner deallocation function, incurring no additional runtime overhead.
(The documentation was copied from Memory fitting
.)
Such a method would allow memory reuse for types of the same size (in bytes) but with different alignments. For example, it would enable in-place mapping of [T]
to [U]
.[1]
Consider this example:
#[repr(align(2))]
struct Wrapper([u8; 2]);
pub fn eq_aligned(inp: Vec<u16>) -> Vec<Wrapper> {
assert!(align_of::<u16>() == 2);
assert!(align_of::<Wrapper>() == 2);
inp.into_iter()
.map(|t| Wrapper(t.to_ne_bytes()))
.collect()
}
pub fn ne_aligned(inp: Vec<u16>) -> Vec<[u8; 2]> {
assert!(align_of::<u16>() == 2);
assert!(align_of::<[u8; 2]>() == 1);
inp.into_iter()
.map(u16::to_ne_bytes)
.collect()
}
Now, look at the generated assembly in Compiler Explorer. If I'm not mistaken, this size and alignment check is preventing an optimization. This is understandable, as GlobalAlloc
requires the layout used for deallocation to match the one used for allocation. However, if a method like the previously mentioned deallocate_unaligned
existed, Vec
could potentially use it to reuse memory and deallocate it with a layout of a different alignment.
I understand that the size of the memory layout is crucial for deallocation. However, does the deallocation method truly depend on the alignment of the layout used during allocation, or on the alignment of the pointer it returned?
What prevents us from introducing a method like deallocate_unaligned
to allow deallocation using a layout with a different alignment?
I haven't been able to find a definitive answer. Libraries like jemalloc
and glibc
perform some unreadable sorcery.[2]
Edits
- Edit 1: Changed the title.
- Edit 2: Noted that the implementation of
deallocate_unaligned
is UB.
Footnotes
More specifically, it would allow in-place mapping of
[T]
to[U]
when!T::IS_ZST && !U::IS_ZST && size_of::<T>() % size_of::<U>() == 0
. ↩︎However, for instance,
rulloc
'sBlock::from_allocated_pointer
, used when deallocating, implicitly trusts the alignment of the provided layout. Yet, if you look atBucket::allocate
, you’ll notice it could compute alignment from the provided pointer instead of relying on the given layout’s alignment. ↩︎