Should stdlib error type use Box<str|[u8]> instead of String|Vec<u8>?

Dushistov · July 22, 2025, 3:35pm

In all the Rust code I've read error types are used as read only types. So why not optimize code this case, in other words do not return capacity field as part of error type?

For example NulError, it is struct NulError(usize, Vec<u8>):

github.com/rust-lang/rust

library/alloc/src/ffi/c_str.rs

c0b282f0c


      
          ///
          /// # Examples
          ///
          /// ```
          /// use std::ffi::{CString, NulError};
          ///
          /// let _: NulError = CString::new(b"f\0oo".to_vec()).unwrap_err();
          /// ```
          #[derive(Clone, PartialEq, Eq, Debug)]
          #[stable(feature = "alloc_c_string", since = "1.64.0")]
          pub struct NulError(usize, Vec<u8>);
          
          #[derive(Clone, PartialEq, Eq, Debug)]
          enum FromBytesWithNulErrorKind {
              InteriorNul(usize),
              NotNulTerminated,
          }
          
          /// An error indicating that a nul byte was not in the expected position.
          ///
          /// The vector used to create a [`CString`] must have one and only one nul byte,

it takes 32 bytes on amd64, but why it need capacity field?

pub struct NulError(usize, Box<[u8]>); contains the same data, but takes only 24 bytes.

The other example VarError, it's size is 24 bytes, but if use Box<[u8]> instead of OsString (=Vec<u8>) it would be only 16 bytes.

burntsushi · July 22, 2025, 4:05pm

For NulError, that Vec<u8> comes from the user input. If you round-trip that through Box<[u8]>, then you might incur an extra copy. And it also needs to support NulError::into_vec.

As for VarError, I think you mean Box<OsStr> and not Box<[u8]>. And this can only be a historical question, since switching it to a Box<OsStr> would be a breaking change. I'm not sure exactly why OsString was selected here. It does seem like a Box<OsStr> would work here since std::env::var accepts an AsRef<OsStr>. So creating a Box<OsStr> shouldn't involve any additional copies like it might in the CString::new case. But this would be a semver incompatible change at this point, and I doubt the extra 8 bytes really makes much of a difference in practice. (And if it did, it's easy to work around it.)

kornel · July 23, 2025, 11:14am

With errors in general the problem is that error messages are likely to come from format!, and format! does not allocate precise length, so conversion to Box<str> could require reallocation.

Topic		Replies	Views
Proposal: add std::error::BoxError libs	26	3378	February 16, 2020
`impl Error for String` libs	21	9074	March 25, 2019
Missing String->CString without copy libs	12	704	February 17, 2025
Why not impl TryFrom<Vec<u8>> for String? libs	11	986	March 24, 2024
Unified Errors, a non-proliferation treaty, and extensible types language design	40	5349	March 25, 2019

Should stdlib error type use Box<str|[u8]> instead of String|Vec<u8>?

Related topics