Panic in `CString::from_slice` and friends

aturon · February 12, 2015, 6:20am

When converting from Rust strings to C strings, we have to ensure that there are no interior nulls, which cannot be represented in C strings.

Currently, we do this via the CString type in methods like from_slice, which panic if an interior null is found.

Since interior nulls are relatively rare, that means there are rare panics lurking in anything that winds up calling system APIs, such as IO.

For things like IO, however, we could easily propagate such errors using the InvalidInput variant for std::io errors. That would have essentially zero ergonomic impact on users of IO and similar APIs, but might offer somewhat more graceful handling of the situation.

There are a couple of questions to ask:

Is there a reasonable way for clients to handle InvalidInput errors other than simply panicking themselves? That is, would this change actually benefit code in practice?
Would the ergonomics of CString::from_slice take too much of a hit for code that uses it directly?

cc @mzabaluev @alexcrichton @wycats

aturon · February 12, 2015, 6:24am

Thinking a bit more, it’s probably possible to change the behavior within std from panicking to producing an error variant in the future (this is broadly considered a backwards-compatible change). So this may just be a question about the right “default” behavior for CString.

mzabaluev · February 12, 2015, 11:03am

I favor a domain specific error with a FromError impl for converting into std::io::Error. A NulError can provide programmatic access to the position of the first NUL encountered. CString::from_vec also needs a move-friendly Err value with the original vector.

@aturon, I don’t see how can CString::from_slice be changed to return Result without breaking backward compatibility?

tbu · February 12, 2015, 12:02pm

Definitely in favor of methods returning a Result instead of panicking here!

iopq · February 12, 2015, 1:44pm

Why would it panic? Let me unwrap() it when I want it to panic and let me give a sensible default if I don’t.

kornel · February 13, 2015, 1:21am

Are there cases where one would get NUL in a C string, other than data corruption or an attack?

I’ve encountered the problem of internal NULs vs C strings only in PHP filesystem APIs, but there it’s pretty much guaranteed to be an attack attempt — http://cwe.mitre.org/data/definitions/626.html

Even if Rust’s stdlib handled such strings safely and gracefully, my app probably wouldn’t, so to me a panic! in such case seems fine.

mzabaluev · February 13, 2015, 6:16am

The hazard here is that NUL is so rare in real-world string data that it is easy to get overlooked during development, unless the developers consciously test invalid inputs.

Imagine you’ve got this library interface to work with:

fn set_text(text: &str) {
    let c_text = CString::from_slice(text);  // panic lurks here
    unsafe { ffi::set_text(c_text.as_ptr()) };
}

Unless you look at the code, you never know that the input string has some additional constraint, which is never enforced in pure Rust code because it is only imposed by C libraries. The creator of that code might not even have been aware of the problem: the from_slice call compiles, and it works with the “normal” strings they tested it with. A Result would provide enough of a speed bump to consider the appropriate failure mode, or at least there is an unwrap() in the code to indicate the potential panic.

winksaville · February 13, 2015, 5:26pm

What is the rust idiom for identifying a method may panic? If from_slice can panic I’d sure like the compiler to know that and tell me.

mzabaluev · February 13, 2015, 5:34pm

As I understand the prevailing convention, panics happen only as a result of programmer errors. The way to inform the client about a potential failure when working with external data is is to return a Result.

winksaville · February 13, 2015, 7:36pm

So I just looked at the docs for from_slice and at least it does say it panics, but based on what mzabaluev@ said I’d say it should return Result.

A search turned up this FAQ http://doc.rust-lang.org/complement-lang-faq.html#why-is-panic-unwinding-non-recoverable-within-a-task?-why-not-try-to-“catch-exceptions”? entry about panics which says they can only be caught “outside of a task”. Can someone point me to some example code on how “catch” a panic and have the application resume/recover?

stebalien · February 13, 2015, 7:46pm

I’ve actually had to re-implement the null checking part of from_slice and then call from_slice_unchecked to prevent this panic so I’d really appreciate this change.

mzabaluev · February 14, 2015, 5:26am

Filed as a RFC PR: https://github.com/rust-lang/rfcs/pull/840

aturon · March 25, 2019, 8:23am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Suggestion: str::to_c_string_lossy or CStr::from_bytes_may_clone libs	9	1513	May 22, 2019
`&CStr` from/to `&[c_char]`, safely libs	15	1221	June 28, 2020
CStr::in_bytes method libs	4	559	June 17, 2021
Impl TryFrom<&str> for CString language design	7	185	August 20, 2024
Planned CStr changes seem inefficient libs	38	1562	February 10, 2025

Panic in `CString::from_slice` and friends

Related topics