Can the lifetimes be simpler?

SparrowLii · March 3, 2025, 6:36am

In current Rust, lifetimes are sometimes redundant. For example, in the following code, the compiler should be able to automatically infer that s1, s2 and the return value in longest have the same lifetime.

fn longest(s1: &str, s2: &str) -> &str {
    if s1.len() > s2.len() {
        s1
    } else {
        s2
    }
}

fn main() {
    let string1 = String::from("Rust");
    let string2 = String::from("Programming");

    let result = longest(&string1, &string2);
    println!("The longest string is: {}", result);
}

However currently it has to be specified manually:

fn longest<'a>(s1: &'a str, s2: &'a str) -> &'a str {
    if s1.len() > s2.len() {
        s1
    } else {
        s2
    }
}

SparrowLii · March 3, 2025, 6:50am

And in this case, the compiler should automatically infer the lifetime of the return value as the same as s1

fn foo(s1: &str, s2: &str) -> &str {
    println!("{}", s2);
    s1
}

Instead of having to manually infer like this:

fn foo<'a>(s1: &'a str, s2: &str) -> &'a str {
    println!("{}", s2);
    s1
}

FZs · March 3, 2025, 7:13am

Lifetimes are determined based only on the signature of the function, not the body^[1].

This allows Rust to borrow-check each function in isolation.

Besides that, it also eliminates a semver hazard (changing the body of a function such that it silently changes the lifetimes in the signature).

even elided lifetimes can be determined purely syntactically from the signature ↩︎

SparrowLii · March 3, 2025, 7:48am

Yea, the current lifetime reduces the design burden on the compiler. But on the other hand, based on the experience from Rust developers around, lifetimes bring a significant burden on users.

This is not very friendly to expanding the use of this language, especially for who want to write large projects with Rust. The tcx that is everywhere in the compiler's code is an example.

So I wonder if we can shift part of the user's burden to the compiler's automatic inference.

quinedot · March 3, 2025, 9:08am

Not being able to tell what the lifetime relationships are based on the function signature is a deteriment to any Rust developer other than the developer actively writing the function --- including the same developer one month later. Having to read the function body in able to ~~know~~ try to infer the lifetime relationships is a burden on the developer.

An IDE being able to infer the relationships and supply the function signature for the actively writing developer -- that's a different story. That is where the desired functionality belongs.

Vorpal · March 3, 2025, 9:11am

Do you have a source for that statement? Some survey perhaps?

SparrowLii · March 3, 2025, 10:01am

Not at current, but this sounds a good idea, probably I can do a survey in the next few days

SparrowLii · March 3, 2025, 10:08am

According to this logic, the lifetimes should be like comments. We encourage everyone to write it, but we can relax the restrictions so that learners can learn, try and write the code faster.

FZs · March 3, 2025, 10:32am

While allowing inferred lifetimes may help learners in the simplest cases, imo it would just move the difficulty further down the line. In more complicated cases where the compiler can't infer the correct lifetimes (I assume there would be such cases), having not learned how lifetimes work in simpler cases, beginners might have an even harder time fixing the problem.

Having an IDE code action insert the lifetime annotations (as @quinedot suggested) would be a better solution for learners of the language too, because they could see what they should've written to make the code work, which makes them more likely to learn how lifetime annotations work^[1] before hitting the harder cases.

than not having to write lifetime annotations at all ↩︎

SparrowLii · March 3, 2025, 12:21pm

In fact, my ultimate goal is to have only a few special cases that require users to annotate lifetimes, and let the compiler infer most other cases. Like you said, if only a small number of cases can be simplified, then there is indeed little value.

However, I still feel that the original purpose of lifetimes is to reduce the burden on the compiler, rather than asking users to understand the lifetimes of variables via manual declarations. It has been more than ten years since Rust 1.0 was released, and I think the designers of the language and compiler should really re-cosider this feature.

bjorn3 · March 3, 2025, 12:30pm

Rustc already infers lifetimes based on the function signature according to a couple simple rules. (Lifetime elision - The Rust Reference) Inferring them based on the body is going to be a semver hazard as any change to a function body can accidentally cause the public api to change.

SparrowLii · March 3, 2025, 12:43pm

Yea, semver hazard is an important consideration. But letting the compiler infer (assuming it can) is no more restrictive than manual annotations, so when users need to change their code, they won't make the project more complicated than if they had manually annotated it.

FZs · March 3, 2025, 1:31pm

I think you're missing the point here. The function signature forms a contract between the caller and the callee (function body). If you make the lifetimes less restrictive to the callee, that means more restrictive to the caller and vice versa.

If I have a function like this:

fn foo(s1: &str, s2: &str) -> &str {
    s1
}

and someone writes a function like this:

fn main() {
    let s1 = "s1".to_string();
    let s2 = "s2".to_string();

    let r = foo(&s1, &s2);

    drop(s2);

    println!("{r}");
}

having inference would mean that I can change the body of foo^[1] to be:

fn foo(s1: &str, s2: &str) -> &str {
    if s1.len() > s2.len() {
        s1
    } else {
        s2
    }
}

...which would break main downstream.

Without inference, such a breaking change is not possible.^[2]

without touching its signature ↩︎
except for RPIT leaking auto-traits, which is already unfortunate enough ↩︎

robinm · March 3, 2025, 4:17pm

I see a couple of options:

Do nothing (that’s probably the best thing to do).
Assume that if there are multiple arguments with references, then references in the output must outlive all references passed as arguments (ie. add a shared 'anonymous to all references), possibly behind a #![output_must_outlive_all_arguments] switch used during development, just like #![unused_code] and the like.
Ensure that both rust-analyzer and rustc have a "fix-me" button that uses inference and modify the source code to add lifetime annotation accordingly.

I assume that a combination of 2 and 3 could slightly improve the developer experience, but I’m not even sure 2 would really be that useful.

dlight · March 3, 2025, 5:49pm

Per step 2 of Niko's awesome The borrow checker within, maybe some day you will be able to write this as

fn longest(s1: &str, s2: &str) -> &'(s1, s2) str {
    if s1.len() > s2.len() {
        s1
    } else {
        s2
    }
}

(disabling syntax highlighting because the parser botched it)

I think this is a big improvement because the 'a doesn't really mean anything, it's just a placeholder for a generic parameter - but this function ought to not be generic! We just lack the syntax to specify that "the output will borrow either from s1 or from s2", which is what &'(s1, s2) str means.

And IMO this is probably more intuitive and teachable than describing the same thing using generics: when you write fn longest<'a>(s1: &'a str, s2: &'a str) -> &'a str you crucially are depending on the compiler to unify the lifetimes of s1 and s2, which means that 'a is the smaller (or intersection) of the corresponding unconstrained lifetimes, which is IMO very subtle until you have internalized this concept - and then suddenly it becomes second nature and you are puzzled why people are confused about it.

Note that while simpler and easier to understand (IMO), this syntax is not the same as inferring the lifetimes: you still need to explicitly state which parameters exactly the output can borrow.

But... perhaps we could infer lifetimes of private functions, under the rationale that if we change the body of the function it won't break any crate that depend on it. But even then, I think that there should be explicit syntax to infer it, like this:

fn longest(s1: &str, s2: &str) -> &'_ str {
    if s1.len() > s2.len() {
        s1
    } else {
        s2
    }
}

And if you add a pub to this function the compiler would of course refuse to compile, but also print the lifetime it inferred so you can paste into your code - and maybe rust-analyzer could apply it with a code action.

zackw · March 3, 2025, 6:18pm

... and I would like to point out that when you are reading the documentation of a public API, "which parameters exactly the output can borrow" is important information that you need to know, and you don't get to look at the body of the function to figure it out.

(Often you can, but only by clicking through the [source] link; and the author of the API might not have chosen to publish the source; and it can be substantially more difficult than just reading the annotations!)

When people are saying things like

this is what they mean.

scottmcm · March 3, 2025, 7:01pm

The biggest reason, IMHO, that inferring them from the body is bad is that I want to be able to write

fn longest<'a>(s1: &'a str, s2: &'a str) -> &'a str {
    todo!()
}

Then use that in a bunch of other places, knowing that I won't suddenly have all the callers break once I actually implement it.

If todo!() bodies were treated as "well actually none of them matter so you can do whatever", that would be horrible.

quinedot · March 3, 2025, 8:03pm

You must have misinterpreted my post, as making lifetimes optional is the opposite of my position.

dlight · March 3, 2025, 9:49pm

Yeah, that's why such inference, if available for private functions, should be opt in and probably used sparingly.

scottmcm · March 3, 2025, 10:10pm

I think that for private functions like that, what people often want is not an inferred signature at all, but a "macro function" that essentially just copies the code into the caller. So it doesn't need to infer a trait type, but can just do x + y without worrying about what those types are until it's used. And where you can conditionally move something but the move checker runs in the context of the caller, etc.

Topic		Replies	Views
Why does Rust require explicit lifetimes? language design	16	376	March 19, 2025
General lifetime elision of values possible or even desired?	4	509	March 29, 2021
Lifetime elision Alteration language design	6	482	July 14, 2024
More robust lifetime inference ideas (deprecated)	5	2057	March 25, 2019
Inferring lifetimes in simple cases internals	1	1071	March 25, 2019

Can the lifetimes be simpler?

Related topics