Python-like chained comparison operators

mayabyte · October 11, 2020, 5:19pm

Python allows chaining comparison operators as a helpful shorthand:

if a < b < c: # equivalent to (a < b and b < c)
    ...

However, Rust explicitly disallows this:

fn main() { 
    5 < 10 < 15 
} 
// error: comparison operators cannot be chained 
// --> chain.rs:2:5 
// | 
// | 5 < 10 < 15 
// |   ^    ^ 
// | 
// help: split the comparison into two 
// | 
// | 5 < 10 && 10 < 15 
// |      ^^^^^ 
// 
// error: aborting due to previous error

rust-lang/rfcs#558 seems to be the source of this explicit disallow, with the primary motivation being confusing behavior: a < b < c used to be interpreted as (a < b) < c, resulting in a type error. The RFC also claims this would allow implementation of Python-like chained comparison operators in the future. Is there any motivation not to add these Python-like chained comparison operators into Rust? Given that the syntax for it has been a compiler error since pre-1.0, it doesn't seem like it could be a breaking change. Thoughts?

tesuji · October 11, 2020, 5:33pm

This should be valid Rust, but it is a compilation error now:

5 < 3 < true

josh · October 11, 2020, 5:34pm

There's no fundamental reason not to allow it, and this change was intentionally forward-compatible with the possibility. It would require a lang project proposal (MCP), and a spec.

The spec should cover things such as limiting the types of operators that can chain; for instance, a < b <= c and a >= b > c should chain, but == and != should not chain, and chains with operators in opposite directions like a < b >= c should not work.

The spec would also need to document the desugaring, which should only evaluate each piece once and preserve left-to-right evaluation order; for instance, a() < b() < c() would desugar to { let temp1 = a(); let temp2 = b(); let temp3 = c(); temp1 < temp2 && temp2 < temp3 }.
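To make that desugaring concrete, here is a minimal sketch of what it would look like written out by hand in today's Rust. The `traced` helper and the `eager_chain` function are hypothetical names invented for illustration; they only record the order in which the operands are evaluated.

```rust
use std::cell::RefCell;

// Hypothetical helper: records its label in `log`, then returns `value`.
fn traced(log: &RefCell<Vec<&'static str>>, label: &'static str, value: i32) -> i32 {
    log.borrow_mut().push(label);
    value
}

// Hand-written form of the eager desugaring of `a() < b() < c()`.
// Returns the comparison result and the order in which operands ran.
fn eager_chain(a: i32, b: i32, c: i32) -> (bool, Vec<&'static str>) {
    let log = RefCell::new(Vec::new());
    let result = {
        // Each operand is evaluated exactly once, left to right.
        let temp1 = traced(&log, "a", a);
        let temp2 = traced(&log, "b", b);
        let temp3 = traced(&log, "c", c);
        temp1 < temp2 && temp2 < temp3
    };
    (result, log.into_inner())
}

fn main() {
    let (result, order) = eager_chain(1, 2, 3);
    println!("{} {:?}", result, order);
    // Even when the first comparison fails, all three operands were evaluated.
    let (result, order) = eager_chain(5, 2, 3);
    println!("{} {:?}", result, order);
}
```

Under this form, only the comparisons short-circuit; the operands themselves are always all evaluated.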

Given a project proposal, and a subsequent spec, I don't see any fundamental reason we wouldn't do this. However, the lang team would need to discuss the project proposal and determine if this would be a net win for the language, before the spec went forward.

josh · October 11, 2020, 5:35pm

That is intentionally not valid Rust, to avoid confusing expressions like that (or, much more complex confusing expressions that include type inference). Even if we decided not to allow chained comparisons, I don't think we'd allow that.

mayabyte · October 11, 2020, 6:00pm

What makes you say this? Letting == and != chain would be hugely helpful and I'm not seeing any issues it would cause. Operators in opposite directions may be a little unusual but also don't seem to cause any issues; a < b >= c would desugar into a < b && b >= c, which is a perfectly valid thing to write.

josh · October 11, 2020, 6:11pm

Operators in opposite directions could have a valid desugaring, but that doesn't mean they make for clear code.

== and != is a little more debatable, and personally I'm less attached to that one; I could probably be convinced, given some code that shows how it'd be useful. I'd be much more hesitant to support x == y > z != a > b, though.

djc · October 11, 2020, 6:49pm

"Perfectly valid thing to write" seems necessary but not sufficient. I would guess a < b >= c is much harder to mentally parse than a < b && b >= c, and it's thus not clear to me the additional density is worth it for that case.

jjpe · October 11, 2020, 7:12pm

I guess that depends on how often code like a < b >= c actually occurs. If one had to do this a lot then I could see this syntax be worth the complexity budget. But I don't know if that's actually the case.

mayabyte · October 11, 2020, 7:24pm

That's a good point. The most common uses of this syntax I see in Python are either checking that a value lies between two others, or checking that a bunch of values are all equal to each other; mixed direction comparisons don't show up much. Not sure if this is also true in Rust, but I imagine mixed-direction chained comparisons could always be added later if there's demand for them.

scottmcm · October 11, 2020, 7:39pm

I think this is the core of the issue. I feel like the most common cases are things where there are other acceptable ways. For example, (a..b).contains(&c) seems like a perfectly acceptable way of writing a <= c && c < b. And I'm fine with a <= b <= c <= d not working, since I can write it fine as [a, b, c, d].is_sorted(). Similarly, a < b > c is b > max(a, c), which I like as a way to write it anyway, plus it avoids weird generics-looking things.
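As a rough sketch of those alternatives in compilable form (the function names are made up for illustration, and the sortedness check is spelled out with `windows` so the example does not depend on whether slice `is_sorted` is available on a given toolchain):

```rust
use std::cmp::max;

// `a <= c && c < b`, written as a half-open range check.
// Note that `contains` takes its argument by reference.
fn in_half_open_range(a: i32, b: i32, c: i32) -> bool {
    (a..b).contains(&c)
}

// `a <= b <= c <= d`, written as a check that adjacent pairs are ordered.
fn non_decreasing(values: &[i32]) -> bool {
    values.windows(2).all(|w| w[0] <= w[1])
}

// `a < b > c`, written as "b is greater than both neighbours".
fn is_peak(a: i32, b: i32, c: i32) -> bool {
    b > max(a, c)
}

fn main() {
    println!("{}", in_half_open_range(1, 5, 3));
    println!("{}", non_decreasing(&[1, 2, 2, 4]));
    println!("{}", is_peak(1, 5, 2));
}
```

None of these needs new syntax, which is the point being made: the common shapes already have readable spellings.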

So maybe there aren't amazing ways of writing mixes of these things, like if for some reason I needed a <= b < c <= d, but that case seems so unlikely that I don't care.

pdolezal · October 11, 2020, 7:47pm

I guess that a == b == c is the case which brings no doubts about the meaning and could be allowed. I can also imagine a < b == c < d (I'd say that it is understandable as well). On the other hand, chaining != seems to me a bad idea. So, I agree that chaining == and != should be considered very carefully.

Judging from the examples, I think that chains that look like ordered comparisons of interval boundaries are the case where chaining works well and where it can be useful. The other cases… I think that they are more difficult for a human to parse.

josh · October 11, 2020, 8:35pm

This is something that can be discussed as part of a project group working on the problem, ideally with code examples for how various projects would look with this available.

Tom-Phinney · October 11, 2020, 8:38pm

So how does f(a) < g(b) < h(c) desugar? Recall that the arguments to each comparison are evaluated in place expression context.

Unless the std::cmp::PartialOrd::lt operator is overloaded to have side effects, the following probably works:

{
    let fa = f(a);
    let gb = g(b);
    let hc = h(c);
    std::cmp::PartialOrd::lt(&fa, &gb) & std::cmp::PartialOrd::lt(&gb, &hc)
}

atagunov · October 11, 2020, 9:29pm

Actually it is not

is overloaded == invoked for (a, b) or (b, c) first?
does it instead mean this?

   let a = 8;
   let b = 9;
   let c = true;
   
   (a == b) == c

...I've seen people write something similar in Java

toc · October 11, 2020, 11:06pm

Note: Python "desugars" this as

{
    let fa = f(a);
    let gb = g(b);
    std::cmp::PartialOrd::lt(&fa, &gb) && std::cmp::PartialOrd::lt(&gb, &h(c))
}

(that is, lazily evaluate h(c)). This directly matches "what I would write by hand" if efficiency matters, and is maybe more analogous to Iterator::is_sorted or Generator::is_sorted. The actual desugaring should cover f(a) < g(b) < h(c) < i(d) < ... (and be laziness all the way down).
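The observable difference between the two desugarings can be sketched with a call counter. Everything here (`h` taking a counter, `chain_lazy`, `chain_eager`) is a hypothetical stand-in written for illustration, not a proposed implementation:

```rust
use std::cell::Cell;

// Hypothetical stand-in for `h(c)`: counts how often it runs.
fn h(calls: &Cell<u32>, c: i32) -> i32 {
    calls.set(calls.get() + 1);
    c
}

// Python-style form: `&&` short-circuits, so `h` runs only when
// the first comparison holds.
fn chain_lazy(fa: i32, gb: i32, c: i32) -> (bool, u32) {
    let calls = Cell::new(0);
    let result = fa < gb && gb < h(&calls, c);
    (result, calls.get())
}

// Eager form: all operands are evaluated up front, so `h` always runs.
fn chain_eager(fa: i32, gb: i32, c: i32) -> (bool, u32) {
    let calls = Cell::new(0);
    let hc = h(&calls, c);
    let result = (fa < gb) & (gb < hc);
    (result, calls.get())
}

fn main() {
    // 5 < 2 is false: the lazy form never calls `h`, the eager form does.
    println!("{:?}", chain_lazy(5, 2, 3));
    println!("{:?}", chain_eager(5, 2, 3));
}
```

Both forms agree on the boolean result; they differ only in whether the trailing operand's side effects happen when an earlier comparison fails.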

I am in favor of this proposal. I always thought the Python chained comparators were pretty intuitively reasonable; I don't think it blows the complexity budget in that regard. The motivation (make some chained comparisons a little nicer) is not super pressing, so it probably mostly needs a motivated person to push it all the way through.

scottmcm · October 12, 2020, 12:41am

I've definitely done things like (a == b) != (c == d) in C#, because != is ^.
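For reference, the parenthesized form is already accepted by Rust today, since each side is a plain bool. A small sketch (function names are illustrative only) showing that != on bools behaves like ^:

```rust
// True when exactly one of the two pairs is equal.
fn exactly_one_pair_equal(a: i32, b: i32, c: i32, d: i32) -> bool {
    (a == b) != (c == d)
}

// The same check spelled with boolean XOR.
fn exactly_one_pair_equal_xor(a: i32, b: i32, c: i32, d: i32) -> bool {
    (a == b) ^ (c == d)
}

fn main() {
    println!("{}", exactly_one_pair_equal(1, 1, 2, 3));
    println!("{}", exactly_one_pair_equal_xor(1, 1, 2, 3));
}
```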

eminence · October 12, 2020, 12:50am

(Slightly off topic, but nice job to the compiler and team for providing an excellent diagnostic on this error)

tesuji · October 12, 2020, 2:20am

also cc https://github.com/rust-lang/rfcs/issues/2083

H2CO3 · October 12, 2020, 5:19am

I'm against this, especially with the short-circuit behavior:

In my experience, the problem of chained comparisons just doesn't come up. And even if it does, there's no significant inconvenience in writing a < b && b < c.
So far all the comparison operators have been eager. Making them conditionally lazy requires such a mental shift that I would consider it breaking (even though it technically may not be), and I definitely put it into the "too much clever terseness just for the sake of terseness" category.

kennytm · October 12, 2020, 7:06am

So far all the comparison operators have been eager. Making them conditionally lazy requires such a mental shift that I would consider it breaking (even though it technically may not be), and I definitely put it into the "too much clever terseness just for the sake of terseness" category.

How could it be breaking if chained comparison did not exist?

As demonstrated before, for Python and Raku (Perl 6), in a < b < c when a < b is false the next expression c will not be evaluated. After all, a < b && b < c won't evaluate b < c either when a < b is false. So always eagerly evaluating c is the mental-shifting breaking change.

There are other languages with chained operators too.

In Julia the evaluation order is undefined (bleh), e.g. a < b < c in Julia will first evaluate (eagerly) b, then (eagerly) a, and finally (lazily) c. This breaks the left-to-right order guarantee of Rust, but the conditional laziness is still there.

Scheme and Clojure's comparison operator can accept multiple values e.g. (< a b c) where all of a, b and c are evaluated eagerly. But < isn't a special operator in these Lispy languages and thus not really applicable to Rust.
