Wrapping_rem should allow modulo 0

It occasionally comes up that we want to take a number n (let's say it's a u64) and calculate n % m, where m can be any value from 1 to 2^64. Since 2^64 can't be represented in a u64, it would be convenient to store it as 0 and have that act like 2^64 when using the wrapping operations (0 already acts like 2^64 in wrapping_add, wrapping_sub, wrapping_mul, and others). Thus n.wrapping_rem(0) should be equal to n.

This is an example of what I mean:

fn main() {
    let small_number: u64 = 34;
    let big_number = u64::MAX.wrapping_add(1); // this is supposed to represent 2^64
    println!("{}", small_number.wrapping_rem(big_number)); // expected: 34
    // panic: attempt to calculate the remainder with a divisor of zero
}

I don't think this should be considered a particularly strange result. Wikipedia notes: "Some systems leave a modulo 0 undefined, though others define it as a."

Also, wrapping_rem is completely useless on unsigned types because it's exactly the same as the normal modulo. It would be nice if it could actually be made useful somehow, although I realize that changing its behaviour at this point is probably intractable. Maybe we'd need a new method...


I believe the story of the wrapping_… methods – and correspondingly the Wrapping<…>-wrapped number types – should be that they always behave the same as the wrapping behavior (i.e. the default choice of the release profile) of the corresponding operator.

I'd be wary of breaking this correspondence (but I'm also not 100% certain that it isn't already broken somewhere).


Indeed, such a change could be dangerous. Doing a rem with s.len() is a common/reasonable approach to ensure an index ends up in-bounds, so unsafe code can very reasonably depend on this property for soundness. (Whether usage of wrapping_rem – on a usize – is all that reasonable for this is perhaps a different question.)
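
For illustration, here is a hypothetical pick function relying on exactly that property (a sketch, not code from any real crate):

fn pick(s: &[u8], i: usize) -> u8 {
    // `%` guarantees idx < s.len() (panicking if the slice is empty),
    // so the unchecked access below is sound. If wrapping_rem(0)
    // returned the LHS instead, an empty slice would read out of
    // bounds here rather than panicking.
    let idx = i.wrapping_rem(s.len());
    unsafe { *s.get_unchecked(idx) }
}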


I'd be wary of changing rem to be inconsistent with div, and then I'd be wary of changing wrapping_div(0) to return 0. I wouldn't say it's completely unreasonable, but it does give me pause.

@steffahn I would argue that doing wrapping_anything alongside indexing is somewhat nonsensical because wrapping operations, by their very nature, mean that you're comfortable with your values jumping around near usize::MAX. But it would be interesting to see whether anyone is actually relying on the value of wrapping_rem(0) in unsafe code.

@jrose Yeah, I agree that wrapping_div(0) should return zero as well although I think division by zero is a bit more controversial than modulo zero.

By the way, if we do add new methods, I propose the names total_div and total_rem since the goal is for them to be total functions.
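
For concreteness, a sketch of what those hypothetical methods could look like (nothing like this exists in std; the names are just the proposal above):

fn total_rem(n: u64, m: u64) -> u64 {
    // Treat a divisor of 0 as 2^64; every u64 is smaller than that,
    // so the remainder is just n.
    if m == 0 { n } else { n % m }
}

fn total_div(n: u64, m: u64) -> u64 {
    // Under the same interpretation, n / 2^64 is 0 for any u64 n.
    if m == 0 { 0 } else { n / m }
}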


Actually, wrapping_div(0) could be anything, since a = q * 0 + a holds for any legal number q.

No matter what wrapping_div(0) returns, a.wrapping_rem(0) == a is a consistent result.

Fair point. I was still thinking of the idea of explaining that all the "wrapping" operators act as if 0 were the size of the type, in which case we'd have x / 2^something, which could only be 0. But that isn't strictly necessary for deciding that wrapping_div(0) should return 0 and also be consistent with wrapping_rem in the quotient&remainder sense.

It's worth noting that n / 0 == 0 has precedent in some other languages including Pony, Lean, and Coq.

IMO that is just a hack for languages that require functions to be total but, for convenience reasons, don't want to use the equivalent of Option<i64> as the return value for divisions.


I'm pretty sure this won't happen. If you want to do this, make your own method for it.

The biggest reason is that a very similar conversation happened as part of stabilizing ilog2: What should 0.ilog2() do in release mode?

There was a long conversation about potentially having that be -1 as _, which is a very natural result for it to give under the simple "well, u32::ilog2 is just 31 - leading_zeros" implementation.
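
That implementation looks something like this (a sketch, not the actual std code):

fn ilog2_simple(x: u32) -> u32 {
    // For x == 0, leading_zeros() is 32, so 31 - 32 wraps around
    // to u32::MAX, i.e. -1 as u32.
    31u32.wrapping_sub(x.leading_zeros())
}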

But in the end, it was decided that wrapping results should only exist when there's a defined infinite-precision result. Wrapping an infinite-precision 257 to a finite 1_u8 is fine. But there's no coherent way to "wrap" -∞ into an integer type, thus 0.ilog2() and -2 / 0 both always panic.
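
In other words, wrapping is only defined where there's a mathematical value to reduce modulo 2^BITS:

fn main() {
    // Infinite-precision 255 + 2 = 257, and 257 mod 2^8 = 1.
    assert_eq!(255u8.wrapping_add(2), 1);
    // There is no infinite-precision value for 0_u32.ilog2() or for
    // (-2) / 0 to reduce, so both panic even in release builds.
}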


And for % specifically, I think returning the LHS is particularly wrong because it's inconsistent with what would happen for a hypothetical mixed-type % that I wish we could provide. If you do uNN % uMM, the obvious return type is uMM, but then you can't just return the LHS if the RHS is zero.

I'm also unwilling to weaken postconditions like this. For example, there's a discussion about whether to just define slice.as_chunks::<0>() as ([], slice). But that's horrible to actually use, because you want to give a postcondition of tail.len() < N as the obvious description of what that method's doing, but that implementation for zero violates it, in exactly the same kind of way that a % 0 => a violates the expected postcondition of %.
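
To illustrate that postcondition (as_chunks was an unstable API when this was written; the shapes shown follow its documentation):

fn main() {
    let (chunks, tail) = [1u8, 2, 3, 4, 5].as_chunks::<2>();
    assert_eq!(chunks, &[[1, 2], [3, 4]]);
    assert!(tail.len() < 2); // the useful postcondition: tail.len() < N
    // Defining as_chunks::<0>() as ([], slice) would give
    // tail.len() == slice.len(), violating tail.len() < N in the same
    // way that a % 0 => a violates a % b < b.
}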


I don't have a strong opinion on the ilog2 decision either way, but this reasoning seems wrong to me. The only reason that arithmetic ops were chosen to wrap in release mode was performance. There's nothing "more correct" about wrapping in release compared to any other behavior; it's a logic error no matter what, and if you actually want wrapping semantics you should use the dedicated methods or the wrapper type. So I don't see the value in consistency.


I think I agree with this. However, even if 0 were allowed, there would still be a postcondition that is not much looser: tail.len() <= N.wrapping_sub(1).

While I agree that if you want wrapping you should use the dedicated APIs for that, it's not true that all the other behaviours are equal.

Most importantly, x + 1 and x - 2 + 3 are equivalent in wrapping -- but not in certain other modes, like saturating -- so wrapping is a particularly good choice in that it can often give the correct final answer even if the intermediate operations are outside the supported range.

That's also a reason for division to differ from other operations. x * 2 - 2 and (x - 1) * 2 work the same in wrapping, but (x + 2) / 2 and x / 2 + 1 don't.
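
A concrete u8 demonstration of that asymmetry:

fn main() {
    let x: u8 = 255;
    // Equivalent under wrapping: multiplication and subtraction
    // commute with reduction mod 2^8.
    assert_eq!(x.wrapping_mul(2).wrapping_sub(2),
               x.wrapping_sub(1).wrapping_mul(2));
    // Not equivalent: the wrapped intermediate changes the quotient.
    assert_ne!(x.wrapping_add(2) / 2, x / 2 + 1); // 0 vs. 128
}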


The most important property of modulo is that a = (a / b) * b + a % b. If a / 0 is defined as giving any finite value, then a % 0 should always be a. Regarding ilog2(0), I'd honestly say that it's ±∞. With the real integers, it approaches -∞; however, in a 2-adic sense, 2^N gets closer to 0 as N approaches ∞ (if N is larger than the number of stored bits, then the number is indistinguishable from / congruent to 0), so you can argue that the logarithm should also be +∞.

Really it's this interpretation of 0 as "a sufficiently large 2^N" that makes a % 0 = a seem more reasonable. The same argument would then imply that a / 0 = 0, since 0 is effectively larger in magnitude than any representable nonzero number (but is still neither positive nor negative) and so the quotient would always be smaller in magnitude than 1.

This is definitely not the real integer system that most people were taught, but it is still completely mathematically valid and satisfies the rules that integer division should preserve.

I'll also mention that RISC-V defines a / 0 = -1, for both signed and unsigned a, since it's the closest unsigned value to ∞, and it tends to be the natural output when trying to get a division circuit to compute a / 0.

Note that I don't consider "what the hardware does" to be at all a justification for what Rust should do.

IMHO the behaviour of << is just a mistake, for example. The thing that Rust does should be the "right" thing (which I would typically define as "the thing that gets the same result as if it had been computed in infinite precision", and importantly not something where doing it in a wider type gives something very different), and if there's some slightly-strange behaviour that's more efficient or situationally useful, that's what specific other methods are for.

I don't even like leading_zeros, because (my_u32 as u64).leading_zeros() as u32 isn't the same as my_u32.leading_zeros(). Thankfully we have ilog2 now, which avoids that problem.
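
Concretely:

fn main() {
    let my_u32: u32 = 1;
    // leading_zeros depends on the bit-width of the type:
    assert_eq!(my_u32.leading_zeros(), 31);
    assert_eq!((my_u32 as u64).leading_zeros(), 63);
    // ilog2 does not (both calls return 0 here):
    assert_eq!(my_u32.ilog2(), (my_u32 as u64).ilog2());
}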


What behavior would you prefer?

I'd prefer the one that unbounded_sh[rl] uses.

my_u32.unbounded_shl(n) and (my_u32 as u64).unbounded_shl(n) as u32 do the same thing.

Then if someone really wants x << (n % BITS) (I don't know why they would, tbh), they can always just write that.
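
For example (unbounded_shl was unstable at the time; the behaviour shown follows its documentation):

fn main() {
    let my_u32: u32 = 1;
    // Shifting by >= 32 gives 0 instead of masking the shift amount:
    assert_eq!(my_u32.unbounded_shl(40), 0);
    // So widening first and truncating after agrees with shifting directly:
    assert_eq!((my_u32 as u64).unbounded_shl(40) as u32,
               my_u32.unbounded_shl(40));
}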


Exactly. I'd like to see this changed in a future edition, but it's going to be difficult without subtly breaking code.

The current behaviour is similar to other arithmetic in that shifting by an out-of-range value is considered an overflow, and panics in debug mode. I expect this helps find bugs in the common case where the shift amount should be in 0..BITS. If it was an unbounded shift, would you have negative amounts shift in the other direction, or alternatively not implement the operator for signed types at all?
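
For example:

fn main() {
    let x: u32 = 1;
    let n = 33; // out of range for a u32 shift
    // Debug builds panic: "attempt to shift left with overflow".
    // Release builds (with overflow checks off) mask the shift amount,
    // giving 1 << (33 % 32) == 2.
    println!("{}", x << n);
}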

IMO, the biggest issue is just not having the convenience of unbounded_{shl,shr} sooner (nor yet, on stable).

Many other operations are fundamentally tied to the bit-width of the type: rotate_{left,right}, reverse_bits, {leading,trailing,count}_{zeros,ones} (consider 0 or negative inputs). But I can agree that for the cases where ilog2 is semantically appropriate, it's a big improvement over writing it in terms of leading_zeros.


The common case is often 0..=BITS, inclusive. Examples:

  • Implement a function that returns a bitmask with the n lowest bits set. You want it to work both for 0 and for BITS (see the sketch after this list).
  • Implement a function that shifts a multi-word bitmask by n. When n is a multiple of BITS, you'll need a bitshift by BITS or a special case.
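
A sketch of the first example, where unbounded_shl makes both edge cases fall out naturally (low_bits_mask is a hypothetical name):

fn low_bits_mask(n: u32) -> u32 {
    // n == 32 shifts every bit out, so the inversion yields all ones;
    // n == 0 shifts nothing out, yielding 0. No special cases needed.
    !u32::MAX.unbounded_shl(n)
}

fn main() {
    assert_eq!(low_bits_mask(0), 0);
    assert_eq!(low_bits_mask(3), 0b111);
    assert_eq!(low_bits_mask(32), u32::MAX);
}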