Isn't the const f64::MANTISSA_DIGITS wrongly set to 53 instead of 52?

vri · December 4, 2023, 7:55pm

Hello,

Isn't the const f64::MANTISSA_DIGITS wrong?

I ask because the Microsoft docs show it as 52, and even NumPy reports it as 52:

 import numpy as np
 assert np.finfo('float64').nmant == 52

Before I raise a PR for the above, I wanted to confirm whether it was intentional.

bill_myers · December 4, 2023, 8:06pm

There are several alternative ways to define the mantissa bits (with/without the implicit leading 1 and sign bit), and the only important thing is that the choice is consistent between f32 and f64, which it is.

The constant value obviously cannot be changed for compatibility reasons, so there probably isn't much point in defining other flavors, since you can obtain them by adding or subtracting 1 and the number of mantissa bits is only useful in very niche cases.

Also, the Rust values are the ones that are listed most prominently on Wikipedia, so not an unreasonable choice.

However, I think the documentation can be improved to make it clear that it's a value that includes either the implicit one bit or the sign bit.
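To make the "add or subtract 1" point concrete, here is a minimal sketch. The function names are hypothetical helpers, not std APIs. It assumes the constant counts the implicit leading 1, so the stored field is one bit narrower:

```rust
/// Number of fraction bits physically stored in an f64
/// (the "52" that NumPy reports as `nmant`).
/// Hypothetical helper, not part of std.
pub const fn stored_mantissa_bits_f64() -> u32 {
    // 53 digits of precision minus the implicit leading 1
    f64::MANTISSA_DIGITS - 1
}

/// Same idea for f32: 24 digits of precision, 23 stored bits.
/// Hypothetical helper, not part of std.
pub const fn stored_mantissa_bits_f32() -> u32 {
    f32::MANTISSA_DIGITS - 1
}
```

Both flavors are one subtraction apart, which is why a second constant would add little.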

quinedot · December 4, 2023, 8:08pm

f32::MAX_EXP and f64::MAX_EXP are documented incorrectly, and other associated constant woes

opened 10:37PM - 07 Sep 21 UTC

orlp

T-libs-api A-floating-point

`f32::MAX_EXP` and `f64::MAX_EXP` are documented as

> Maximum possible power …of 2 exponent.

This is straight up not true. They are respectively defined as `128` and `1024`, which is actually one *above* the maximum possible exponent of `f32` and `f64`. The doc comment needs to be changed.

---

In general I feel the set of associated constants for the floating point types are questionable, and should be a candidate for deprecation and replacement by a better set of constants. It is painfully obvious that the constants were copied from C's `<float.h>`, with little regard of whether these constants are useful or well-named.

I have no qualms with `MIN`, `MAX`, `NAN`, `INFINITY`, and `NEG_INFINITY` at all. They are sane and useful. However, the following set of constants are carbon copied from [`<float.h>`](https://en.cppreference.com/w/c/types/limits#Limits_of_floating_point_types):

    FLT_RADIX      = 2            => f32::RADIX
    FLT_MIN        = 1.175494e-38 => f32::MIN_POSITIVE
    FLT_EPSILON    = 1.192093e-07 => f32::EPSILON
    FLT_DIG        = 6            => f32::DIGITS
    FLT_MANT_DIG   = 24           => f32::MANTISSA_DIGITS
    FLT_MIN_EXP    = -125         => f32::MIN_EXP
    FLT_MIN_10_EXP = -37          => f32::MIN_10_EXP
    FLT_MAX_EXP    = 128          => f32::MAX_EXP
    FLT_MAX_10_EXP = 38           => f32::MAX_10_EXP

Going over them one by one (`f64` is entirely analogous):

- `f32::RADIX` is just plain useless. It's always 2, Rust has no support for non-binary floating point.
- `f32::MIN_POSITIVE` is badly named, because it's actually the smallest positive *normal* number. This is a useful constant, but the name is unacceptable in my opinion.
- `f32::EPSILON` is somewhat badly named (`MACHINE_EPSILON` would be better), and slightly deceptive. However this is not necessarily the fault of the constant, but due to people misunderstanding what machine epsilon means. Should my [RFC for `next_up`/`next_down`](https://github.com/rust-lang/rfcs/pull/3173) get merged, this would make this constant unnecessary. Especially if we make a [`ulp`](https://en.wikipedia.org/wiki/Unit_in_the_last_place) method in the future.
- `f32::DIGITS` is "the approximate number of significant digits in base 10". I don't know when you'd ever need this constant, or what 'approximate' here means at all. The constant is also deceptive, because one might interpret this as an upper bound on the number of digits needed to represent a `f32`.
- `f32::MANTISSA_DIGITS` includes the implied 1. Thus it is off by one from the constant you almost always want when explicitly working with a mantissa in code: the number of *bits* that the mantissa is wide.
- `f32::MIN_EXP`... I think the doc comment speaks for itself: "One greater than the minimum possible normal power of 2 exponent.". Not only is it off by one, it also ignores denormal floats.
- `f32::MAX_EXP`, see start of this issue.
- `f32::MIN_10_EXP`, also ignores denormal floats.
- `f32::MAX_10_EXP`, sanely defined but also fairly useless since you can compute `f32::MAX.log10().floor()` if you really wanted to know this.

I honestly believe the best way forward is to deprecate all of the above constants and replace them with a couple fundamental, sane and conservative constants. For example for `f32`:

```rust
const EXPONENT_BIAS: i32 = 127;
const EXPONENT_WIDTH: i32 = 8;
const MANTISSA_WIDTH: i32 = 23;
const MIN_POS_NORMAL: f32 = f32::from_bits(1 << f32::MANTISSA_WIDTH);
```
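The off-by-one claims in the quoted issue can be checked directly from the bit patterns. A minimal sketch, assuming the IEEE 754 binary32 layout (8 exponent bits, bias 127, 23 stored fraction bits); `unbiased_exponent` is a hypothetical helper, not a std function:

```rust
/// For a normal f32, returns e such that |x| = m * 2^e with 1 <= m < 2.
/// Hypothetical helper; the result is not meaningful for zero,
/// subnormals, infinities, or NaN.
pub fn unbiased_exponent(x: f32) -> i32 {
    // Bits 23..=30 hold the biased exponent; the bias is 127.
    let biased = ((x.to_bits() >> 23) & 0xff) as i32;
    biased - 127
}

/// Smallest positive normal f32, built the way the issue proposes:
/// exponent field 1, fraction field 0.
pub fn min_pos_normal() -> f32 {
    f32::from_bits(1 << 23)
}

/// Smallest positive subnormal f32: exponent field 0, fraction field 1.
pub fn min_pos_subnormal() -> f32 {
    f32::from_bits(1)
}
```

With these, `unbiased_exponent(f32::MAX)` is 127 while `f32::MAX_EXP` is 128, and `min_pos_normal()` equals `f32::MIN_POSITIVE` even though `min_pos_subnormal()` is a smaller positive value.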

ExpHP · December 4, 2023, 11:59pm

Wow! That docstring seriously needs a disclaimer.

Like, it isn't wrong. 53 is certainly the number of significant digits in the floating point value. But this is such an unconventional way to define this constant.

eggyal · December 5, 2023, 12:06am

It's interesting that the libs team approved deprecation of DIGITS in "deprecate f{32,64}::DIGITS" (rust-lang/rust pull request #89238, by workingjubilee), potentially with a view to deprecating (some of) the other constants, but it looks like it didn't get pushed through?

tczajka · December 5, 2023, 9:59am

I think 53 is the most reasonable value.

Imagine you had a format that was always 1 × 2ⁿ. Would you call it 1 bit of precision or 0 bits of precision? I think 1 is better because there is some precision there.

The IEEE 754-2008 standard says 53.
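One way to see the 53 bits of precision in practice is to look at which integers survive a round trip through f64. A small sketch; `is_exact_in_f64` and `first_inexact_integer` are hypothetical helper names:

```rust
/// True if `n` converts to f64 and back without changing value.
/// The comparison is done in u128 so that values rounding up to 2^64
/// are not hidden by the saturating float-to-u64 cast.
pub fn is_exact_in_f64(n: u64) -> bool {
    (n as f64) as u128 == n as u128
}

/// The first positive integer that f64 cannot represent exactly:
/// it needs MANTISSA_DIGITS + 1 significant bits.
pub fn first_inexact_integer() -> u64 {
    (1u64 << f64::MANTISSA_DIGITS) + 1
}
```

Every integer with at most 53 significant bits round-trips, and 2^53 + 1, which needs 54, does not.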

ExpHP · December 5, 2023, 12:09pm

But it's not called PRECISION, it's called MANTISSA_DIGITS!

Alright, so. Any set of constants that includes something called MANTISSA_DIGITS would almost certainly also include an EXPONENT_DIGITS. Clearly, the word "exponent" in this name must be referring to the representation of the exponent in the binary encoding (i.e. the biased integer), so why would the word "mantissa" not also refer to the representation of the mantissa?

@ExpHP read over his finished post. It looked nice, but something was missing.

Ah! Yes! We should link EXPONENT_DIGITS to the corresponding const in rust!

.......wait. That's weird. There isn't one.

Well surely my theory must be correct about other languages. Let's try C++!

.......omg C++ defines them just like rust.

...

farnz · December 5, 2023, 4:28pm

It depends which field from IEEE 754-2008 Table 3.5 you think matters. For binary formats, IEEE 754 uses a "sentinel" exponent value to indicate that the first digit of the significand is 0, otherwise it's always 1. There's a second sentinel used to indicate that this is either an infinity or a NaN, depending on the remaining bits of the number.

In this case, f64::MANTISSA_DIGITS corresponds to p (precision in bits) in the table, rather than t (trailing significand field width in bits). It's also the number of bits in the mantissa in the abstract, as opposed to the number of bits of the mantissa that get stored in the binary format, because of the encoding trick with the exponent that gives you one "hidden" bit of mantissa that's implied rather than stored. But given that the encoding trick is known to simply allow you to encode one bit of the abstract mantissa in the exponent field, getting t given that you have p is trivial.
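The relationship between p and t can be sketched by decoding an f64 by hand. This assumes the binary64 layout (1 sign bit, 11 exponent bits, 52 trailing significand bits); `TRAILING_BITS`, `full_significand`, and `bit_length` are hypothetical names, not std items:

```rust
/// t: width of the trailing significand field actually stored in an f64.
pub const TRAILING_BITS: u32 = 52;

/// Returns the abstract significand as an integer, with the hidden bit
/// restored for normal numbers. `None` for infinities and NaN.
pub fn full_significand(x: f64) -> Option<u64> {
    let bits = x.to_bits();
    let exp_field = (bits >> TRAILING_BITS) & 0x7ff;
    let frac = bits & ((1u64 << TRAILING_BITS) - 1);
    match exp_field {
        0x7ff => None,                             // sentinel: infinity or NaN
        0 => Some(frac),                           // sentinel: leading digit is 0
        _ => Some(frac | (1u64 << TRAILING_BITS)), // normal: implicit leading 1
    }
}

/// Number of bits needed to write `n` in binary (0 for n == 0).
pub fn bit_length(n: u64) -> u32 {
    64 - n.leading_zeros()
}
```

For any normal value the restored significand is exactly `f64::MANTISSA_DIGITS` bits long, that is, p = t + 1. Subnormals come out shorter because their leading digit is 0.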

simonbuchan · December 6, 2023, 4:06am

I've never been so disturbed by a philosophy of numbers question.

It seems that the answer has to be 0 because n has to be the exponent, and you learn nothing by looking at the value but the exponent, or equivalently, the concrete representation would only have bits for n; but on the other hand, every value can be approximated to within one bit by such a format (assuming unbounded n is permitted), so it also seems it has to be 1!

I think the issue is that in decimal scientific notation the mantissa can be in [1,10), specifically excluding [0,1). With p digits of precision in base b, this gives you (b - 1)·b^(p-1) possible values, which most of the time can be treated as roughly b^p, but as that count approaches 1 you have to start distinguishing bits of information in the mantissa from digits of precision.

For example, with one digit of precision in base 10 you have only 9 possible values in the mantissa with a little over three bits of information, in base 9 you have 8 possible values with three bits of information, or (base - 1) possible values with log_2(base - 1) bits of information in general. At the limit of base 2, there's only one value, which is 0 bits of information.
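The counts in these examples can be reproduced with a few lines. A sketch assuming normalised mantissas whose leading digit is nonzero, so that p digits in base b give (b - 1)·b^(p-1) distinct values; both function names are hypothetical:

```rust
/// Number of distinct normalised mantissas with `digits` digits in `base`.
/// Assumes base >= 2 and digits >= 1; the leading digit has (base - 1)
/// choices and each remaining digit has `base` choices.
pub fn mantissa_values(base: u64, digits: u32) -> u64 {
    (base - 1) * base.pow(digits - 1)
}

/// Bits of information carried by such a mantissa.
pub fn info_bits(base: u64, digits: u32) -> f64 {
    (mantissa_values(base, digits) as f64).log2()
}
```

Applied to f64, `mantissa_values(2, 53)` is 2^52: 53 digits of precision carry 52 bits of information, which is exactly the 53-versus-52 split this thread is about.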

By this, the conventional meaning of "digits of precision" seems like it should include the leading 1.

But I hate it.

tczajka · December 6, 2023, 12:19pm

The interpretation that this is the size of a bitfield would be more reasonable if the constant was called MANTISSA_BITS rather than MANTISSA_DIGITS. With "digits" it's talking about the abstract properties of a number rather than its representation in memory. With "bits" it would be more ambiguous.

You could imagine a function f64::to_mantissa_exponent(self) -> Option<(i64, i16)> that includes the implicit bit in the mantissa. Option because of infinities and NaN.

BTW I think it would actually be a useful method to add (and its inverse). The exponent in the return value should be such that (a, b) represents a * 2^b.
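A rough sketch of what such a function might look like, written as a free function because no such method exists in std. It assumes the binary64 layout, and the handling of subnormals and of negative zero is one possible choice among several:

```rust
/// Hypothetical sketch of the proposed method, not a std API.
/// Returns (a, b) with x == a * 2^b, where |a| includes the implicit bit.
/// Returns None for infinities and NaN.
pub fn to_mantissa_exponent(x: f64) -> Option<(i64, i16)> {
    let bits = x.to_bits();
    let negative = (bits >> 63) != 0;
    let exp_field = ((bits >> 52) & 0x7ff) as i16;
    let frac = (bits & ((1u64 << 52) - 1)) as i64;
    let (mantissa, exponent) = match exp_field {
        0x7ff => return None,
        // Zero and subnormals: no implicit bit, fixed exponent.
        0 => (frac, -1074),
        // Normals: restore the implicit bit; 1075 = bias 1023 + 52 fraction bits.
        e => (frac | (1i64 << 52), e - 1075),
    };
    // Note: -0.0 maps to (0, -1074), so the sign of zero is lost.
    Some((if negative { -mantissa } else { mantissa }, exponent))
}
```

For example, 0.75 decodes to (3 · 2^51, -53), since 3 · 2^51 · 2^-53 = 3/4. The inverse is left out here; it would need care around rounding and overflow.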

system · March 5, 2024, 12:19pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.
