Pre-RFC: Hex float literals

dobenour · September 8, 2017, 11:36pm

C allows hex float literals (of the form 0xffffff.fp0). These literals, have some uses – for example, they are guaranteed to map to an exact, obvious bit pattern.

However, Rust does not allow such literals. It has decent reasons for doing so (see issue#1433). In particular, they vastly complicate parsing: there is an ambiguity between a hex float literal and a field and/or method off of an integer. This is not, strictly speaking, ambiguous: integers have no fields, and trying to call an integer as a function will be rejected by the type-checker. So one can disambiguate 0xFFFFFFF.Fp3 as being a float if and only if it is not followed by an open parentheses. That, however, is very clumsy, and we really really don’t want to need to do that in the lexer.

A much simpler approach is to require the + or - sign in the exponent. 0xFF.Fp-1 is completely unambiguous: since integers have no fields, it will always be rejected by the typechecker, regardless of its context. As such, it is 100% backwards compatible (except to the writers of syntax extensions) to make 0xFF.Fp-1 a single lexeme.

Finally, I propose that the float parsing functions in the stdlib be modified to accept all forms accepted by strtod. The lexer can reject the ones that are invalid in Rust source trivially.

toc · September 9, 2017, 1:01am

It also wouldn’t be unreasonable, if fields for integers are added later, to require they be used like (123).field. Python e.g. does this.

hanna-kruppe · September 9, 2017, 8:20am

The hexf crate implements a procedural macro such that one can write hexf!("0xFF.Fp1") and have it evaluated at compile time. It also includes run-time parsing functions. While this is obviously a bit less nice than having literals in the language, it does raise the bar for changing the language. Personally, while I would have liked hex float support in the language, I believe the crate is good enough that it’s not worthwhile to try and change the language.

dobenour · September 10, 2017, 4:21am

True, but procedural macros are likely to be unstable for quite a while, whereas this is such a trivial change in the lexer that it can be stabilized very quickly – I suggest making waiting at most one release.

The only difficulty is that the lexer now must backtrack, but that should not be too difficult.

hanna-kruppe · September 10, 2017, 8:43am

Sorry, I should have been more specific. The crate uses macros 1.1, not old-style procedural macros, so it works on stable (since several releases ago).

system · March 25, 2019, 8:28am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Hex float literals (alternative syntax) language design	6	1568	May 1, 2021
Idea: In the next edition, stop accepting `0.` as a valid float literal	32	3926	January 15, 2020
[Pre-RFC] Hex literals language design	6	2854	March 25, 2019
"Raw" hex and binary literals that adhere to IEE 754-2008 for floats language design	13	1161	June 3, 2023
Pre-RFC: ergonomics around NonZeroU* and literals language design	11	1702	March 25, 2019

Pre-RFC: Hex float literals

Related topics