New API: `char::is_ascii_octdigit`?

nerdypepper · August 31, 2022, 1:37pm

Is there a reason why char::is_ascii_digit and char::is_ascii_hexdigit exist, whereas there is no shorthand to check if a character is a valid octal? While it is as simple as matches!(c, '0'..='7'), I would prefer to have the proposed API because:

it is a lot more ergonomic in point-free form, i.e.: my_string.chars().filter(char::is_ascii_octdigit) as compared to my_string.chars().filter(|c| matches!(c, '0'..='7')) or my_string.chars().filter(|c| '0'..='7'.contains(c))
the intent is a lot more obvious
completeness, I just think it fills a gap(?) in the list of char functions

eggyal · August 31, 2022, 3:08pm

There's always c.is_digit(8) whose intent is perhaps a little more obvious than using explicit ranges.

eggyal · August 31, 2022, 3:13pm

And if the libs API is to be expanded, it should probably now use const generics to be more general eg fn is_digit_const<const BASE: u32>(self) -> bool which could then be invoked in "point-free" form as eg my_string.chars().filter(char::is_digit_const::<8>).

jrose · August 31, 2022, 7:39pm

I’m curious what you’re using octal for in 2022!

quinedot · August 31, 2022, 7:41pm

Can't speak for the OP, but chmod for example still takes octal here in the year 03746.

mathstuf · August 31, 2022, 8:09pm

I've used it when parsing git ls-tree output.

eggyal · August 31, 2022, 8:27pm

I myself happened to use it earlier today to represent bit lengths, because the second position onward represent the number of bytes and the units represent offset into the final byte.

tczajka · August 31, 2022, 8:42pm

Yeah octal is perfect for thinking about memory.

On x86-64 linux, in octal:

1 byte         = 10 bits
1 machine word = 100 bits
1 cache line   = 1000 bits
1 page         = 100000 bits
1 huge page    = 100000000 bits
1 huger page   = 100000000000 bits

nerdypepper · September 1, 2022, 11:22am

as @quinedot mentioned, i realized this method was not present when i was validating a string containing file permissions!

josh · September 1, 2022, 1:06pm

I would be happy to merge a PR adding this method (as unstable, with a tracking issue). As several people in this thread observed, it does still come up in the context of file modes.

nerdypepper · September 1, 2022, 2:52pm

thanks, i'd love to give that a shot!

fren_gor · September 5, 2022, 2:59pm

It would be great to have both a const generic version and char::is_ascii_octdigit as a special case since it's so useful.

system · December 4, 2022, 2:59pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
String -> Octal parsing language design	4	1188	March 17, 2023
Wild idea: deprecating APIs that conflate str and [u8] libs	59	3548	November 12, 2020
Feature Request: Utf8 prefix inspection for u8 language design	6	568	March 26, 2024
Fn char::as_ascii(self) -> Option<u8>	4	721	April 20, 2020
Pre-RFC: String from ASCII (not allowing UTF-8) libs	16	2340	August 8, 2021

New API: `char::is_ascii_octdigit`?

Related topics