To_upper speed

gilescope · February 2, 2021, 3:09pm

Whatever we do for the general case, I don't think there's anything that's going to be faster than a less than check on the byte being under 128. Given that spaces, line feeds and control chars are in that list is there any reason not to go ahead with the initial PR while we try and iterate on the general case. (It's a pretty significant speedup for one additional instruction)

zackw · February 2, 2021, 4:33pm

I don't see why not. It can always be changed again later.

bluss · February 2, 2021, 4:55pm

The only challenge is to have varied enough benchmarks or use cases to justify the special case. (Meta-comment, but that's why win-win PRs are so cool - if you can show you improve every use case, then your PR becomes irresistible, and there are not so many hard decisions about tradeoffs.)

For this particular PR, there are only benchmarks for the happy case - what about other text?

miccah · February 3, 2021, 4:34am

I've updated the PR with two more benchmarks (strictly ASCII / non-ASCII characters) and added a comment with the before / after results: Add a check for ASCII characters in to_upper and to_lower by mcastorina · Pull Request #81358 · rust-lang/rust · GitHub

As expected, the non-ASCII case performs the same. The all ASCII case gets a speedup of ~200,000 ns/iter.

gilescope · February 3, 2021, 7:03am

Excellent. The replacement char \u{FFFD} is going to appear a fair bit if people do String::utf8_from_lossy(). I'm assuming that the replacement char isn't going to change case at all...?

system · May 4, 2021, 7:04am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pre-pre-RFC: Support `write_uppercase(&self, &mut String)` libs	14	1583	May 29, 2022
Case Insensitive UTF-8 Comparison libs	11	5037	January 12, 2024
Benchmark for std::str::from_utf8()? libs	4	1126	March 25, 2019
Why's char not an utf8mb4? language design	18	2043	August 13, 2021
ASCII methods for u16	17	2848	April 11, 2021

To_upper speed

Related topics