[Pre-RFC] Change the ExactSizeIterator spec

stebalien · February 24, 2015, 4:26am

Currently, the ExactSizeIterator spec specifies that Iterator::size_hint must return (exact_size, Some(exact_size).

Iterator::size_hint must return the exact size of the iterator. Note that the size must fit in usize.

However, it also specifies that its own ExactSizeIterator::len method must also return exact_size.

I propose that the requirement on Iterator::size_hint be removed because it:

Is redundant.
Imposes a requirement on a method provided by a different trait.
Makes it impossible to implement an iterator that can quickly give an approximate size_hint but can take longer to give an exact len.

mzabaluev · February 24, 2015, 9:06am

Making one trait implementation impose additional requirements on another is not new, we have Eq and PartialEq.

size_hint is useful for generic code that should work if the exact size is not known, but needs a reasonable estimate for e.g. reserving capacity in a collecting container.

As for 3, if the cost of calculating the size is not (amortized) constant, perhaps the iterator should not implement ExactSizeIterator at all. To take the opposite to extremes, any cloneable or bidirectional iterator might implement ExactSizeIterator, but the performance of generic code using that as a bound would be anyone’s guess.

stebalien · February 24, 2015, 2:21pm

Good point.

I'm not suggesting getting rid of size_hint.

However, one could reasonably design an algorithm that can compute an exact size in log(n) time but give a estimate a bound in constant time (restrict n <= c). Generic code that just needs a reasonable estimate for allocation could use the constant time method but code that absolutely needs to know an exact size could call len() to get an exact size. I don't know of any algorithms like this but I feel that needlessly restricting the API is a bad idea.

Essentially, I argue that this constraint should be removed because it serves no purpose and could restrict future APIs.

bluss · February 24, 2015, 2:36pm

The use case it was introduced for, which is .enumerate().rev() – does not make sense for the scenario of an expensive version of length calculation.

Does ExactSizeIterator have a real purpose? Counter-proposal: Remove ExactSizeIterator.

stebalien · February 24, 2015, 8:23pm

A logarithmic length calculation definitely makes sense for iter.enumerate().rev().take(n) where n << iter.len().

How is this not a real purpose for ExactSizeIterator?

bluss · February 24, 2015, 8:56pm

It’s kind of a gotcha to have non-constant time .len(), but logarithmic isn’t so bad.

system · March 25, 2019, 8:23am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
`ExactSizeIterator` and combinators libs	5	542	September 24, 2022
The Trusted Iterator Length Problem	29	7245	March 25, 2019
`size_hint`, correctness, reproducibility and documentation libs	6	2215	March 25, 2019
ExactSizeIntoIterator trait language design	7	353	October 3, 2024
Should we implement ExactSizeIterator for Chain? libs	10	2259	September 17, 2020

[Pre-RFC] Change the ExactSizeIterator spec

Related topics