Pre-RFC: Stabilize `#[bench]`, `Bencher` and `black_box`

alkis · October 27, 2017, 10:56pm

Is there an rfc/repo/thread for discussing what should end up in standard lib?

bluss · October 28, 2017, 9:29am

but black_box isn’t even the important part. Benchmarks should avoid relying on it, it also prevents some wanted optimizations. Most benchmarks can use just the black_box invocation that the bencher library does already on the value returned by the closure given to Bencher::iter.

llogiq · October 28, 2017, 9:49am

I mostly agree. Alas, most benchmarks are not enough, and it often takes some detailed analysis to find out what optimizations were applied, and if those are in the spirit of the benchmark. Sometimes it is easier to test a theory by applying black_box in the same location in multiple benchmarks and see the relative performance change.

alkis · November 2, 2017, 2:25pm

There a few things missing from Bencher that are essential for a bare bones micro benchmarking library.

ability to tell the bencher that the closure that is going to run involves N iterations (or ops). This makes the output of the benchmark much easier to compare. A motivating example is benchmarking a sort function: we want to benchmark sorting arrays that fit in l1, l2, l3 caches but what we want to report is how much time we spent to sort per element. This is much more meaningful number than reporting how much time it takes to sort the array that fits in l1 cache.
any kind of memory usage stats:
- the first and easy number to track is number of allocations per iteration
- another number is peak memory usage

NOTE: Unlike CPU time which can be normalized by dividing by number of ops, I don’t have a good suggestion on how to do the same for memory usage unless we let the bencher know the number of elements.

value and type parametrization. Right now this is done through macros in a very inside-out way. An example is here: https://github.com/jonhoo/ordsearch/blob/master/src/lib.rs. A lot of this would be easier if the bencher (or #[bench] attribute) allowed parametrization. Parametrization can take two forms:
- type parametrization: I want to run the same benchmark for HashMap, MyHashMap, ThisOtherHashMap.
- value parametrization: I want to run the same benchmark for the combination of N different populations and K different element sizes.

It would be nice if type and value parametrizations can be composed: I want to run the same benchmark for the combination of N populations and K different element sizes across L different implementations of the container and D different distributions (uniform, zipf, whatever).

Some of the above can be implemented on top of the current API with enough macros but I think they are so important in writing benchmarks that warrant first class support for ease of use/ergonomics.

Sample code as TLDR and starting point for discussion:


// size becomes a runtime param, ballast is a constant
#[bench(size=(1000..1_000_000).step_by(1000), ballast=[0, 4, 8, 16, 32, 64]]
fn lookup_hit(b: &mut Bencher, size: usize, ballast: usize) {
  let m = generate_map(size, [0u8; ballast]);
  let mut k = &m.map(|k,v| k).collect::Vec<_>();
  rng().shuffle(vec.as_mut_slice());
  let i = k.iter();
  b.iter(|| {
    if i.is_none() { i = k.iter(); }
    m.get(&i.next().unwrap())
  })
}

The above should generate proper names for the benchmark:

lookup_hit/size=1000/ballast=0  ...  Xns
lookup_hit/size=1000/ballast=4  ...  Xns
lookup_hit/size=1000/ballast=8  ...  Xns
...
lookup_hit/size=2000/ballast=0  ...  Xns
...

Benchmarking sort:

#[bench(size=[1000, 10_000, 1_000_000])
fn sort(b: &mut Bencher, size: usize) {
  let v = (0..size).map(rng().gen::<u32>()).collect::Vec::<_>();
  // Explicitly tell bencher this iteration involves `size` ops.
  b.iter_n(size, || {
    v.as_mut_slice().sort();
    v
  });

My rust foo is not powerful enough for type parametrization. My ideas so far involve macro like invocations - perhaps there is a way to do this without macros?

Manishearth · January 11, 2018, 7:33am

Opened an RFC

Gilnaa · January 12, 2018, 1:22pm

I was wondering, should black_box really be part of test/bench?

black_box can (and is) useful for other uses, wouldn’t it be a better fit for core::mem? (Or somewhere else in core)

Ixrec · January 12, 2018, 1:33pm

Yeah, @nagisa suggested a move to core on the RFC thread: https://github.com/rust-lang/rfcs/pull/2287#issuecomment-356940164 I never thought about it before but it seems perfectly sensible to me.

Gilnaa · January 12, 2018, 1:41pm

Oh great, I missed that one. Thanks

iopq · March 11, 2018, 4:59pm

Ah, this is what I was looking for. Using bencher now in my projects.

bluss · March 11, 2018, 6:33pm

Keep an eye on criterion too, it looks like an ambitious project to me.

system · March 25, 2019, 8:27am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Idea: Semi-stabilization language design	37	4583	July 7, 2019
Getting more testing of unstable features	40	4401	March 25, 2019
#[bench] status libs	12	7800	March 25, 2019
Allow external crate to use unnecessary #[feature] on stable language design	38	3443	October 9, 2019
Keeping around unstable features until their replacements hit stable policy	6	1194	December 22, 2024

Pre-RFC: Stabilize `#[bench]`, `Bencher` and `black_box`

Related topics