Pre-RFC: Test groups

GoldsteinE · March 28, 2023, 8:01am

I haven’t been able to find anything like this either on IRLO or among existing RFCs. Please, direct me to the proposal if it already exists.

The problem: cargo test / libtest currently implement coarse-grained test filtering. As far as I understand, these are the options:

You can include/exclude all the tests (not) marked with #[ignore].
You can exclude all the tests marked with #[should_panic].
You can filter by name (by substring or full string matching).

There’re cases where it may be desirable to run only some tests. For example, there may be different reasons why the test is ignored by default. Consider this code:

#[test]
#[ignored]
fn im_ignored_because_im_long() { ... }

#[test]
#[ignored]
fn im_ignored_because_I_fail_in_runtime_outside_of_CI() { ... }

When developing locally, you may want to run test that are ignored because they take a lot of time, but not those which are guaranteed to fail. To do this currently, you need to either rely on by-name filtering or weed out unneeded tests with #[cfg]s (which requires recompilation).

Proposal: Add support for test groups in libtest and cargo test. Syntax is bikesheddable, but it should look something like this:

#[test(long)]
#[ignore]
fn im_ignored_because_im_long() { ... }

#[test(ci_only)]
#[ignore]
fn im_ignored_because_I_fail_in_runtime_outside_of_CI() { ... }

and then

cargo test -- --ignored --group=long

or

cargo test -- --enable-group=long

Using the same mechanism, you can disable certain tests if you want. For example, you may want to run long tests by default, but disable them for quick iteration with --disable-group=long.

An alternative I considered: do groups only for ignored tests, like this:

#[test]
#[ignore = "long"]
fn im_ignored_because_im_long() { ... }

and then

cargo t -- --ignored=long

This still solves the “groups of ignored tests” problem, but is kinda less flexible.

What do you think? I could write a formal RFC sometime soon.

eggyal · March 28, 2023, 9:07am

You already can use conditional compilation to only include tests when particular features are enabled which you then do using cargo test --features.

jyn514 · March 28, 2023, 9:11am

Why does this need to be part of upstream libtest instead of a custom test framework on crates.io?

GoldsteinE · March 28, 2023, 9:12am

Conditional compilation requires re-compilation. Test groups would be a runtime feature, removing the need to recompile everything (both the crate itself and test crates, since there is no way to specify a feature only for tests) for every test run with different group chosen.

GoldsteinE · March 28, 2023, 9:14am

Rust doesn’t have a great support for custom test harnesses. You could do something like this with a procmacro, but how would you, for example, parse arguments passed to the test binary? You would need to completely replace a test harness, which is not optimal IMO.

jyn514 · March 28, 2023, 9:15am

Take a look at how https://nexte.st/ works. I agree it's less convenient at first, but it also lets you experiment with real code without first having to get an RFC approved.

GoldsteinE · March 28, 2023, 9:20am

It’s possible to implement this in user code, either by completely replacing a test runner (what nextest does, a lot of work) or passing an environment variable (easy, but not a great UI) or making a wrapper around cargo test which passes said variable. These all are viable workarounds, but I don’t think they’re quite good enough to say that considering implementation of this feature in libtest is not worthwhile.

jyn514 · March 28, 2023, 9:22am

I'm saying something different - try this out with an env variable, get people to use it, and then that adoption shows a strong motivation for the RFC to be accepted.

GoldsteinE · March 28, 2023, 9:35am

Poor adoption of a solution with poor UX doesn’t say much about whether or not a solution with good UX should be implemented. It’s probably easier to use name patterns + filtering by name or #[cfg] attributes rather then groups implemented in user code, for a few reasons:

Implementing groups in user code would require a procmacro. Procmacros break tooling, they cause spurious rust-analyzer crashes, they’re poorly supported by IntelliJ Rust and sometimes they cause compiler error to appear in a weird place.
Passing an env var is just awkward. Wrapper would work, but it’s a change in workflow, a tool that needs to be installed in CI and it doesn’t work with other cargo plugins, like cargo-hack, cargo-miri and cargo-nextest.

I don’t expect good adoption of such a tool, because I personally wouldn’t use it for mentioned reasons.

ehuss · March 28, 2023, 11:45am

Is there prior art in other test frameworks (in other languages/ecosystems)?

GoldsteinE · March 28, 2023, 12:24pm

Yes, notably @pytest.mark and JUnit @Category. Tasty in Haskell has test groups, but they’re different — all tests in one group are defined together.

mathstuf · March 28, 2023, 3:08pm

CTest has the LABELS property for tests.

max-sixty · March 29, 2023, 7:01pm

Poor adoption of a solution with poor UX doesn’t say much about whether or not a solution with good UX should be implemented.

I don’t expect good adoption of such a tool, because I personally wouldn’t use it for mentioned reasons.

I think what people are saying is: it's difficult to show this is a big problem if you're also claiming that a relatively small burden, such as an env var, is greater than the problem itself

(FWIW I agree with you that this would be a good feature! I use it in python a lot. Though my prior is also that it doesn't necessarily belong in the language itself)

GoldsteinE · March 29, 2023, 8:04pm

I don’t claim that this is a large burden. Test groups, if implemented, would be a QoL feature. I think that QoL features are important too, and there’s no reason to keep libtest barebones, especially when something can’t be cleanly implemented in user code.

Nearly any tooling can be in some form implemented in user code. You can have external build system, test framework, package manager (C++, Java and Haskell are some examples), but I think there’s value in having some batteries included, and I think this particular battery is a good candidate for inclusion.

This would be less of a problem if Rust properly supported custom test harnesses, of course.

jdahlstrom · March 30, 2023, 11:22am

Can/could normal (sub)modules used for test grouping, using the existing filter-by-path/name functionality? That would preclude using the same test in two groups without some duplication/factoring to a common function though – is that a desired feature?

GoldsteinE · March 30, 2023, 11:42am

That would work for the simplest cases. My main usecase for this feature, though, is grouping tests that are declared in different crates inside of a workspace, which makes this impossible. It also prevents grouping together unit tests and integration tests, and even just unit tests that need to be declared in different modules due to privacy rules.

system · June 28, 2023, 11:42am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pre-RFC: Provide ignore message when the test ignored libs	5	1388	April 10, 2022
Cargo test should fail if the specified test to run isn't found cargo	2	187	December 10, 2024
Use the TestType in test library to control running of integration tests libs	6	632	February 24, 2023
How to skip/disable unit test cases while rustc testing using the package Cargo.toml file traits working group	5	1357	May 17, 2024
Skippable Tests language design	5	379	October 27, 2024

Pre-RFC: Test groups

Related topics