Extraction of code as part of compiler integration tests for CI

Apologies if this post seems under-informed about the Rust CI. I am new to this forum, and it was suggested to me on Reddit that I present my ideas here. I would like to hear what you think about their usefulness.

One flaw I see in the current testing setup is that integration tests do not seem to be tracked in a structured manner, and it is not easy to extract the issue number or other meaningful information from them for a CI system. However, I may be wrong in my estimation of this, or simply uninformed about what already exists.

The ideas:

1. Use GitHub Issues to extract code as integration tests (and test it)

2. Run automatically chosen integration tests against the HEAD of nightly and stable

3. Show results as part of perf

What does the typical workflow for handling an issue look like?

  1. The issue comes in (usually reports are not this clean):

On nightly x86_64 this

fn main() {
    let mut u = (1,);
    *&mut u.0 = 5;
    assert_eq!( { u.0 }, 5);
}

gives

thread 'main' panicked at 'assertion failed: `(left == right)`
  ...

Output of rustc --version --verbose

rustc 1.47.0-nightly (39d5a61f2 2020-07-17)
binary: rustc
commit-hash: 39d5a61f2e4e237123837f5162cc275c2fd7e625
commit-date: 2020-07-17
host: x86_64-unknown-linux-gnu
release: 1.47.0-nightly
LLVM version: 10.0

  2. It is manually tested (via copy-paste to godbolt or to a local file) and confirmed.

  3. It is tagged accordingly.

  4. Hopefully, somebody soon fixes the issue.

  5. A regression test is written and a PR links to the issue.

  6. bors closes the issue when the PR lands.

The minimal test case can often be used, with minimal changes, as the regression test.
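As an illustration, the minimal test case above could become a regression test roughly like this. The path and issue number are hypothetical; the `// run-pass` header tells compiletest that the program must compile and run successfully.

```rust
// src/test/ui/issues/issue-NNNNN.rs (path and issue number are hypothetical)
// run-pass
// Regression test: writing through a reborrowed mutable reference to a
// tuple field must be observable on the next read of that field.

fn main() {
    let mut u = (1,);
    *&mut u.0 = 5;
    assert_eq!({ u.0 }, 5);
}
```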

What if we could remove steps 2 and 5, and make sure things such as backports are not overlooked?

It is better to use computers for repetitive tasks.

1. Use GitHub Issues to extract code as integration tests

  1. In an ideal case, the user creates an issue looking like this:

I have been staring at this code forever, and it looks very broken to me.

// run-pass
fn main() {
    let mut u = (1,);
    *&mut u.0 = 5;
    assert_eq!( { u.0 }, 5);
}
thread 'main' panicked at 'assertion failed: `(left == right)`
  ...

rustc --version --verbose

rustc 1.47.0-nightly (39d5a61f2 2020-07-17)
binary: rustc
commit-hash: 39d5a61f2e4e237123837f5162cc275c2fd7e625
commit-date: 2020-07-17
host: x86_64-unknown-linux-gnu
release: 1.47.0-nightly
LLVM version: 10.0

  2. The CI then parses the fields in a simplified manner, tests against the HEAD of nightly and stable, and marks the issue as either CONFIRMED or NEEDS-REVIEW.

Alternatively, the godbolt API could be used ([API.md in the compiler-explorer repository on GitHub](https://github.com/compiler-explorer/compiler-explorer)), but that likely takes more effort.
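A minimal sketch of what the parsing step could look like, assuming the issue body follows the simplified template above with the snippet in a Rust code fence. All function names here are made up for illustration; this is not a real rustc CI component.

```rust
// Hypothetical sketch: extract the reproduction snippet and the reported
// release line from a GitHub issue body that follows the template above.

fn extract_snippet(body: &str) -> Option<&str> {
    // Take the contents of the first Rust code fence.
    let open = "```rust";
    let start = body.find(open)? + open.len();
    let rest = &body[start..];
    let end = rest.find("```")?;
    Some(rest[..end].trim())
}

fn extract_release(body: &str) -> Option<&str> {
    // Pick the "release:" line out of the pasted rustc version block.
    body.lines()
        .find_map(|line| line.trim().strip_prefix("release:"))
        .map(str::trim)
}

fn main() {
    let body = "This looks very broken to me.\n```rust\nfn main() {}\n```\nrelease: 1.47.0-nightly\n";
    assert_eq!(extract_snippet(body), Some("fn main() {}"));
    assert_eq!(extract_release(body), Some("1.47.0-nightly"));
    println!("parsed OK");
}
```

A real implementation would of course need to handle missing fences, multiple snippets, and free-form reports, which is exactly what the NEEDS-REVIEW state is for.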

  3. The reviewer adds additional labels for later placement of the integration test, fixes up the integration test, and pings the right people.

  4. The fix gets committed.

  5. bors closes the issue.

2. Run automatically chosen integration tests against the HEAD of nightly and stable

  • Goal: keep track of issues and rarely, if ever, let regressions slip into stable/nightly.

    Presumably the CI currently has no logic for deciding which tests should be included and does not check for missing tests; otherwise such incidents could not have happened.

    Having no guideline for how issues must be linked to integration tests does not help either.

At around 25 issues per day, it will only become harder over time not to miss tedious details, and I have no idea how to estimate the complexity from the issues before it becomes unbearable.

Maybe I am missing a repo where all code issues are hosted?
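The core of such an automated check is small. Below is a sketch, assuming `rustc` is on PATH; the file names and the pass/fail classification are made up, and a real CI job would additionally pin the toolchain (nightly vs. stable) and sandbox the run.

```rust
use std::fs;
use std::process::{Command, Stdio};

// Hypothetical sketch: compile and run a reported snippet with the rustc
// on PATH and report whether it builds and exits successfully. Not a real
// rustc CI component.
fn snippet_passes(code: &str) -> std::io::Result<bool> {
    let dir = std::env::temp_dir();
    let src = dir.join("reported-snippet.rs");
    let bin = dir.join("reported-snippet-bin");
    fs::write(&src, code)?;

    // A compile failure is already a result worth recording.
    let compiled = Command::new("rustc")
        .arg(&src)
        .arg("-o")
        .arg(&bin)
        .stderr(Stdio::null())
        .status()?
        .success();
    if !compiled {
        return Ok(false);
    }

    // A panic (non-zero exit) would confirm a run-time misbehaviour report.
    Ok(Command::new(&bin).stderr(Stdio::null()).status()?.success())
}

fn main() -> std::io::Result<()> {
    println!("passes: {}", snippet_passes("fn main() { assert_eq!(2 + 2, 4); }")?);
    Ok(())
}
```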

3. Show results as part of perf

  • Goal: show that things work and do not regress.
  1. The known-working and the known-failing (but intended-to-be-fixed) integration tests could be part of sporadic perf results and releases. Alternatively, they could be scheduled at a fixed time interval.

How should tests be formatted?

Rust tests appear to be handcrafted, with the help of x.py, and are not added in a specifically [consistent way](https://rustc-dev-guide.rust-lang.org/tests/adding.html).

"For regression tests – basically, some random snippet of code that came in from the internet – we often name the test after the issue plus a short description."

Related

When things go wrong

  • Integration tests in src/test have no uniform way of linking to the corresponding issue for a regression (or is this handled by the filename?)

For example ui-fulldeps contains files like

  • issue-15149.rs (often without any explanation)

  • aux-build:issue-16822.rs (same folder, no explanation)

  • compiler-calls.rs without an .stderr (same folder)

  • plugin-args.rs with according plugin-args.stderr

The latter two are freestanding, but I found no overview or systematic explanation of what they are supposed to test. Likely I am missing something here.

Curious

Why are valgrind tests not removed or moved to an archive? (See the "How Rust is tested" overview.)

Things like `extern crate issue_9188;` are not explained in the dev guide.

Thank you for your input re: our CI. We (the infra team) appreciate it and will respond when we are able.