A case for a new output format for the compiler?

glandium · February 1, 2024, 10:40pm

The Rust compiler currently supports three output formats:

json (self explanatory)
human

warning: unused variable: `a`
 --> src/main.rs:2:9
  |
2 |     let a = 1;
  |         ^ help: if this is intentional, prefix it with an underscore: `_a`
  |
  = note: `#[warn(unused_variables)]` on by default

warning: `foo` (bin "foo") generated 1 warning (run `cargo fix --bin "foo"` to apply 1 suggestion)
    Finished dev [unoptimized + debuginfo] target(s) in 0.00s

short

   Compiling foo v0.1.0 (/tmp/foo)
src/main.rs:2:9: warning: unused variable: `a`
warning: `foo` (bin "foo") generated 1 warning (run `cargo fix --bin "foo"` to apply 1 suggestion)
    Finished dev [unoptimized + debuginfo] target(s) in 0.17s

Now compare that with the output of the C compilers that are also commonly invoked when building Rust code:

gcc

foo.c: In function ‘foo’:
foo.c:2:1: warning: control reaches end of non-void function [-Wreturn-type]
    2 | }
      | ^

clang

foo.c:2:1: warning: non-void function does not return a value [-Wreturn-type]
    2 | }
      | ^
1 warning generated.

Do you notice the difference? The warning message is on the same line as the file name. This is extremely convenient for extracting warnings and errors from large build logs.
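To illustrate why the one-line layout is convenient, here is a minimal sketch of such an extractor. It assumes diagnostics of the form `file:line:col: severity: message`, as in the gcc, clang, and short rustc outputs above; the `Diagnostic` struct and the function names are made up for this example, and paths containing colons (for instance Windows drive letters) are not handled.

```rust
/// One diagnostic parsed from a `file:line:col: severity: message` log line.
/// (Hypothetical type, introduced only for this sketch.)
#[derive(Debug, PartialEq)]
struct Diagnostic {
    file: String,
    line: u32,
    column: u32,
    severity: String,
    message: String,
}

/// Parse a single log line; returns `None` for anything that is not a
/// one-line warning or error.
fn parse_diagnostic(line: &str) -> Option<Diagnostic> {
    // Split into at most five fields so colons inside the message survive.
    let mut parts = line.splitn(5, ':');
    let file = parts.next()?;
    let line_no = parts.next()?.trim().parse().ok()?;
    let column = parts.next()?.trim().parse().ok()?;
    let severity = parts.next()?.trim();
    let message = parts.next()?.trim();
    if severity != "warning" && severity != "error" {
        return None;
    }
    Some(Diagnostic {
        file: file.to_string(),
        line: line_no,
        column,
        severity: severity.to_string(),
        message: message.to_string(),
    })
}

/// Collect every one-line diagnostic found in a build log.
fn extract_diagnostics(log: &str) -> Vec<Diagnostic> {
    log.lines().filter_map(parse_diagnostic).collect()
}

fn main() {
    let log = "   Compiling foo v0.1.0 (/tmp/foo)\n\
               src/main.rs:2:9: warning: unused variable: `a`\n\
               foo.c: In function 'foo':\n\
               foo.c:2:1: warning: control reaches end of non-void function [-Wreturn-type]\n";
    for diagnostic in extract_diagnostics(log) {
        println!("{:?}", diagnostic);
    }
}
```

The same function returns `None` for every line of the human format, because there the location sits on a separate `-->` line with no message attached.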

To some extent, that's what the short format does, but it is too short: it drops the source snippet, the help text, and the notes, so you have to run another build to get the full message.

I, for one, would like a format that shows the message like the short format, but with the full context like the human format. So here's my question: should this be a new format, or would a change to the human format be considered?
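As a rough illustration of the idea, the sketch below post-processes human-format output so that each diagnostic header carries its location, while the rest of the context is left untouched. This is not an existing rustc or cargo option; the function name `prefix_locations` and the exact layout of the rewritten header are assumptions made for the example.

```rust
/// Rewrite `warning: ...` / `error: ...` header lines so that they start with
/// the location taken from the ` --> file:line:col` line that follows them.
/// All other lines are passed through unchanged.
fn prefix_locations(human: &str) -> String {
    let lines: Vec<&str> = human.lines().collect();
    let mut out: Vec<String> = Vec::with_capacity(lines.len());
    for (i, line) in lines.iter().enumerate() {
        let is_header = line.starts_with("warning") || line.starts_with("error");
        // Location from the next line, if that line is a `-->` pointer.
        let location = lines
            .get(i + 1)
            .copied()
            .and_then(|next| next.trim_start().strip_prefix("--> "));
        match (is_header, location) {
            (true, Some(loc)) => out.push(format!("{}: {}", loc, line)),
            _ => out.push(line.to_string()),
        }
    }
    out.join("\n")
}

fn main() {
    let human = "warning: unused variable: `a`\n \
                 --> src/main.rs:2:9\n  \
                 |\n\
                 2 |     let a = 1;\n  \
                 |         ^ help: if this is intentional, prefix it with an underscore: `_a`";
    println!("{}", prefix_locations(human));
}
```

Summary lines such as the `generated 1 warning` line are left alone, since no `-->` line follows them.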

glandium · February 1, 2024, 11:06pm

Come to think of it, it would also be nice if the short format included the lint name (unused_variables) too, and likewise for the hypothetical new format.

eggyal · February 1, 2024, 11:08pm

Am I correct in thinking such extraction is an automated process? Isn't the JSON output format best for that?

glandium · February 2, 2024, 12:01am

The extraction is an automated process, but the same logs are also for human consumption. I guess a wrapper that takes the JSON output and munges it would work, but then the nice coloring is lost (the JSON doesn't contain it) for the cases where you do want the colors. Rebuilding the original output from the JSON with color is certainly possible, but it's also a lot of work.

epage · February 2, 2024, 1:04am

Cargo exclusively uses rustc's json output for all of its processing, including the colored message you see. Granted, I don't remember under what conditions cargo will report back up the colored messages, if at all.

glandium · February 2, 2024, 6:35am

I'd rather not have to reimplement what cargo does to display colors. (And now that you mention it, I'm rather confused as to why cargo doesn't cache the JSON messages rather than the formatted output, which ultimately leads to rust-lang/cargo issue #9003, "Message caching does not allow switching --message-format between short and human".)

bjorn3 · February 2, 2024, 8:00am

Cargo uses --error-format=json --json=diagnostic-rendered-ansi to get colors with the json error message format.
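For reference, a minimal sketch of invoking rustc directly with those two flags might look like the following. The flags are the ones quoted above; the helper name `rustc_json_command` and the source path are placeholders for this example. With these flags, rustc is expected to write one JSON object per diagnostic to stderr, and the `rendered` field of each should hold the human-readable text including ANSI color codes.

```rust
use std::process::Command;

/// Build (but do not run) a rustc invocation that requests JSON diagnostics
/// whose rendered text keeps its ANSI colors.
/// (Hypothetical helper, introduced only for this sketch.)
fn rustc_json_command(source: &str) -> Command {
    let mut cmd = Command::new("rustc");
    cmd.arg("--error-format=json")
        .arg("--json=diagnostic-rendered-ansi")
        .arg(source);
    cmd
}

fn main() {
    let cmd = rustc_json_command("src/main.rs");
    // Print the command line instead of running it, so the sketch works
    // without a project on disk.
    println!("{:?}", cmd);
}
```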

glandium · February 2, 2024, 10:10am

But cargo build --message-format=json doesn't show the colors, and --json=anything is not a supported cargo option.

bjorn3 · February 2, 2024, 12:00pm

I think adding an equivalent option to cargo would be the best course of action then.

afetisov · February 2, 2024, 12:04pm

GCC was started in the 80s, when there was no standard tooling-friendly output format besides "print one item per line in a sorta-stable format, parse it with grep and hope for the best". Clang copied it for compatibility reasons. I don't see a good reason to replicate this bug-prone approach in modern tooling. JSON output is already intended to be easily and unambiguously parseable. If it's missing some information, that's a good reason to extend the structured output rather than invent yet another unstructured one.

By the way, not every terminal renders colours, so it would be nice to have an environment-portable encoding of colours anyway.

system · May 2, 2024, 12:05pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.
