Is it possible to isolate internal changes on transitive dependency with rmeta

ashi009 · September 26, 2024, 9:26am

When we try to optimize build performance for our rust project. It appears that any internal transitive dependency change (not touching the exposed types, say just adding a comment) will invalidate the whole build cache.

It turned out the generated rmeta for the crate is different for any code change because it embeds the source file info:

/// Holds information about a rustc_span::SourceFile imported from another crate.
/// See `imported_source_file()` for more information.

The same idea applies to internal changes that don't affect the exposed types. Given all that, rmeta has little advantage compared to rlib, as any change will still propagate through the entire graph, and rlibs will still be regenerated. rmeta only gives an early start for the following build process.

My understanding is that to produce the final binary/dylib, we always need all transitive rlibs to present. But for intermediate build steps that generate rlibs, we just need type info (and perhaps some other details) to allow codegen, and all other stuff could be reassembled at the final step with rlibs.

A similar design is found in golang's compiler (Go at Google: Language Design in the Service of Software Engineering - The Go Programming Language) It avoids putting internal details to the intermediate build artifacts, to avoid transitive cache invalidation from changes that are not visible from outside.

The process is more automatic and even more efficient than in Plan 9 C, though: the data being read when evaluating the import is just "exported" data, not general program source code. The effect on overall compilation time can be huge, and scales well as the code base grows.

I'm wondering if there are any toggles/hacks that I can play with rmeta, to avoid leaking internal details to the generated rmeta files.

bjorn3 · September 26, 2024, 10:55am

With incr comp most of the work is skipped when an upstream crate changes. As for avoiding recompilations entirely, there has been some discussion on zulip, but nothing concrete.

josh · September 26, 2024, 3:37pm

You may want to read this Zulip thread, where Piotr is prototyping a mechanism to do exactly this: https://rust-lang.zulipchat.com/#narrow/stream/246057-t-cargo/topic/Dynamically.20pruning.20jobs.20from.20the.20work.20queue

ashi009 · September 27, 2024, 7:12am

Nice! However, we are building our project with bazel, this approach is too deep into the rustc internals and how rustc handles its build cache.

I'm looking for something that could be done from the outside, say still using rmeta and letting the external build system trigger actions based on the the artifacts.

Topic		Replies	Views
Crate dependency discovery compiler	24	3947	May 14, 2020
Don't rebuild dependents when certain criteria is met cargo	7	441	December 2, 2025
Pre-RFC: Alternative approach to -Zno-link/-Zlink-only split linking compiler	14	1396	September 6, 2021
"Interface-only" crate type? compiler	13	2403	June 9, 2019
Librustc_driver.so not reproducible compiler	19	1504	January 7, 2024

Is it possible to isolate internal changes on transitive dependency with rmeta

Related topics