(EXPERIMENT) Replace zero-deps nodes with a singleton#154162
(EXPERIMENT) Replace zero-deps nodes with a singleton#154162Zalathar wants to merge 2 commits intorust-lang:mainfrom
Conversation
|
@bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
(EXPERIMENT) Replace zero-deps nodes with a singleton
|
For example, it might be the case that we only want to perform this simplification when the query+key is not eligible for disk caching, as disk caching might rely on having a distinct |
|
💔 Test for fbe5cb6 failed: CI. Failed job:
|
This comment has been minimized.
This comment has been minimized.
|
Yeah, looks like there's more subtlety here that I'll have to dig into. |
|
Let's try a narrower version that only applies to queries that don't have @bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
(EXPERIMENT) Replace zero-deps nodes with a singleton
This comment has been minimized.
This comment has been minimized.
|
Finished benchmarking commit (2f12db4): comparison URL. Overall result: ❌✅ regressions and improvements - please read the text belowBenchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @bors rollup=never Instruction countOur most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.
Max RSS (memory usage)Results (primary -1.1%, secondary 2.1%)A less reliable metric. May be of interest, but not used to determine the overall result above.
CyclesResults (secondary 3.3%)A less reliable metric. May be of interest, but not used to determine the overall result above.
Binary sizeThis benchmark run did not return any relevant results for this metric. Bootstrap: 485.042s -> 483.702s (-0.28%) |
|
Now that #154122 has landed, let's do a clean perf run to measure just these new changes. @bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
(EXPERIMENT) Replace zero-deps nodes with a singleton
This comment has been minimized.
This comment has been minimized.
|
Finished benchmarking commit (d41f91c): comparison URL. Overall result: ❌✅ regressions and improvements - please read the text belowBenchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @bors rollup=never Instruction countOur most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.
Max RSS (memory usage)Results (primary -1.8%)A less reliable metric. May be of interest, but not used to determine the overall result above.
CyclesResults (primary 2.4%, secondary -2.4%)A less reliable metric. May be of interest, but not used to determine the overall result above.
Binary sizeThis benchmark run did not return any relevant results for this metric. Bootstrap: 485.484s -> 485.487s (0.00%) |
|
Looks like a nice reduction in dep graph size, but clearly there are some subtle side effects that I’ll want to investigate. |
|
Let's try again, but this time exclude queries that have recoverable keys. If a query doesn't cache on disk, and can't recover keys, then in theory there should be no reason for no-deps nodes to exist. @bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
(EXPERIMENT) Replace zero-deps nodes with a singleton
This comment has been minimized.
This comment has been minimized.
|
Finished benchmarking commit (16b8f32): comparison URL. Overall result: ❌✅ regressions and improvements - please read the text belowBenchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @bors rollup=never Instruction countOur most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.
Max RSS (memory usage)Results (primary -1.8%, secondary -1.6%)A less reliable metric. May be of interest, but not used to determine the overall result above.
CyclesResults (secondary 0.6%)A less reliable metric. May be of interest, but not used to determine the overall result above.
Binary sizeThis benchmark run did not return any relevant results for this metric. Bootstrap: 485.484s -> 484.656s (-0.17%) |
|
Hmm, there’s still something weird going on in incr-full builds. In theory this change should not substantially affect them, outside of the smaller dep graph, but benchmark detailed results show some unexpected changes in query execution/hit counts. |
|
Closing this experiment for now; I might return to it later. |
View all comments
Experimental follow-up to #154122.
This might have subtle and undesired consequences for disk caching, so I'll have to do more thinking and testing before I propose it as a real PR.