exclude unreachable code paths from having coverage instrumentation by andrewrk · Pull Request #21236 · ziglang/zig

andrewrk · 2024-08-29T01:16:19Z

Follow-up to #21075.

Closes #20992 by moving code coverage instrumentation to the LLVM backend explicitly, leaving LLVM's sancov pass to only do TraceCmp.

cc @kristoff-it - implications for https://github.com/kristoff-it/zig-afl-kit/ are that you can remove these:

void __sanitizer_cov_trace_pc_indir () {}
void __sanitizer_cov_8bit_counters_init () {}
void __sanitizer_cov_pcs_init () {}

Leaving them in is harmless, though. You should also see a speedup in iterations per second, since the redundant instrumentation is gone. Unfortunately, `__sancov_lowest_stack` has to stay for now due to llvm/llvm-project#106464.

Before:

This is after it reaches the panic line:

After:

The extra green dots on the panic line are the "else" clauses from each if statement. This is tracked by #20989 and will be solved by making empty else blocks not emit points of interest.

The final red dot is because when the input is found that triggers the panic, the process crashes, including libfuzzer, including the test runner, so the coverage is not reported. Ideally, any block that is terminated with a @panic would also be excluded from coverage for the same reason as auto-generated safety checks.

Implement code coverage instrumentation in the LLVM backend instead of relying on the LLVM sancov pass. The LLVM pass is still executed if trace_pc_guard is requested and is disabled otherwise. The LLVM backend emits the instrumentation directly.

It uses the `__sancov_pcs1` symbol name instead of `__sancov_pcs` because each element is 1 usize instead of 2.

AIR: add CoveragePoint to branch hints, which indicates whether those branches are interesting for code coverage purposes.

Update libfuzzer to use the new instrumentation. It's simplified since we no longer need the constructor and the pcs are now in a contiguous list.

This is a regression in the fuzzing functionality because the instrumentation for comparisons is no longer emitted, resulting in worse generated fuzzer inputs. A future commit will add that instrumentation back.
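To make the data layout concrete, here is a minimal C sketch of 8-bit counters paired with a contiguous PC table in which each entry is a single pointer-sized value. It is an illustration only: the array names, `NUM_POINTS`, the addresses, and the `hit`, `count_covered`, and `pc_of` helpers are all hypothetical and are not the symbols or code that the Zig LLVM backend emits.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical number of coverage points in the instrumented program. */
#define NUM_POINTS 4

/* One 8-bit counter per coverage point, bumped when the point executes. */
static uint8_t counters[NUM_POINTS];

/* One pointer-sized entry per coverage point (a single usize each).
 * The addresses are made up for the example. */
static const uintptr_t pcs[NUM_POINTS] = { 0x1000, 0x1010, 0x1024, 0x1040 };

/* What the instrumentation at coverage point `i` conceptually does. */
static void hit(size_t i) {
    assert(i < NUM_POINTS);
    counters[i] += 1; /* wraps around after 255, as an 8-bit counter does */
}

/* Count how many coverage points have a nonzero counter. */
static size_t count_covered(void) {
    size_t n = 0;
    for (size_t i = 0; i < NUM_POINTS; i++) {
        if (counters[i] != 0) n++;
    }
    return n;
}

/* Map a counter index back to its program counter. */
static uintptr_t pc_of(size_t i) {
    assert(i < NUM_POINTS);
    return pcs[i];
}
```

Because both arrays share an index, a consumer can walk them in lockstep to report which source locations were reached.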

It's useful to have TraceCmp based on the results of LLVM optimizations, while the code coverage bits are emitted by Zig manually, allowing more careful correlation to points of interest in the source code.

This re-enables the sancov pass in `-ffuzz` mode, but only TraceCmp. Notably, IndirectCalls is off and needs to be implemented manually in the LLVM backend, and StackDepth remains off because it is not used by libfuzzer or AFL either. If stack depth is re-introduced, it can be done with better performance characteristics by being function call graph aware and only lowered in call graph cycles, where its heuristic properties come in useful.

Fixes the fuzzing regression.
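For readers unfamiliar with TraceCmp, the following C sketch shows the kind of feedback a comparison hook can give a fuzzer. SanitizerCoverage's comparison tracing calls hooks such as `__sanitizer_cov_trace_cmp4(uint32_t, uint32_t)` around comparisons. The handler below takes the same arguments but uses a hypothetical name (`trace_cmp4_sketch`), a hypothetical return value, and a made-up scoring scheme. It is not what libfuzzer or Zig's fuzzer actually does.

```c
#include <assert.h>
#include <stdint.h>

/* Highest number of matching bits seen so far (hypothetical fuzzer state). */
static unsigned best_matching_bits;

/* Count set bits portably. */
static unsigned popcount32(uint32_t x) {
    unsigned n = 0;
    while (x != 0) {
        x &= x - 1; /* clear the lowest set bit */
        n++;
    }
    return n;
}

/* Takes the same arguments as __sanitizer_cov_trace_cmp4. Returns 1 when
 * this comparison came closer to equality than any previous one. */
static int trace_cmp4_sketch(uint32_t a, uint32_t b) {
    unsigned matching = 32 - popcount32(a ^ b);
    assert(matching <= 32);
    if (matching > best_matching_bits) {
        best_matching_bits = matching;
        return 1;
    }
    return 0;
}
```

A fuzzer could treat a return value of 1 as a signal to keep the current input, which is why losing this instrumentation leads to worse generated inputs.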

Placing the instrumentation globals in `llvm.compiler.used` matches what LLVM's sancov pass does and is required so that optimization passes do not delete the instrumentation. However, this currently triggers an error, "members of llvm.compiler.used must be named", so the next commit will add names to those globals.

because it marks the linker section, preventing garbage collection. Also, name the members, because that is required by this intrinsic. Also, enable the StackDepth option in the sancov pass as a workaround for llvm/llvm-project#106464; otherwise, LLVM enables TracePCGuard even though we explicitly disable it.

andrewrk and others added 9 commits August 28, 2024 18:07

llvm.Builder: add !nosanitize API

df52073

see #20992

Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

llvm.Builder: revert adding !nosanitize API

a3d622b

It's not actually useful after all.

print_air: print cond_br branch hints

43dc8db

LLVM: disable inline 8-bit counters when using trace pc guard

1bec824

fuzzing: fix entry address logic

13b5cee

* the pcs list is unsorted
* use the function address

Fixes entry points in ReleaseSafe mode.
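Because the pcs list is unsorted, a binary search could miss an address that is actually present, so a linear scan is the safe way to locate an entry. A minimal C sketch of that lookup follows. The `find_pc_index` helper is hypothetical, and this is not the code from the commit.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Return the index of `addr` in an unsorted list of PCs, or -1 if absent. */
static ptrdiff_t find_pc_index(const uintptr_t *pcs, size_t len, uintptr_t addr) {
    assert(pcs != NULL || len == 0);
    for (size_t i = 0; i < len; i++) {
        if (pcs[i] == addr) return (ptrdiff_t)i;
    }
    return -1;
}
```

The scan is O(n) per lookup, which is acceptable when it only runs once to resolve an entry point.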

andrewrk merged commit e9a00ba into master Aug 29, 2024

andrewrk deleted the fuzz branch August 29, 2024 06:20
