Add benchcomp parser for kani perf tests by karkhaz · Pull Request #2327 · model-checking/kani

karkhaz · 2023-03-29T06:31:31Z

This commit adds a parser that emits CBMC-generated metrics to benchcomp. It is intended to be run in a kani checkout after running kani-perf.sh. A minimal working example would be to run benchcomp in a directory containing the following benchcomp.yaml:

variants:
  kani_0.17:
    config:
      directory: ~/src/kani
      command_line: git checkout .; git checkout kani-0.17.0 && rm -rf target && cargo build-dev && ./scripts/kani-perf.sh ; true
  kani_0.24:
    config:
      directory: ~/src/kani
      command_line: git checkout .; git checkout main; rm -rf target && cargo build-dev && ./scripts/kani-perf.sh ; true
run:
  suites:
    kani_perf:
      parser:
        module: kani_perf
      variants:
        - kani_0.17
        - kani_0.24

This commit also contains minor fixes to ensure that the test suite runs to completion even if it exited with a non-zero return code.

Testing:

How is this change tested?

Running as indicated in the commit message should write a result.yaml file to the current directory.

Is this a refactor change?

No

Checklist

Each commit message has a non-empty body, explaining why the change was made
Methods or procedures are documented
Regression or unit tests are included, or existing tests cover the modified code
My PR is restricted to a single feature or bugfix

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 and MIT licenses.

tools/benchcomp/benchcomp/parsers/kani_perf.py

qinheping · 2023-03-29T18:43:34Z

Can we add a test in CI to make sure the parser will not be broken by future PRs that modify Kani's output?

karkhaz · 2023-04-05T19:54:43Z

@qinheping I've added a unit test. Regression test has the following problems:

Actually running kani-perf.sh seems like overkill for this regression. Is there a way to run just a single test from that suite?
Even if we run one test, the actual metrics will be different on different machines. So we can't check the values of the metrics, we can only check that the metrics were found in the log.

If there's an easy way to write a regression test so that it passes fairly quickly, I'd be happy to do that.

qinheping · 2023-04-05T21:23:08Z

@karkhaz thank you for the unit test!

Actually running kani-perf.sh seems like overkill for this regression. Is there a way to run just a single test from that suite?

The current unit test check that the parser works correctly on the output given by the current Kani. I suggest to run Kani on two benchmarks (one success and one fail) and dump the output before testing the parser. For example, add the following commands in https://github.com/karkhaz/kani/blob/kk-wip-benchcomp/tools/benchcomp/test/run

kani ../../tests/kani/Unwind-Attribute/fixme_lib.rs > test/unit/kani_unwind-attribute_fixme_lib_out
kani ../../tests/kani/Arbitrary/arbitrary_impls.rs > test/unit/kani_Arbitrary_arbitrary_impls_out

And then run the parser on kani_unwind-attribute_fixme_lib_out and test/unit/kani_Arbitrary_arbitrary_impls_out. As we can't predict the running time. Let's just check that verification_time, solver_runtime, symex_runtime are correctly parsed as numbers; and "success" is True for the kani_unwind-attribute_fixme_lib_out and is False for kani_Arbitrary_arbitrary_impls_out. We can remove the two output files after the test.

Also, do we have a plan to integrate all benchcomp tests in CI?

zhassan-aws

LGTM modulo @qinheping's comments.

This visualization prints the results to stdout as a YAML file. This is useful for piping benchcomp output into other scripts, and also for parsing the output of regression tests.

This commit adds a parser that emits CBMC-generated metrics to benchcomp. It is intended to be run in a kani checkout after running `kani-perf.sh`. A minimal working example would be to run `benchcomp` in a directory containing the following benchcomp.yaml: variants: kani_0.17: config: directory: ~/src/kani command_line: git checkout .; git checkout kani-0.17.0 && rm -rf target && cargo build-dev && ./scripts/kani-perf.sh ; true kani_0.24: config: directory: ~/src/kani command_line: git checkout .; git checkout main; rm -rf target && cargo build-dev && ./scripts/kani-perf.sh ; true run: suites: kani_perf: parser: module: kani_perf variants: - kani_0.17 - kani_0.24 This commit also contains minor fixes to ensure that the test suite runs to completion even if it exited with a non-zero return code.

karkhaz · 2023-04-11T15:09:18Z

@qinheping Thank you, I removed the unit test as it's not as useful as the regression test.

karkhaz · 2023-04-11T15:10:06Z

@qinheping I'll post an Action that uses this feature as soon as this is merged, blocking CI required this and the error_on_regression patch.

qinheping

Sounds great! Thank you @karkhaz

karkhaz force-pushed the kk-wip-benchcomp branch from 438e1ab to 049eca4 Compare March 29, 2023 07:00

karkhaz marked this pull request as ready for review March 29, 2023 07:00

karkhaz requested a review from a team as a code owner March 29, 2023 07:00

karkhaz enabled auto-merge (squash) March 29, 2023 07:01

zhassan-aws reviewed Mar 29, 2023

View reviewed changes

karkhaz force-pushed the kk-wip-benchcomp branch 4 times, most recently from f8fbc52 to 83ef6b0 Compare April 5, 2023 17:42

karkhaz force-pushed the kk-wip-benchcomp branch from 83ef6b0 to 17d9a8e Compare April 5, 2023 19:56

zhassan-aws approved these changes Apr 5, 2023

View reviewed changes

karkhaz force-pushed the kk-wip-benchcomp branch from 17d9a8e to 004d55a Compare April 6, 2023 23:29

karkhaz disabled auto-merge April 6, 2023 23:32

karkhaz marked this pull request as draft April 6, 2023 23:36

Add dump_yaml visualization to benchcomp

366c28b

This visualization prints the results to stdout as a YAML file. This is useful for piping benchcomp output into other scripts, and also for parsing the output of regression tests.

karkhaz force-pushed the kk-wip-benchcomp branch from 004d55a to 0da05fb Compare April 11, 2023 14:27

karkhaz marked this pull request as ready for review April 11, 2023 14:27

karkhaz force-pushed the kk-wip-benchcomp branch from 0da05fb to 4c6d12a Compare April 11, 2023 14:29

qinheping approved these changes Apr 11, 2023

View reviewed changes

karkhaz merged commit 3336996 into model-checking:main Apr 11, 2023

karkhaz deleted the kk-wip-benchcomp branch April 11, 2023 15:40

github-actions bot mentioned this pull request Apr 22, 2025

Automatic toolchain upgrade to nightly-2025-04-22 #4039

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add benchcomp parser for kani perf tests#2327

Add benchcomp parser for kani perf tests#2327
karkhaz merged 2 commits intomodel-checking:mainfrom
karkhaz:kk-wip-benchcomp

karkhaz commented Mar 29, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qinheping commented Mar 29, 2023 •

edited

Loading

Uh oh!

karkhaz commented Apr 5, 2023

Uh oh!

qinheping commented Apr 5, 2023

Uh oh!

zhassan-aws left a comment

Uh oh!

karkhaz commented Apr 11, 2023

Uh oh!

karkhaz commented Apr 11, 2023

Uh oh!

qinheping left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

karkhaz commented Mar 29, 2023

Testing:

Checklist

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qinheping commented Mar 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

karkhaz commented Apr 5, 2023

Uh oh!

qinheping commented Apr 5, 2023

Uh oh!

zhassan-aws left a comment

Choose a reason for hiding this comment

Uh oh!

karkhaz commented Apr 11, 2023

Uh oh!

karkhaz commented Apr 11, 2023

Uh oh!

qinheping left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qinheping commented Mar 29, 2023 •

edited

Loading