
Conversation

@fsaintjacques (Contributor) commented May 3, 2019

  • Implements the archery benchmark run sub-command, which performs a single run without comparing to another run. This is the basic building block for running a benchmark and publishing the result to a database, e.g. codespeed and/or a custom db. Push is not provided in this PR.
  • Implements serializing/deserializing the result of a run in JSON.
  • Improves the from_rev_or_path method to also accept the JSON output from the previous point, so you can compare any mixture of commit(s), build(s), and run output(s). This effectively adds support for comparing against the cached results of a run.
  • Implements redirecting the output to a named file via --output; - means stdout (the default). See the usage sketch after this list.
  • Improves the GoogleBenchmarkRunner to output partial results to stdout for progress feedback.
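
For example, the intended usage might look like this (a sketch based on the flags described above; the exact invocation may differ):

```console
# Run the benchmark suite once and write the JSON result to a file
archery benchmark run --output=run-1.json

# `-` (the default) writes the JSON result to stdout instead
archery benchmark run --output=-
```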

The C++ benchmarks would not provide feedback because the JSON
result was captured from stdout. Refactored this such that benchmarks
output both text to stdout and JSON to a named file.
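
For reference, Google Benchmark supports this dual output natively; a minimal sketch of such an invocation (the benchmark binary name is hypothetical):

```console
# Human-readable progress still goes to stdout, while the JSON
# result is written to the named file via Google Benchmark's flags.
./arrow-builder-benchmark --benchmark_out=result.json --benchmark_out_format=json
```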
Supports writing the result into a file. This allows more stable
composition.
Allows running benchmark suites for a specific build.
@fsaintjacques (Contributor, Author)

@pitrou this adds your requested feature to compare against offline data.

- Implement JSON serialization for Benchmark and BenchmarkSuite
@fsaintjacques force-pushed the ARROW-5071-benchmark-run branch from 29034c6 to 1ca0aa9 on May 3, 2019 19:36
@codecov-io

Codecov Report

Merging #4249 into master will decrease coverage by 24.57%.
The diff coverage is n/a.


@@             Coverage Diff             @@
##           master    #4249       +/-   ##
===========================================
- Coverage   88.19%   63.62%   -24.58%     
===========================================
  Files         660      375      -285     
  Lines       72992    53019    -19973     
  Branches     1251        0     -1251     
===========================================
- Hits        64378    33735    -30643     
- Misses       8501    19284    +10783     
+ Partials      113        0      -113
| Impacted Files | Coverage Δ |
|---|---|
| cpp/src/arrow/util/memory.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/parquet/hasher.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/extension_type.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/util/sse-util.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/flight/server.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/json/chunker.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/util/uri.cc | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/flight/client_auth.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/compute/logical_type.h | 0% <0%> (-100%) ⬇️ |
| cpp/src/arrow/compute/kernels/filter.cc | 0% <0%> (-100%) ⬇️ |

... and 695 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6c626c8...1ca0aa9.

@nealrichardson (Member)

Should the JSON benchmark output format be documented? Any special instructions for those writing or modifying benchmarks (naming, how to maintain continuity with historical data and when to break it, etc.) so that they work correctly in this system?

@fsaintjacques (Contributor, Author)

I don't think it's worth documenting yet, as the goal is simply to pass the serialized state from one archery sub-command to another.

Adding documentation on writing/modifying benchmarks is definitely worth it. It reminds me that we need to add a version to the benchmark metadata once we want to track this in a (persisted) database.
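
For example, the composition looks like this (a sketch; the diff sub-command name is assumed here, but from_rev_or_path now accepts a cached JSON output wherever a commit or build is accepted):

```console
# Cache a baseline run, then compare another revision against it later
archery benchmark run --output=baseline.json
archery benchmark diff HEAD baseline.json
```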

@nealrichardson (Member)

Ok.

How necessary is a version in the benchmark metadata itself? Isn't the version knowable based on the timestamp?

@fsaintjacques (Contributor, Author)

I'd say the git commit is a better proxy for version. The problem with having only a proxy (say, commit or timestamp) is that you can't decide whether two samples are comparable.

By version, I didn't mean an actual human/semver version, but a key to differentiate whether two named benchmarks are comparable. Usually, the shasum of the function body (the file, in our case, because it's the easiest proxy) dictates whether the benchmarks are comparable.

Note that this is low-to-medium priority since benchmark bodies don't change often, and you can always work around it by renaming the benchmark (making the old version not comparable with the new name) and making that an official policy.
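
A rough sketch of that comparability key in Python (the helper is hypothetical, not archery's actual API):

```python
import hashlib


def benchmark_version(source_path):
    """Comparability key: checksum of the file defining the benchmark.

    Two samples of the same named benchmark are comparable only if their
    version keys match, i.e. the benchmark body did not change between runs.
    """
    with open(source_path, "rb") as f:
        return hashlib.sha1(f.read()).hexdigest()
```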

@kszucs self-requested a review May 7, 2019 18:18

@kszucs (Member) left a comment


LGTM, thanks @fsaintjacques!
