ARROW-5071: [C++] CMake benchmark wrapper #4077
Conversation
It's better to define the named arguments explicitly: `__init__(self, date=None, executable=None)`
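For illustration, a minimal sketch of what the explicit signature could look like in context (the class name and attribute handling here are assumptions, not the PR's actual code):

```python
class Run:
    # Hypothetical class; only the signature comes from the review comment.
    def __init__(self, date=None, executable=None):
        self.date = date
        self.executable = executable
```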
Force-pushed from 03ef28e to 3a2b53a
bkietz left a comment:
Could you comment on your organization? For example, your classes don't map exactly onto https://github.com/apache/arrow/tree/master/dev/benchmarking/ddl/ (which some might expect).
When benchmarks are run under ctest (via `ninja benchmark` or `make benchmark`), a `benchmark` directory is created under the build directory. For each benchmark, two files are generated: `$name.json.original`, which is the raw JSON output of google benchmark, and `$name.json`, which is the output expected by the import script of the benchmarks DDL found in `dev/benchmarking`.
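As a minimal sketch of the described layout (assuming a build directory named `build/`, which the PR doesn't specify), the generated files could be inspected like this:

```python
import json
import pathlib

# "build" is an assumed build directory name.
bench_dir = pathlib.Path("build") / "benchmark"
for original in bench_dir.glob("*.json.original"):
    raw = json.loads(original.read_text())  # raw google benchmark output
    # Stripping the ".original" suffix yields the converted "$name.json".
    converted = json.loads(original.with_suffix("").read_text())
```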
Force-pushed from 3a2b53a to 04b6bdc
pitrou left a comment:
LGTM in general. Just a couple nits.
```python
head = run_cmd("git rev-parse HEAD")
# %ai: author date, ISO 8601-like format
fmt = "%ai"
timestamp = run_cmd(f"git log -1 --pretty='{fmt}' {head}")
```
Why don't you pass "HEAD" directly instead of the explicit changeset id?
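That is, roughly (a sketch of the suggested simplification, reusing the PR's `run_cmd` helper):

```python
# Let git resolve HEAD itself instead of passing an explicit changeset id.
timestamp = run_cmd("git log -1 --pretty='%ai' HEAD")
```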
```python
if maybe_median:
    return maybe_median[0].value
# fallback
return self.runs[int(self.n_runs / 2)].value
```
That's correct only if n_runs is odd (otherwise it should probably be the average of the two "middle" values)... Is there a circumstance where gbenchmark does several runs but doesn't output the median / mean / stddev?
I added the statistics just for the sake of completeness with min/q1/q3/max (which are not provided by google benchmark). According to Wikipedia, the median of n sorted values is the ((n+1)/2)-th value when n is odd; I'll fix it.
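For what it's worth, Python's standard library already handles both parities; a quick illustration (not the PR's code):

```python
import statistics

# Middle value for an odd count, mean of the two middle values for an even count.
statistics.median([1, 3, 2])     # -> 2
statistics.median([1, 4, 2, 3])  # -> 2.5
```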
```python
return [cls(k, version, list(bs)) for k, bs in groups]


def as_arrow(version, payload):
```
Would be nice to add a comment or docstring somewhere explaining what this script does. It will make maintenance easier in 6 months ;-)
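For example, a hypothetical module docstring along these lines (wording assumed from the PR description above):

```python
"""Convert google benchmark JSON output into the format expected by the
import script of the benchmarks DDL in dev/benchmarking."""
```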
```python
benchmark_json = json.load(in_fd)
converted = as_arrow(version, benchmark_json)
json.dump(converted, out_fd)
```
Might want to add `indent=2` to get a pretty-printed JSON output.
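That is (a sketch of the suggested tweak):

```python
json.dump(converted, out_fd, indent=2)  # pretty-printed JSON
```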
I'll re-open this once #4141 is merged; I'll re-implement this functionality as part of