ARROW-11624: [Rust] Move Arrow benchmarks to its own crate #9493

Dandandan · 2021-02-14T14:39:12Z

This PR moves the benchmarks into their own crate, which removes the criterion dependencies from the Arrow crate. Having fewer dependencies speeds up compile times.

On my machine, the time to build the crate with `cargo test`:

| | Old | New |
| --- | --- | --- |
| Nr dependencies | 151 | 110 |
| execution time | 38.92 | 28.24 |

github-actions · 2021-02-14T14:39:34Z

https://issues.apache.org/jira/browse/ARROW-11624

codecov-io · 2021-02-14T15:07:25Z

Codecov Report

Merging #9493 (466f4b0) into master (8547c61) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #9493   +/-   ##
=======================================
  Coverage   82.12%   82.12%           
=======================================
  Files         235      235           
  Lines       54729    54729           
=======================================
  Hits        44944    44944           
  Misses       9785     9785

| Impacted Files | Coverage Δ | |
| --- | --- | --- |
| rust/parquet/src/encodings/encoding.rs | `94.86% <0.00%> (-0.20%)` | ⬇️ |
| rust/arrow/src/array/transform/fixed_binary.rs | `84.21% <0.00%> (+5.26%)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 8547c61...466f4b0.

jorgecarleitao · 2021-02-15T04:41:01Z

Doesn't this make it more difficult to develop? I usually need to run a benchmark multiple times while developing, e.g., a kernel, and now I would need to have two VS Code windows open for this.

I do not see the benefits either: aren't the times you refer to only relevant during development, and only on the first build? Running `cargo test` again will not rebuild criterion and the others, right? The same applies to the number of dependencies.

I think that the concern in the mailing list is not about dev dependencies but about non-dev dependencies.

alamb · 2021-02-15T11:14:49Z

> I think that the concern in the mailing list is not about dev dependencies but about non-dev dependencies.

It is true that I was originally thinking about non-dev dependencies, though speeding up dev cycles itself is also a worthy goal.

alamb · 2021-02-15T11:18:26Z

rust/arrow/Cargo.toml

 memory-check = []

 [dev-dependencies]
-criterion = "0.3"


I guess I was imagining something more like adding a

[features]
benches = ["criterion"]

So then, if one wanted to have the benches built, they could run a command like `cargo bench --features benches` or something.

It might be interesting to try something like that out. It would keep the dev flow closer to what we have now while reducing compile times when you don't want to run any benchmarks.
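
A minimal sketch of how that could be wired up in rust/arrow/Cargo.toml, offered as an assumption and not as what this PR does. Cargo does not allow entries under `[dev-dependencies]` to be optional, so the sketch moves criterion to an optional regular dependency. The bench target name `example_kernel` is hypothetical.

```toml
[dependencies]
# Optional, so it is only compiled when the `benches` feature is enabled.
# (Cargo rejects `optional = true` under [dev-dependencies].)
criterion = { version = "0.3", optional = true }

[features]
# Enabling this feature pulls in the optional criterion dependency.
benches = ["criterion"]

[[bench]]
# Hypothetical bench target; stands in for any file under benches/.
name = "example_kernel"
harness = false
# Cargo skips this target unless the feature is enabled.
required-features = ["benches"]
```

With a layout like this, a plain `cargo test` should not need to build criterion, while `cargo bench --features benches --bench example_kernel` would build and run the benchmark from the same crate.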

@jorgecarleitao would something like this work for you?

jorgecarleitao · 2021-02-15T11:26:59Z

I am just trying to understand how this reduces the dev cycle: maybe I am not using the correct flow.

My current flow on the arrow crate is (in a VS Code terminal opened on rust/arrow):

git checkout master
git checkout -b feature/A
# > remove all workspace members from `../Cargo.toml`, since rust-analyzer builds all of them and I do not care about them.

# > change something
cargo fmt
cargo clippy
cargo test --lib
cargo bench --bench X
# repeat step until happy

# > commit

git checkout master
cargo bench --bench X
git checkout feature/A
cargo bench --bench X
# > copy bench results

Open PR with bench results and pray that other dependencies pass. If not, iterate on them.

I am curious as to how others have been working here.

alamb · 2021-03-03T11:32:09Z

@Dandandan what do you think we should do with this PR? Is it something we should work on getting in? Or should we park the discussion for now?

I doubt this is going to improve compile times when benchmarks are involved, but it will improve times for PRs that don't need to rerun benchmarks.

Dandandan · 2021-03-03T12:08:09Z

@alamb let's park the discussion for now and see if there are other areas of improvement. I think it could still be a good idea for decreasing compile / dev times, but the benchmarks are also quite central to the arrow crate. Maybe it makes more sense to do that in the DataFusion crate? Let's see.

alamb · 2021-03-03T13:58:35Z

Closing this PR for now

@alamb

…ject

Same idea as #9493 but for examples in DataFusion

FYI @alamb

Clean + building with `cargo test`. Moving the micro benchmarks out of the crate is also another possibility.

| | Old | New |
| ------------- |:-------------:| -----:|
| Nr dependencies | 309 | 249 |
| build time(s) | 77 | 68 |

Closes #9494 from Dandandan/move_datafusion_examples

Lead-authored-by: Heres, Daniel <danielheres@gmail.com>
Co-authored-by: Daniël Heres <danielheres@gmail.com>
Signed-off-by: Andrew Lamb <andrew@nerdnetworks.org>


Move benches to its own crate

07c2600

github-actions bot added the Component: Rust label Feb 14, 2021

Add licence

6b09cc4

This was referenced Feb 14, 2021

ARROW-11626: [Rust][DataFusion] Move [DataFusion] examples to own project #9494

Closed

ARROW-11298: [Rust][DataFusion] Implement Postgres String Functions [Splitting to separate PRs] #9243

Closed

Dandandan added 5 commits February 14, 2021 20:21

Add to prepare-test

6db1c5a

Reorder

679f294

Remove?

cac4427

Add to both

d621981

Fix

466f4b0

alamb reviewed Feb 15, 2021


alamb closed this Mar 3, 2021
