perf: Parallelize siteRDF with ProductIterator #1882

rprospero · 2024-04-23T13:48:10Z

This adds a ProductIterator class that enables iterating over two separate indices. As a bit of a refresher:

PairIterator looks at distinct pairs from a single source set
ZipIterator examines pairs of indices for different iterators joined together, so each index is seen once.
ProductIterator examines every possible combination of indices from two different sources.

We then use this product iterator to parallelise the siteRDF calculation.

The algorithms really enforce that the iterator must return a reference.

trisyoungs · 2024-06-04T09:42:04Z

@rprospero and @RobBuchananCompPhys I may have an answer for you. I assumed that you were benchmarking previously by running the SiteRDF system test, and wondered if the reason behind the lack of improvement in parallel was because the calculation was only a tiny part of the overall test, and the biggest bottleneck was disk I/O. I just implemented a proper benchmark of the SiteRDF module in its current form, removing all other overhead including the I/O, and the results for the 1k water box are:

std::execution::seq          39.7 ms per iteration over 18 iterations
std::execution::par_seq    4.1 ms per iteration over 139 iterations

So almost a factor 10 improvement! For the large 5k argon box:

std::execution::seq          3927 ms per iteration over 1 iteration
std::execution::par_seq    128 ms per iteration over 5 iterations

A factor 30 improvement. Not too shabby! I can push up the new benchmarks on the end of this PR if you want to test for yourself.

RobBuchananCompPhys · 2024-06-04T13:16:55Z

@rprospero and @RobBuchananCompPhys I may have an answer for you. I assumed that you were benchmarking previously by running the SiteRDF system test, and wondered if the reason behind the lack of improvement in parallel was because the calculation was only a tiny part of the overall test, and the biggest bottleneck was disk I/O. I just implemented a proper benchmark of the SiteRDF module in its current form, removing all other overhead including the I/O, and the results for the 1k water box are:
std::execution::seq          39.7 ms per iteration over 18 iterations
std::execution::par_seq    4.1 ms per iteration over 139 iterations
So almost a factor 10 improvement! For the large 5k argon box:
std::execution::seq          3927 ms per iteration over 1 iteration
std::execution::par_seq    128 ms per iteration over 5 iterations
A factor 30 improvement. Not too shabby! I can push up the new benchmarks on the end of this PR if you want to test for yourself.

Wow, great stuff. Would be good to get those benchmarks.

rprospero · 2024-06-06T13:00:39Z

Just realised that this PR is no longer actually using the ProductIterator. I'm going to refactor to use it and check the benchmarks. If it's faster, we'll use it. Otherwise, I'll drop the class for now.

rprospero · 2024-06-06T13:49:42Z

After benchmarking, the ProductIterator had comparable speed in series, but was slower in parallel than just using the inner loop. Ultimately, the iterator is supposed to pay for its built in performance penalty by smoothing out the cases where certain pairs take significantly longer to calculate than others. I've removed the unused class, since I'm now less convinced that it will ever be needed.

src/modules/siteRDF/process.cpp

rprospero added 5 commits April 23, 2024 11:24

Use product iterator through siteRDF sites

e5e0886

Fix productIterator

e4081ff

The algorithms really enforce that the iterator must return a reference.

Claculate bins in parallel

a9e5b29

Parallelise actual binning procedure

e861f8a

Everything in a single parallel cycle

d2cb599

rprospero changed the title ~~pref: Parallelize siteRDF with ProductIterator~~ perf: Parallelize siteRDF with ProductIterator Apr 23, 2024

rprospero added 4 commits April 23, 2024 17:05

ProductIterator directly accesses iterators

fdb9c88

Performance improvements on ProductIterator

a8113eb

Remove unimplemented functions from ProductIterator

6ba5f26

Only loop over a single Site

ab6f2e8

trisyoungs added 3 commits June 4, 2024 10:52

Allow SpeciesSiteVector keywords to be set programmatically.

eebcf03

Add sites to water and argon benchmark test inputs.

dc5897e

Add SiteRDF benchmark.

e01da0d

Fix formatting

e1485ca

rprospero marked this pull request as ready for review June 6, 2024 12:53

rprospero requested a review from trisyoungs June 6, 2024 12:53

rprospero marked this pull request as draft June 6, 2024 12:55

Remove product iterator

23e6c30

rprospero marked this pull request as ready for review June 6, 2024 13:49

trisyoungs approved these changes Jun 7, 2024

View reviewed changes

src/modules/siteRDF/process.cpp Outdated Show resolved Hide resolved

RobBuchananCompPhys added 4 commits June 7, 2024 16:05

added combinable constructor for value type

b08f46c

parallel execution

71b8e14

formatted

bd65432

use histogram instance rather than lambda

f2ef31e

RobBuchananCompPhys merged commit 449c6fd into develop Jun 10, 2024

RobBuchananCompPhys deleted the productIterator branch June 10, 2024 13:21

RobBuchananCompPhys mentioned this pull request Jun 24, 2024

perf: Combinable histograms for angle module parallel execution #1937

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Parallelize siteRDF with ProductIterator #1882

perf: Parallelize siteRDF with ProductIterator #1882

Uh oh!

rprospero commented Apr 23, 2024

Uh oh!

trisyoungs commented Jun 4, 2024

Uh oh!

RobBuchananCompPhys commented Jun 4, 2024

Uh oh!

rprospero commented Jun 6, 2024

Uh oh!

rprospero commented Jun 6, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

perf: Parallelize siteRDF with ProductIterator #1882

perf: Parallelize siteRDF with ProductIterator #1882

Uh oh!

Conversation

rprospero commented Apr 23, 2024

Uh oh!

trisyoungs commented Jun 4, 2024

Uh oh!

RobBuchananCompPhys commented Jun 4, 2024

Uh oh!

rprospero commented Jun 6, 2024

Uh oh!

rprospero commented Jun 6, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants