New AlltoAllV (Imbalanced AlltoAll) benchmark. #157

babusid · 2023-07-27T21:17:10Z

Alltoallv benchmark

This pull request incorporates a new AllToAllV benchmark. This allows for testing a wider range of communication patterns found in real-world workloads, and allows end users to easily create custom benchmarks by adjusting test parameters. It can even be used to simulate communication patterns that have dedicated tests currently, without having to compile all of them.

The benchmark requires a parameterization CSV file which contains a square matrix of dimension NxN where N is the number of ranks. Each entry in this matrix must contain a fraction between 0 and 1 (inclusive). Entry I,J determines the amount of data sent from Rank I to Rank J. The amount is equal to the fraction times the data size specified for the test. For example, if the benchmark is being run at 256M, and a particular entry has a value of 0.5, then the sending rank will send 128M to the receiving rank.

This PR also adds the ability to parameterize any test with a setup file with the -s flag. The AllToAllV benchmark implementation can be referenced for an example on how to use this.

cc: @sjeaugey (I don't have the ability to add reviewers)

Changes: - Added variable count of elements to send/recv based on sending/recieving peers - Added new file to make file Notes: - Current method of uniquely identifying the peers that are sending (thread_local of thread number) may not work correctly. Not sure if that is the appropriate way to determine rank.

…d rank

- alltoallv2.cu testfile: Parameterizes with alltoallv_param.csv - run_a2av.sh script: -- Runs the built test with an arbitrarily named CSV instead of the static name -- Passes through other arguments to the testfile

Each Rank is guaranteed to send X/nranks data in some distribution.

babusid · 2023-08-02T23:12:52Z

cc: @AddyLaddy

Sidharth Babu and others added 20 commits June 2, 2023 23:09

Initialized alltoallv test

071d8ac

changed to use nccl rank finding function. This should now use a vali…

086e318

…d rank

created evaluation code for a2av static imbalancing

2f0cb03

convenience script

e8e2bf5

added a default load case

5c0ea18

created second atav testfile for granular test development

53bc496

First draft of more granular alltoallv

5644a03

- alltoallv2.cu testfile: Parameterizes with alltoallv_param.csv - run_a2av.sh script: -- Runs the built test with an arbitrarily named CSV instead of the static name -- Passes through other arguments to the testfile

This version passes row == 1 distribution guarantee.

ef00fa9

Each Rank is guaranteed to send X/nranks data in some distribution.

added CLI arg for param file, switched to using variable in testcase

6b7e790

added documentation

025333a

rename + docs

f94647f

Merge branch 'NVIDIA:master' into alltoallv_tests

cc5bb18

removed unnecessary script

2782aae

cleanup

8ad54bd

Removed builtin usage, replaced with function argument

8d6c8d8

added some testcases to use as reference

cd6da41

Renamed CLI arg

d4fb4d0

changed filepath limit

5cbb12e

adjusted parse function

86577ee

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

New AlltoAllV (Imbalanced AlltoAll) benchmark. #157

New AlltoAllV (Imbalanced AlltoAll) benchmark. #157

Uh oh!

babusid commented Jul 27, 2023 •

edited

Loading

Uh oh!

babusid commented Aug 2, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

New AlltoAllV (Imbalanced AlltoAll) benchmark. #157

Are you sure you want to change the base?

New AlltoAllV (Imbalanced AlltoAll) benchmark. #157

Uh oh!

Conversation

babusid commented Jul 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Alltoallv benchmark

Uh oh!

babusid commented Aug 2, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

babusid commented Jul 27, 2023 •

edited

Loading