ipdps-25

For The CWRU Pioneer Cluster

Make sure that your environment is setup as follows:

module load Python/3.10.8-GCCcore-12.2.0
Utilize PyTorch 2.4.0 compiled with CUDA 12.1 in your virtual environment
module load CUDA/12.1.1
module load Ninja/1.11.1-GCCcore-12.2.0

I will use the /Sparse_FlashAttention_CSR/ folder as an example for creating a function bound to PyTorch.

This folder should be copied under the torch folder in your virtual environment. My path for example:

/home/nkt8/cust_pt/testing/lib/python3.10/site-packages/torch/

The *.cpp file is in charge of actually creating the binding and performing PyTorch's checks on the input data. The *.cu file contains the kernel and the function that calls the kernel. setup.py is what you actually run to compile the function.

You'll notice that setup.py points to the *.cpp and *.cu file and gives the module a name, in my case I called it spfa_csr. This will be the name of the package itself.

Please use mine as a template, but for further info you can refer to: https://pytorch.org/tutorials/advanced/cpp_extension.html

To compile (for all CWRU HPC GPU architectures), please run the following:

TORCH_CUDA_ARCH_LIST="6.0 7.0 7.5 8.0" python setup.py install

Once you have compiled your example, please use the testing script located in /verification/ to verify that your output matches what PyTorch outputs. You will need to import your own function and call it as well to verify. Please make sure that you are using the MATH backend (what should be uncommented within the code).

Please note that the verification process is done with fully dense matrices to verify that our process exactly matches that of attention.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
COO_and_CSR		COO_and_CSR
Sparse_FlashAttention_BSR		Sparse_FlashAttention_BSR
Sparse_FlashAttention_COO		Sparse_FlashAttention_COO
Sparse_FlashAttention_CSR		Sparse_FlashAttention_CSR
Sparse_FlashAttention_Global_No_Local		Sparse_FlashAttention_Global_No_Local
Sparse_FlashAttention_Local		Sparse_FlashAttention_Local
verification		verification
README.md		README.md
version-coo.cu		version-coo.cu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ipdps-25

For The CWRU Pioneer Cluster

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ipdps-25

For The CWRU Pioneer Cluster

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages