[TRITON]: Adding Lean + Paged Attention, for decode #376

Merged

rahulbatra85 merged 10 commits into ROCm:main from alexdutu:lean-atten-paged

Jul 18, 2025

Contributor

alexdutu commented May 6, 2025

alexdutu added 2 commits

May 5, 2025 23:58

Adding Lean + Paged Attention, for decode

1f47b2b

Removing unused vars

9a666f4

alexdutu requested review from rahulbatra85 and vgokhale

May 6, 2025 00:11

Adding a benchmark script for lean+paged attention

6d301a5

rahulbatra85 mentioned this pull request

Add leanAttention op and test #361

Closed

alexdutu added 2 commits

June 25, 2025 23:31

Merge branch 'main' into lean-atten-paged

5bdc6a1

Replacing torch_to_tl_dtype with torch_to_triton_dtype

8461d29

Contributor

rahulbatra85 commented Jun 26, 2025

@alexdutu Can you please address the issues reported by the black/linter CI?
It's a required CI check, so those will need to be fixed before this PR can be merged. Thanks!

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

aiter/ops/triton/lean_atten_paged.py (outdated, resolved)

rahulbatra85 reviewed

op_tests/op_benchmarks/triton/bench_la_paged_decode.py (outdated, resolved)

rahulbatra85 reviewed

op_tests/op_benchmarks/triton/bench_la_paged_decode.py (outdated, resolved)

rahulbatra85 reviewed

op_tests/triton_tests/test_la_paged.py (outdated, resolved)

rahulbatra85 reviewed

op_tests/triton_tests/test_la_paged.py (outdated, resolved)

rahulbatra85 requested changes

Contributor

rahulbatra85 left a comment

Please remove commented code. Thanks!

alexdutu added 2 commits

June 26, 2025 16:18

Comments removed and black fixes

93e418a

More comments removed

9816a7d

Contributor Author

alexdutu commented Jun 26, 2025

@rahulbatra85 I removed all the comments you've mentioned and some extra ones, and added the fixes from the black tool.

alexdutu added 2 commits

June 27, 2025 08:59

Merge branch 'main' into lean-atten-paged

b415fd9

Merge branch 'main' into lean-atten-paged

dd5e5ef

rahulbatra85 changed the title from "Adding Lean + Paged Attention, for decode" to "[TRITON]: Adding Lean + Paged Attention, for decode"

Merge branch 'main' into lean-atten-paged

a3df6dc

rahulbatra85 self-requested a review

July 18, 2025 18:10

rahulbatra85 approved these changes

rahulbatra85 merged commit 69ba678 into ROCm:main

10 checks passed

cagrikymk pushed a commit that referenced this pull request

[TRITON]: Adding Lean + Paged Attention, for decode (#376)

d3d76f0

* Adding Lean + Paged Attention, for decode

* Removing unused vars

* Adding a benchmark script for lean+paged attention

* Replacing torch_to_tl_dtype with torch_to_triton_dtype

* Comments removed and black fixes

* More comments removed
