Conversation

@valechen (Contributor) commented on Aug 4, 2025

  1. Move "calculate_max_output_tiles_analytically" to lean_atten.py
  2. Combine "get_num_splits_and_buffer_sizes" and "get_lean_attention_params" into a single helper
  3. Create an _attention_inner() @jit function for the inner-loop attention calculation (sketched below)
  4. Add handling for the total_tiles < num_SMs case (see the host-side sketch below)
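
For item 3, the idea is to factor the per-tile softmax/accumulate update into its own `@triton.jit` helper so the outer loop only handles tile indexing. Below is a minimal sketch of such a helper, assuming an online-softmax update over one K/V tile; the argument names and signature are illustrative assumptions, not the actual `_attention_inner()` added in lean_atten.py.

```python
import triton
import triton.language as tl


@triton.jit
def _attention_inner(q, k, v, acc, m_i, l_i, qk_scale):
    # One K/V tile of the online-softmax attention update.
    # Assumed shapes: q [BLOCK_M, HEAD_DIM], k [HEAD_DIM, BLOCK_N] (pre-transposed),
    # v [BLOCK_N, HEAD_DIM], acc [BLOCK_M, HEAD_DIM], m_i / l_i [BLOCK_M].
    qk = tl.dot(q, k) * qk_scale              # scores for this tile
    m_new = tl.maximum(m_i, tl.max(qk, 1))    # updated running row max
    alpha = tl.exp(m_i - m_new)               # rescale factor for the old state
    p = tl.exp(qk - m_new[:, None])           # unnormalized probabilities
    acc = acc * alpha[:, None] + tl.dot(p.to(v.dtype), v)
    l_i = l_i * alpha + tl.sum(p, 1)
    return acc, m_new, l_i
```

Returning the `(acc, m, l)` triple keeps the online-softmax state explicit, so the caller can carry it across tiles and do the final `acc / l[:, None]` normalization once after the loop.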

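Item 4 is a launch-side concern: for small problems the total number of lean tiles can be smaller than the number of SMs, so a fixed grid of num_SMs CTAs would leave some CTAs with no work. One way to handle that is to clamp the grid and partition the tiles evenly; the sketch below is an assumed illustration (function name and return shape are not from the PR).

```python
def assign_tiles_to_ctas(total_tiles: int, num_SMs: int):
    """Return (num_ctas, list of per-CTA [start, end) tile ranges)."""
    if total_tiles == 0:
        return 0, []
    # Launch no more CTAs than there are tiles, so no CTA gets an empty range.
    num_ctas = min(num_SMs, total_tiles)
    base, rem = divmod(total_tiles, num_ctas)
    ranges, start = [], 0
    for cta in range(num_ctas):
        count = base + (1 if cta < rem else 0)  # spread the remainder over the first CTAs
        ranges.append((start, start + count))
        start += count
    return num_ctas, ranges
```

For example, `assign_tiles_to_ctas(10, 304)` yields a grid of 10 CTAs with one tile each, instead of launching 304 CTAs of which 294 would exit immediately.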
@valechen requested a review from rahulbatra85 on Aug 4, 2025 22:28
@valechen requested a review from vgokhale on Aug 5, 2025 16:26
@valechen merged commit 86a6fd2 into main on Aug 6, 2025
14 checks passed
@valechen deleted the la_opt2 branch on Aug 6, 2025 16:03