feat: add flash-attn==2.7.4.post1 to backend dependencies#622

Merged: terrykong merged 31 commits into main from flash-attn-2.7.4 on Jul 8, 2025
Conversation

Collaborator

@terrykong terrykong commented Jul 7, 2025

This adds flash-attn to the vllm and mcore extras, plus an automodel extra in preparation for #578. Putting it in the core dependencies causes headaches: linting and other simple CI jobs become painful because flash-attn either has to be built on a GPU runner or cannot be built at all on CPU-only GitHub-hosted runners.
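
Since flash-attn lives in extras rather than the core dependencies, code that runs on CPU-only runners should not assume it is installed. A minimal sketch of that guard is below. It is not code from this PR: the helper names and the backend labels are hypothetical, and only the standard-library `importlib.util.find_spec` call is relied on.

```python
import importlib.util


def flash_attn_available() -> bool:
    """Return True if the optional flash_attn package is importable."""
    # find_spec only locates the package; it does not import it,
    # so this check is safe on machines without a GPU.
    return importlib.util.find_spec("flash_attn") is not None


def pick_attention_backend() -> str:
    """Choose an attention backend label based on what is installed.

    The two labels are illustrative placeholders, not names taken
    from this repository.
    """
    return "flash_attention_2" if flash_attn_available() else "sdpa"


if __name__ == "__main__":
    print(f"flash-attn installed: {flash_attn_available()}")
    print(f"selected backend: {pick_attention_backend()}")
```

On a CPU-only runner without the extra installed, the check returns `False` and the fallback label is selected, so linting and simple tests never need to build flash-attn.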

Also added a script and instructions for warming the cache when working outside a container. The Dockerfile needed a small change to prefetch flash-attn and avoid cache misses, since it is such an expensive dependency to build (45 min to 1.5 hr).
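
To illustrate the idea of warming the cache ahead of time, here is a rough sketch that only builds the commands and does not run them. It is not the script added in this PR. It assumes the extras are literally named `vllm`, `mcore`, and `automodel`, and that syncing each extra once with `uv sync --extra <name>` is enough to populate the local cache with the built flash-attn wheel.

```python
import shlex

# Assumption: extras named after the backends mentioned in this PR.
EXTRAS = ("vllm", "mcore", "automodel")


def warm_cache_commands(extras=EXTRAS):
    """Build one `uv sync --extra <name>` command per extra.

    The commands are returned as argument lists and are not executed,
    so this function is safe to call anywhere.
    """
    return [["uv", "sync", "--extra", name] for name in extras]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before running them.
    for cmd in warm_cache_commands():
        print(shlex.join(cmd))
```

Running the printed commands once on a GPU machine would pay the long build cost up front, so later environment syncs can reuse the cached build.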

Here's a reference run from #300 (red) and a rerun with the container from this PR (green):

image

@github-actions github-actions Bot added the CI Relating to CI label Jul 7, 2025
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI Relating to CI labels Jul 7, 2025
terrykong and others added 10 commits July 7, 2025 21:42
Signed-off-by: Terry Kong <terryk@nvidia.com>

fix

Signed-off-by: Terry Kong <terryk@nvidia.com>

fix setuptools

Signed-off-by: Terry Kong <terryk@nvidia.com>

bug

Signed-off-by: Terry Kong <terryk@nvidia.com>

unset variables to speed it up

Signed-off-by: Terry Kong <terryk@nvidia.com>

deal with flash-attn

Signed-off-by: Terry Kong <terryk@nvidia.com>

2.8.0.post2 prebuilt wheels!

Signed-off-by: Terry Kong <terryk@nvidia.com>

Revert "2.8.0.post2 prebuilt wheels!"

This reverts commit f945c50.

Signed-off-by: Terry Kong <terryk@nvidia.com>

revert

Signed-off-by: Terry Kong <terryk@nvidia.com>

uv.lock

Signed-off-by: Terry Kong <terryk@nvidia.com>

try wheel

Signed-off-by: Terry Kong <terryk@nvidia.com>

try whl

Signed-off-by: Terry Kong <terryk@nvidia.com>

comment

Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
@github-actions github-actions Bot added the CI Relating to CI label Jul 7, 2025
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jul 7, 2025
terrykong added 2 commits July 7, 2025 17:42
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jul 8, 2025
@terrykong terrykong changed the title Flash attn 2.7.4 feat: add flash-attn==2.7.4.post1 to core dependencies Jul 8, 2025
terrykong added 6 commits July 7, 2025 18:55
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
setuptools build meta backend

Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jul 8, 2025
terrykong added 3 commits July 7, 2025 20:53
This reverts commit fe90b1b.

Signed-off-by: Terry Kong <terryk@nvidia.com>
This reverts commit be282ef.

Signed-off-by: Terry Kong <terryk@nvidia.com>
This reverts commit cd2e221.

Signed-off-by: Terry Kong <terryk@nvidia.com>
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jul 8, 2025
terrykong added 2 commits July 8, 2025 06:00
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
@terrykong terrykong added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Jul 8, 2025
@terrykong terrykong changed the title feat: add flash-attn==2.7.4.post1 to core dependencies feat: add flash-attn==2.7.4.post1 to backend dependencies Jul 8, 2025
@terrykong terrykong added this pull request to the merge queue Jul 8, 2025
@terrykong terrykong added the r0.3.0 Release r0.3.0 label Jul 8, 2025
Merged via the queue into main with commit 099730b Jul 8, 2025
33 of 36 checks passed
@terrykong terrykong deleted the flash-attn-2.7.4 branch July 8, 2025 19:56
jialei777 pushed a commit to jialei777/nemo-rl that referenced this pull request Jul 23, 2025
…o#622)

Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Sahil Jain <sahilj@nvidia.com>
Signed-off-by: Jialei Chen <jialeic@google.com>
KiddoZhu pushed a commit that referenced this pull request Jul 28, 2025
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Sahil Jain <sahilj@nvidia.com>
FannYYW pushed a commit to xxman-google/NeMo-RL that referenced this pull request Aug 5, 2025
…o#622)

Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Sahil Jain <sahilj@nvidia.com>
Labels

CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI r0.3.0 Release r0.3.0

2 participants