
[coloattention] coloattention support flash attention 2 #4347

Merged
kurisusnowdeng merged 1 commit into hpcaitech:main from flybird11111:update-coloattention
Aug 4, 2023

Conversation

@flybird11111
Contributor

@flybird11111 flybird11111 commented Jul 28, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

#4322

📝 What does this PR do?

Summarize your work here.
If you have any plots/diagrams/screenshots/tables, please attach them here.

Improved ColoAttention to utilize the latest Flash Attention 2 (a simplified sketch of the dispatch follows this list):

  1. flash_attn_func is used for attention without padding.
  2. Attention with padding uses SeqLenInfo, unpad, and repad in order to work with flash_attn_varlen_func.
  3. Flash Attention 2 only supports fp16/bf16 on Ampere or newer GPUs. For other precisions or hardware, we still use xformers to accelerate attention.
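
A minimal sketch of this dispatch, assuming the public flash_attn_func / flash_attn_varlen_func entry points of the flash-attn 2 package; flash_attn_supported and the inline unpad/repad logic here are hypothetical simplifications of the PR's SeqLenInfo, unpad, and repad helpers, not the actual implementation:

```python
import torch
import torch.nn.functional as F
from flash_attn import flash_attn_func, flash_attn_varlen_func


def flash_attn_supported(t: torch.Tensor) -> bool:
    # Hypothetical capability check: Flash Attention 2 needs fp16/bf16 and an
    # Ampere-or-newer GPU (SM80+); anything else would fall back to xformers.
    return t.is_cuda and t.dtype in (torch.float16, torch.bfloat16) \
        and torch.cuda.get_device_capability(t.device)[0] >= 8


def attention(q, k, v, attention_mask=None, causal=False):
    # q, k, v: (batch, seqlen, num_heads, head_dim)
    # attention_mask: optional (batch, seqlen) 0/1 padding mask (assumption).
    if attention_mask is None:
        # No padding: use the fixed-length fast path.
        return flash_attn_func(q, k, v, causal=causal)

    # Padded input: pack the valid tokens into one (total_tokens, ...) tensor
    # ("unpad") and describe sequence boundaries with cumulative lengths.
    b, s, h, d = q.shape
    valid = attention_mask.to(torch.bool).flatten()              # (b * s,)
    seqlens = attention_mask.sum(dim=-1, dtype=torch.int32)      # tokens per sequence
    cu_seqlens = F.pad(seqlens.cumsum(0, dtype=torch.int32), (1, 0))
    max_seqlen = int(seqlens.max())

    q_p, k_p, v_p = (t.reshape(b * s, h, d)[valid] for t in (q, k, v))
    out = flash_attn_varlen_func(
        q_p, k_p, v_p, cu_seqlens, cu_seqlens, max_seqlen, max_seqlen, causal=causal
    )

    # "Repad": scatter the packed output back into the padded layout.
    padded = torch.zeros(b * s, h, d, dtype=out.dtype, device=out.device)
    padded[valid] = out
    return padded.reshape(b, s, h, d)
```

The varlen path packs all valid tokens into a single total-tokens dimension and passes cumulative sequence lengths (cu_seqlens), which is the layout flash_attn_varlen_func expects; the zero-filled scatter at the end restores the padded (batch, seqlen, heads, head_dim) shape.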

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@github-actions
Contributor

The code coverage for the changed files is 15%.

Complete report:
Name                                                       Stmts   Miss  Cover
------------------------------------------------------------------------------
colossalai/kernel/cuda_native/flash_attention.py             298    298     0%
colossalai/kernel/cuda_native/flash_attn/flash_attn_2.py      36     19    47%
colossalai/kernel/cuda_native/flash_attn/mem_eff_attn.py      33     25    24%
colossalai/kernel/cuda_native/scaled_softmax.py               96     65    32%
tests/test_utils/test_flash_attention.py                      92     62    33%
------------------------------------------------------------------------------
TOTAL                                                        555    469    15%

@flybird11111 flybird11111 force-pushed the update-coloattention branch 2 times, most recently from 765bc08 to f161d8a on August 1, 2023 07:17
@flybird11111 flybird11111 force-pushed the update-coloattention branch from f161d8a to 5187c96 on August 1, 2023 07:34
@flybird11111 flybird11111 reopened this Aug 1, 2023
@flybird11111 flybird11111 force-pushed the update-coloattention branch from 9d28850 to e3dccfe on August 1, 2023 07:48
@github-actions
Contributor

github-actions Bot commented Aug 1, 2023

The code coverage for the changed files is 33%.

Complete report:
Name                                                 Stmts   Miss  Cover
------------------------------------------------------------------------
colossalai/kernel/cuda_native/fmha/flash_attn_2.py      36     19    47%
colossalai/kernel/cuda_native/fmha/mem_eff_attn.py      33     25    24%
colossalai/kernel/cuda_native/scaled_softmax.py         96     65    32%
tests/test_utils/test_flash_attention.py                92     62    33%
------------------------------------------------------------------------
TOTAL                                                  257    171    33%

Comment thread colossalai/kernel/cuda_native/fmha/fmha.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/fmha.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/fmha.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/utils.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/utils.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/flash_attn_2.py Outdated
Comment thread colossalai/kernel/cuda_native/fmha/mem_eff_attn.py Outdated
@flybird11111 flybird11111 force-pushed the update-coloattention branch from e3dccfe to 4b8df44 on August 2, 2023 08:01
Comment thread colossalai/kernel/cuda_native/mha/mem_eff_attn.py Outdated
@flybird11111 flybird11111 force-pushed the update-coloattention branch from 4b8df44 to 91f57e6 on August 2, 2023 10:05
Comment thread colossalai/kernel/cuda_native/mha/mha.py Outdated
Comment thread colossalai/kernel/cuda_native/mha/mha.py Outdated
Comment thread colossalai/kernel/cuda_native/mha/flash_attn_2.py
Comment thread colossalai/kernel/cuda_native/mha/utils.py Outdated
Comment thread colossalai/kernel/cuda_native/mha/mha.py Outdated
Comment thread colossalai/kernel/cuda_native/mha/flash_attn_2.py Outdated
@flybird11111 flybird11111 force-pushed the update-coloattention branch 5 times, most recently from 63fbeda to 9478c96 on August 4, 2023 03:47
[shardformer] coloattention support flash attention 2
@flybird11111 flybird11111 force-pushed the update-coloattention branch from 9478c96 to d604dd6 on August 4, 2023 03:56
@flybird11111
Contributor Author

[screenshot: CI results] All tests have passed.
