Feature/npu by FrankLeeeee · Pull Request #10 · FrankLeeeee/ColossalAI

FrankLeeeee · 2024-01-18T06:58:30Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

📝 What does this PR do?

Summarize your work here.
If you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

I have linked my PR to an issue (instruction)
My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
I have performed a self-review of my code
I have added thorough tests.
I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

🌝 Yes, I do.
🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

* update extension * update cpu adam * update is * add doc for cpu adam * update kernel * update commit * update flash * update memory efficient * update flash attn * update flash attention loader * update api * fix * update doc * update example time limit * reverse change * fix doc * remove useless kernel * fix * not use warning * update * update

* update accelerator * fix timer * fix amp * update * fix * update bug * add error raise * fix autocast * fix set device * remove doc accelerator * update doc * update doc * update doc * use nullcontext * update cpu * update null context * change time limit for example * udpate * update * update * update * [npu] polish accelerator code --------- Co-authored-by: Xuanlei Zhao <xuanlei.zhao@gmail.com> Co-authored-by: zxl <43881818+oahzxl@users.noreply.github.com>

FrankLeeeee and others added 17 commits November 30, 2023 13:25

[accelerator] init the accelerator module (hpcaitech#5129)

f4e72c9

* [accelerator] init the accelerator module * polish code * polish code * polish code * polish code

[npu] support triangle attention for llama (hpcaitech#5130)

d6df19b

* update fused attn * update spda * tri attn * update triangle * import * fix * fix

[hotfix] removed unused flag (hpcaitech#5242)

9102d65

[ci] fixed ddp test (hpcaitech#5254)

2b83418

* [ci] fixed ddp test * polish

fix typo in applications/ColossalEval/README.md (hpcaitech#5250)

756c400

[ci] fix shardformer tests. (hpcaitech#5255)

e830ef9

* fix ci fix * revert: revert p2p * feat: add enable_metadata_cache option * revert: enable t5 tests --------- Co-authored-by: Wenhao Chen <cwher@outlook.com>

[doc] fix doc typo (hpcaitech#5256)

c174c4f

* [doc] fix annotation display * [doc] fix llama2 doc

[hotfix]: add pp sanity check and fix mbs arg (hpcaitech#5268)

ef4f0ee

* fix: fix misleading mbs arg * feat: add pp sanity check * fix: fix 1f1b sanity check

[workflow] fixed incomplete bash command (hpcaitech#5272)

04244aa

[workflow] fixed oom tests (hpcaitech#5275)

d69cd2e

* [workflow] fixed oom tests * polish * polish * polish

[ci] fix test_hybrid_parallel_plugin_checkpoint_io.py (hpcaitech#5276)

2a0558d

* fix ci fix * fix test * revert: revert p2p * feat: add enable_metadata_cache option * revert: enable t5 tests * fix --------- Co-authored-by: Wenhao Chen <cwher@outlook.com>

[shardformer] hybridparallelplugin support gradients accumulation. (hpcaitech#5246)

46e0916

* support gradients acc fix fix fix fix fix fix fix fix fix fix fix fix fix * fix fix * fix fix fix

[hotfix] Fix ShardFormer test execution path when using sequence parallelism (hpcaitech#5230)

5d9a0ae

Merge branch 'main' into sync/npu

1484693

Merge pull request hpcaitech#5278 from ver217/sync/npu

d66e698

[sync] sync npu branch with main

FrankLeeeee closed this Jan 18, 2024

FrankLeeeee wants to merge 17 commits into main from feature/npu

8 participants
