
fra #76

Merged

jamesthesnake merged 15 commits into jamesthesnake:ra from hpcaitech:main on Jun 30, 2023

Conversation

@jamesthesnake
Owner

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

📝 What does this PR do?

Summarize your work here.
If you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

FrankLeeeee and others added 15 commits June 22, 2023 14:41
* refactor: adapt boost API in base and naive strategies

* fix: initialize plugin after setup_distributed

* fix: fix save_pretrained fn

* refactor: adapt boost API in DDPStrategy

* to: add _post_init check

* to: fix ddp backward, modify ddp dataloader and unwrap

* feat: adapt boost API in ColossalAIStrategy

* fix: call setup_distributed before using get_current_device

* fix: fix save_model and save_optimizer

* test: remove save_sharded_optimizer test

* style: apply formatter

* fix: fix stage check and add comments

* feat: allow dict type arg in strategy.prepare

* to: temporarily remove lr_scheduler for testing

* style: simplify init of ColossalAIStrategy

* fix: fix lr_scheduler in sft and rm

* style: modify comments

* test: add train_prompts tests

* fix: fix inference only case and use in train_prompts

* test: skip failed tests in ci

* style: fix CodeFactor check

* fix: do not use model.to('cpu') with GeminiPlugin

* test: enable colossalai_gemini tests

* test: set CUDA_VISIBLE_DEVICES in ci

* docs: add note
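
The strategy commits above revolve around migrating to Colossal-AI's booster API. As a hedged sketch of that plugin-driven flow (the tiny model and optimizer are placeholder assumptions, not code from this PR):

```python
# Rough sketch of the colossalai.booster flow these commits adapt to;
# run under torchrun. The model/optimizer below are placeholders.
import torch
import colossalai
from colossalai.booster import Booster
from colossalai.booster.plugin import GeminiPlugin

# Distributed init comes first (cf. "call setup_distributed before
# using get_current_device" above).
colossalai.launch_from_torch(config={})

model = torch.nn.Linear(8, 2).cuda()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One Booster wraps model and optimizer behind a swappable plugin, so a
# strategy only picks the plugin instead of hand-rolling its own setup.
booster = Booster(plugin=GeminiPlugin())
model, optimizer, *_ = booster.boost(model, optimizer)

# With GeminiPlugin, parameters are managed by Gemini, so patterns like
# model.to('cpu') are avoided (cf. the GeminiPlugin fix above).
```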

[gemini] Rename arguments in chunk configuration searching
* fix chat eval

* fix utils

* fix utils

* add comment

---------

Co-authored-by: Qianran Ma <qianranm@luchentech.com>
* copy resnet example

* add pytest package

* skip test_ci

* skip test_ci

* skip test_ci
* fix some typos and problems in doc

* fix some typos and problems in doc

* add doc test
* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader
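
The trainer refactor above introduces SLTrainer as the shared supervised base for SFTTrainer/RMTrainer and OnPolicyTrainer for PPO, moving dataloaders out of the constructors and into fit. A hypothetical sketch of that signature change (class and argument names follow the commit messages, not the exact diff):

```python
# Hypothetical sketch of the "move dataloader to fit fn" change; names
# follow the commit messages above, not the actual PR code.
from torch.utils.data import DataLoader

class SLTrainer:
    def __init__(self, strategy, max_epochs: int, model, optimizer):
        # The trainer no longer captures a dataloader at construction time...
        self.strategy = strategy
        self.max_epochs = max_epochs
        self.model = model
        self.optimizer = optimizer

    def fit(self, train_dataloader: DataLoader, eval_dataloader: DataLoader = None):
        # ...it is supplied per fit() call instead, so one trainer instance
        # can be reused across datasets and the strategy can wrap the loader.
        for epoch in range(self.max_epochs):
            for batch in train_dataloader:
                ...  # forward/backward via self.strategy
```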
* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg
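
The final commit block deletes NaiveStrategy and splits ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy. A hedged sketch of what strategy selection might look like afterwards (the constructor arguments and string keys are assumptions; the import path follows Coati's layout):

```python
# Illustrative only: strategy names follow the commit messages; the exact
# constructor arguments in the PR may differ.
from coati.trainer.strategies import DDPStrategy, GeminiStrategy, LowLevelZeroStrategy

def build_strategy(name: str):
    # NaiveStrategy is gone; callers now pick an explicit backend.
    if name == "ddp":
        return DDPStrategy()
    if name == "colossalai_gemini":
        return GeminiStrategy(placement_policy="cuda")
    if name == "colossalai_zero2":
        return LowLevelZeroStrategy(stage=2)
    raise ValueError(f"unknown strategy: {name}")
```

Splitting the class makes the choice between ZeRO-style sharding and Gemini explicit at the call site rather than inferred from constructor flags.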
jamesthesnake merged commit 22936eb into jamesthesnake:ra on Jun 30, 2023