[chat] refactor trainer class by cwher · Pull Request #4080 · hpcaitech/ColossalAI

cwher · 2023-06-25T10:33:24Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

Close #4077.

📝 What does this PR do?

Summarize your work here.
if you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

I have linked my PR to an issue (instruction)
My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
I have performed a self-review of my code
I have added thorough tests.
I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

🌝 Yes, I do.
🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

* to: add SLTrainer * refactor: refactor RMTrainer and SFTTrainer * fix: fix init file * feat: remove on_learn_epoch fn as not used * fix: align with modified gemini arguments * to: add OnPolicyTrainer * revert: add _on_learn_epoch fn * refactor: refactor PPOTrainer * style: rename PPOTrainer argument * fix: align with modified PPO arguments * test: align with modified train_prompts arguments * chore: modify train_prompts * docs: align with modified arguments * fix: remove unnecessary output * fix: move dataloader to fit fn of SLTrainer * fix: move dataloader to fit fn of OnPolicyTrainer * fix: modify usage of prompt and pretrain dataloader

cwher added 13 commits June 27, 2023 11:02

to: add SLTrainer

c9c159f

refactor: refactor RMTrainer and SFTTrainer

36c52ef

fix: fix init file

e6a56f0

feat: remove on_learn_epoch fn as not used

89f77e4

fix: align with modified gemini arguments

03d0079

to: add OnPolicyTrainer

c1fb823

revert: add _on_learn_epoch fn

b6f7687

refactor: refactor PPOTrainer

523b57c

style: rename PPOTrainer argument

4a69297

fix: align with modified PPO arguments

8bd57a9

test: align with modified train_prompts arguments

8a277dc

chore: modify train_prompts

a2443e2

docs: align with modified arguments

dc82ade

cwher force-pushed the main branch from 312c347 to dc82ade Compare June 27, 2023 03:13

cwher requested a review from ver217 June 27, 2023 03:37

ver217 reviewed Jun 27, 2023

View reviewed changes

Comment thread applications/Chat/coati/trainer/base.py Outdated

ver217 reviewed Jun 27, 2023

View reviewed changes

Comment thread applications/Chat/coati/trainer/ppo.py Outdated

ver217 reviewed Jun 27, 2023

View reviewed changes

Comment thread applications/Chat/coati/trainer/sft.py Outdated

cwher added 3 commits June 27, 2023 14:01

fix: remove unnecessary output

9568185

fix: move dataloader to fit fn of SLTrainer

211dfb5

fix: move dataloader to fit fn of OnPolicyTrainer

69ab71b

cwher added the chatgpt ChatGPT Application label Jun 27, 2023

fix: modify usage of prompt and pretrain dataloader

128ec4f

cwher requested a review from ver217 June 29, 2023 02:17

ver217 approved these changes Jun 29, 2023

View reviewed changes

ver217 merged commit b03d64d into hpcaitech:main Jun 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[chat] refactor trainer class#4080

[chat] refactor trainer class#4080
ver217 merged 17 commits intohpcaitech:mainfrom
cwher:main

cwher commented Jun 25, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cwher commented Jun 25, 2023

📌 Checklist before creating the PR

🚨 Issue number

📝 What does this PR do?

💥 Checklist before requesting a review

⭐️ Do you enjoy contributing to Colossal-AI?

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants