[chat] refactor actor class by cwher · Pull Request #3968 · hpcaitech/ColossalAI

cwher · 2023-06-12T10:17:11Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

Close #3955.

📝 What does this PR do?

Summarize your work here.
If you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

I have linked my PR to an issue (instruction)
My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
I have performed a self-review of my code
I have added thorough tests.
I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

🌝 Yes, I do.
🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

* refactor: separate log_probs fn from Actor forward fn
* refactor: separate generate fn from Actor class
* feat: update unwrap_model and get_base_model
  * unwrap_model returns model not wrapped by Strategy
  * get_base_model returns HF model for Actor, Critic and RewardModel
* feat: simplify Strategy.prepare
* style: remove get_base_model method of Actor
* perf: tokenize text in batches
* refactor: move calc_action_log_probs to utils of model
* test: update test with new forward fn
* style: rename forward fn args
* fix: do not unwrap model in save_model fn of naive strategy
* test: add gemini test for train_prompts
* fix: fix _set_default_generate_kwargs
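The two helper behaviours described in the list above (`unwrap_model` returns the model not wrapped by Strategy; `get_base_model` returns the HF model) can be illustrated with a minimal, dependency-free sketch. Everything here is an assumption for illustration only: the stand-in classes `HFModel`, `Actor` and `StrategyWrapper`, the attribute names `.module` and `.model`, and the use of free functions are hypothetical and are not taken from the PR's diff.

```python
class HFModel:
    """Hypothetical stand-in for a Hugging Face model."""


class Actor:
    """Hypothetical stand-in for Actor/Critic/RewardModel.

    Assumes the HF model is stored in `.model` (attribute name is an assumption).
    """

    def __init__(self, model):
        self.model = model


class StrategyWrapper:
    """Hypothetical stand-in for a wrapper added by a Strategy.

    Assumes the wrapped object is exposed as `.module`.
    """

    def __init__(self, module):
        self.module = module


def unwrap_model(model):
    # Peel off every strategy wrapper until the original model is reached.
    while isinstance(model, StrategyWrapper):
        model = model.module
    return model


def get_base_model(model):
    # First remove strategy wrappers, then return the inner HF model.
    return unwrap_model(model).model


hf = HFModel()
actor = Actor(hf)
wrapped = StrategyWrapper(StrategyWrapper(actor))
print(unwrap_model(wrapped) is actor)  # strategy wrappers removed
print(get_base_model(wrapped) is hf)   # HF model recovered
```

Keeping the two steps separate means callers that need the Actor (for example, to call its forward fn) and callers that need the HF model (for example, to save it) can each ask for exactly the layer they want.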

cwher added 8 commits June 12, 2023 16:07

refactor: separate log_probs fn from Actor forward fn

034f95b

refactor: separate generate fn from Actor class

b76cc73

feat: update unwrap_model and get_base_model

0640ac4

* unwrap_model returns model not wrapped by Strategy
* get_base_model returns HF model for Actor, Critic and RewardModel

feat: simplify Strategy.prepare

85da317

style: remove get_base_model method of Actor

aa31f97

perf: tokenize text in batches

121f2ec

refactor: move calc_action_log_probs to utils of model

62ec8a3

test: update test with new forward fn

a86b2f1
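The commits above separate the log-probability computation from the Actor forward fn and move `calc_action_log_probs` into the model utils. As a rough illustration of what such a helper might compute, here is a sketch for a single sequence using plain Python lists instead of tensors. The signature, the single-sequence (non-batched) form and the `log_softmax` helper are assumptions made for this example, not the code merged in this PR.

```python
import math


def log_softmax(row):
    # Numerically stable log-softmax over one list of logits.
    m = max(row)
    log_z = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - log_z for x in row]


def calc_action_log_probs(logits, sequence, num_actions):
    """Return log-probs of the last `num_actions` tokens of `sequence`.

    logits:      list of per-position score lists, len(logits) == len(sequence)
    sequence:    list of token ids (prompt followed by generated actions)
    num_actions: number of generated tokens; assumed to be >= 1
    """
    # The logits at position t score the token at position t + 1,
    # so pair logits[:-1] with sequence[1:].
    log_probs = [
        log_softmax(row)[token]
        for row, token in zip(logits[:-1], sequence[1:])
    ]
    # Keep only the entries that correspond to generated (action) tokens.
    return log_probs[-num_actions:]


# Uniform logits over a 2-token vocabulary: every token has probability 0.5.
logits = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
sequence = [0, 1, 0]
print(calc_action_log_probs(logits, sequence, num_actions=1))
```

With the computation factored out like this, the forward fn only has to return the model output, and the trainer decides when action log-probs are needed.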

cwher force-pushed the main branch from 4ebeab7 to a86b2f1 Compare June 13, 2023 02:36

ver217 reviewed Jun 13, 2023

Comment thread applications/Chat/coati/models/base/actor.py Outdated

ver217 reviewed Jun 13, 2023

Comment thread applications/Chat/coati/trainer/strategies/naive.py Outdated

cwher added 4 commits June 13, 2023 10:51

style: rename forward fn args

36c53f0

fix: do not unwrap model in save_model fn of naive strategy

fe9b4f6

test: add gemini test for train_prompts

1b40e74

fix: fix _set_default_generate_kwargs

97e6f67

ver217 approved these changes Jun 13, 2023

ver217 merged commit 9d02590 into hpcaitech:main Jun 13, 2023

ver217 merged 12 commits into hpcaitech:main from cwher:main
