[coati] fix RM & MDP #3645

Closed
ht-zhou wants to merge 7 commits into main from coati-mppo

ht-zhou (Contributor) commented Apr 26, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

Close #4125, #3374.

📝 What does this PR do?

Summarize your work here.
If you have any plots, diagrams, screenshots, or tables, please attach them here.

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests.
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

  initial_model: Actor,
- kl_coef: float = 0.1) -> None:
+ kl_coef: float = 0.1,
+ max_length: int = 128,
These are all generation kwargs. Is it essential to declare these kwargs here explicitly?
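
To illustrate the alternative this comment hints at, here is a minimal sketch. It assumes a hypothetical `ExperienceMaker` class and a stand-in `generate` function; neither is the actual Coati code. Generation options such as `max_length` are collected as `**generate_kwargs` and forwarded unchanged, so the constructor signature does not grow with each new generation option.

```python
from typing import Any, Dict, List


def generate(prompt: List[int], max_length: int = 128, **kwargs: Any) -> List[int]:
    """Stand-in for a model's generate call (hypothetical).

    Pads the prompt with zeros and truncates it to max_length tokens.
    """
    return (prompt + [0] * max_length)[:max_length]


class ExperienceMaker:
    """Hypothetical sketch, not the actual Coati class."""

    def __init__(self, kl_coef: float = 0.1, **generate_kwargs: Any) -> None:
        self.kl_coef = kl_coef
        # Generation options are stored as-is instead of being declared one by one.
        self.generate_kwargs: Dict[str, Any] = generate_kwargs

    def make_experience(self, prompt: List[int]) -> List[int]:
        # Forward the stored options to the generation call.
        return generate(prompt, **self.generate_kwargs)


maker = ExperienceMaker(kl_coef=0.1, max_length=8)
sequence = maker.make_experience([1, 2, 3])
```

The trade-off is discoverability: explicit parameters document themselves in the signature, while forwarded kwargs keep the signature short but push validation to the generation call.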

TongLi3701 added chatgpt ChatGPT Application code reviewing labels Jun 2, 2023
cwher self-assigned this Jun 29, 2023
cwher mentioned this pull request Jun 30, 2023
cwher marked this pull request as draft July 24, 2023 02:16
cwher mentioned this pull request Jul 25, 2023 (10 tasks)
cwher closed this Aug 3, 2023
cwher deleted the coati-mppo branch December 1, 2023 03:50
Labels

chatgpt ChatGPT Application code reviewing

Development

Successfully merging this pull request may close these issues.

[chat] fix RM & MDP

4 participants