[chat] refactor trainer class #4077

@cwher

Description

The base Trainer class contains methods such as _on_fit_start and _on_fit_end, which are used only in RL training.

TODOs

  • Separate current base Trainer into SLTrainer and RLTrainer.
  • Update SFTTrainer.
  • Update RMTrainer.
  • Update PPOTrainer.
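
A minimal sketch of how the split might look, assuming the supervised trainers (SFT, RM) need only a plain epoch loop while the RL trainer (PPO) keeps the fit hooks. Only the names SLTrainer, RLTrainer, _on_fit_start and _on_fit_end come from this issue; the constructor arguments and the _train, _eval, _collect and _learn methods are hypothetical placeholders, not the existing API.

```python
from abc import ABC, abstractmethod


class SLTrainer(ABC):
    """Hypothetical base for supervised trainers (SFT, RM): no fit hooks."""

    def __init__(self, max_epochs: int) -> None:
        self.max_epochs = max_epochs

    @abstractmethod
    def _train(self, epoch: int) -> None:
        """Run one training epoch (placeholder name)."""

    @abstractmethod
    def _eval(self, epoch: int) -> None:
        """Run one evaluation pass (placeholder name)."""

    def fit(self) -> None:
        for epoch in range(self.max_epochs):
            self._train(epoch)
            self._eval(epoch)


class RLTrainer(ABC):
    """Hypothetical base for RL trainers (PPO): owns the fit hooks."""

    def __init__(self, num_episodes: int) -> None:
        self.num_episodes = num_episodes

    def _on_fit_start(self) -> None:
        """Hook called once before the first episode; no-op by default."""

    def _on_fit_end(self) -> None:
        """Hook called once after the last episode; no-op by default."""

    @abstractmethod
    def _collect(self, episode: int) -> None:
        """Gather experience for one episode (placeholder name)."""

    @abstractmethod
    def _learn(self, episode: int) -> None:
        """Update the model from collected experience (placeholder name)."""

    def fit(self) -> None:
        self._on_fit_start()
        for episode in range(self.num_episodes):
            self._collect(episode)
            self._learn(episode)
        self._on_fit_end()
```

With this layout, SFTTrainer and RMTrainer would subclass SLTrainer and never see the hooks, while PPOTrainer would subclass RLTrainer and override them as needed.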

Metadata

Assignees

Labels

chatgpt (ChatGPT Application)

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests