[BUG]: ChatGPT - There is a possibility that the generation is not working properly. #3088

### 🐛 Describe the bug

`generate_kwargs` is not properly passed to `PPOTrainer`, so the generated sentences might be incorrect (at least in my case).

https://github.com/hpcaitech/ColossalAI/blob/5d5f475d758347b5e61dbb4b0ccb6108821e3e93/applications/ChatGPT/chatgpt/trainer/ppo.py#L64-L76

As with the Hugging Face GPT-2 model, `prepare_inputs_fn` and `update_model_kwargs_fn` should be passed to `generate`. In the linked code, however, these functions are never applied, because `_set_default_generate_kwargs()` is called after `super().__init__()`, which has already received `generate_kwargs` by that point.

So I think the call order should be changed: `_set_default_generate_kwargs()` should run before `super().__init__()`.
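
To illustrate why the order matters, here is a minimal sketch. It assumes the base class stores the keyword arguments it receives and that the defaults helper fills in missing keys in place. `BaseTrainer`, `set_default_generate_kwargs`, `DefaultsAfterInit`, and `DefaultsBeforeInit` are hypothetical stand-ins, not the actual ColossalAI classes.

```python
class BaseTrainer:
    """Hypothetical stand-in for the trainer base class."""

    def __init__(self, **generate_kwargs):
        # `**generate_kwargs` builds a new dict in this frame, so later changes
        # to the caller's dict are not visible through self.generate_kwargs.
        self.generate_kwargs = generate_kwargs


def set_default_generate_kwargs(generate_kwargs):
    """Hypothetical helper: fill in missing defaults in place."""
    generate_kwargs.setdefault(
        "prepare_inputs_fn", lambda input_ids, **kw: {"input_ids": input_ids}
    )
    generate_kwargs.setdefault("update_model_kwargs_fn", lambda outputs, **kw: kw)


class DefaultsAfterInit(BaseTrainer):
    """Mirrors the order reported in this issue."""

    def __init__(self, **generate_kwargs):
        super().__init__(**generate_kwargs)
        # Too late: the base class already holds its own copy of the kwargs.
        set_default_generate_kwargs(generate_kwargs)


class DefaultsBeforeInit(BaseTrainer):
    """Mirrors the proposed order."""

    def __init__(self, **generate_kwargs):
        # Defaults are in the dict before it is handed to the base class.
        set_default_generate_kwargs(generate_kwargs)
        super().__init__(**generate_kwargs)


late = DefaultsAfterInit(max_length=128)
early = DefaultsBeforeInit(max_length=128)
print(sorted(late.generate_kwargs))   # ['max_length']
print(sorted(early.generate_kwargs))  # ['max_length', 'prepare_inputs_fn', 'update_model_kwargs_fn']
```

If the real helper depends on attributes that the base constructor sets, it would also need to be adapted before it can be moved.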

### Environment

_No response_

	experience_maker = NaiveExperienceMaker(actor, critic, reward_model, initial_model, kl_coef)
	replay_buffer = NaiveReplayBuffer(train_batch_size, buffer_limit, buffer_cpu_offload)
	super().__init__(strategy, experience_maker, replay_buffer, experience_batch_size, max_epochs, tokenizer,
	                 sample_replay_buffer, dataloader_pin_memory, callbacks, **generate_kwargs)
	self.actor = actor
	self.critic = critic

	self.actor_loss_fn = PolicyLoss(eps_clip)
	self.critic_loss_fn = ValueLoss(value_clip)

	self.actor_optim = actor_optim
	self.critic_optim = critic_optim
	self._set_default_generate_kwargs(generate_kwargs, actor)
