forked from yifan123/flow_grpo
-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
Hi @happynear, thanks for your great work!
I have a question about replacing flow-cps with flow-sde. When I made the switch, I noticed my model's rollout performance improved, but the training process became very unstable.
Did you just swap flow-cps for flow-sde directly, or did you have to change other hyperparameters to get it to work? I tried increasing num_steps and the noise level, thinking it might help since the final step in flow-cps has no standard deviation, but that approach failed. Thanks a lot!
Metadata
Metadata
Assignees
Labels
No labels