Skip to content

Conversation

@gongshaotian
Copy link
Collaborator

In RL training, CUDAGraph capture can only be performed after the weight is updated, and the capture needs to last for a period of time. The status of the worker will be updated immediately after the weight is updated in the original implementation, but the worker may not be ready at this time.

@paddle-bot
Copy link

paddle-bot bot commented Sep 1, 2025

Thanks for your contribution!

Copy link
Member

@SigureMo SigureMo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTMeow 🐾

@Jiang-Jia-Jun Jiang-Jia-Jun merged commit d3e4ae3 into PaddlePaddle:develop Sep 10, 2025
23 of 28 checks passed
qingqing01 pushed a commit that referenced this pull request Sep 11, 2025
* Adjust processing order

* fix bug

* fix update_parameters bug

* refine code
Jiang-Jia-Jun pushed a commit that referenced this pull request Sep 22, 2025
…#4178)

* Adjust processing order

* fix bug

* fix update_parameters bug

* refine code
@gongshaotian gongshaotian deleted the rl_multi_device branch November 3, 2025 06:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants