[Cherry-Pick][New][RL] Support Rollout Routing Replay (#5405) #5408

gongshaotian · 2025-12-05T14:14:01Z

[RL] Support Rollout Routing Replay
add routing indices cache
fix config bug and moe forward bug
R3 Support GLM
support eb4.5
fix merge bug
Apply suggestion from @Copilot
Apply suggestion from @Copilot
Apply suggestion from @Copilot
Apply suggestion from @Copilot
add routing replay ci
support glm topk
support orther top_k
fix ci bug
pre-commit
only support chatcmpl
Revert "Revert "[RL] Support Rollout Routing Replay ([Reverted][RL] Support Rollout Routing Replay #5321)" (Revert "[RL] Support Rollout Routing Replay" #5402)"

This reverts commit c45e064.

Fix XPU and NPU bug

Motivation

💡 If this PR is a Cherry Pick, the PR title needs to follow the format by adding the [Cherry-Pick] label at the very beginning and appending the original PR ID at the end. For example, [Cherry-Pick][CI] Add check trigger and logic(#5191)

💡 如若此PR是Cherry Pick，PR标题需遵循格式，在最开始加上[Cherry-Pick]标签，以及最后面加上原PR ID，例如[Cherry-Pick][CI] Add check trigger and logic(#5191)

Modifications

Usage or Command

Accuracy Tests

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

* [RL] Support Rollout Routing Replay * add routing indices cache * fix config bug and moe forward bug * R3 Support GLM * support eb4.5 * fix merge bug * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * add routing replay ci * support glm topk * support orther top_k * fix ci bug * pre-commit * only support chatcmpl * Revert "Revert "[RL] Support Rollout Routing Replay (PaddlePaddle#5321)" (PaddlePaddle#5402)" This reverts commit c45e064. * Fix XPU and NPU bug --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Yuanle Liu <yuanlehome@163.com>

paddle-bot · 2025-12-05T14:14:20Z

Thanks for your contribution!

codecov-commenter · 2025-12-05T15:28:20Z

Codecov Report

❌ Patch coverage is 59.03614% with 102 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (release/2.4@c45e064). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
...model_executor/layers/moe/routing_indices_cache.py	56.75%	59 Missing and 5 partials ⚠️
...el_executor/layers/moe/fused_moe_triton_backend.py	11.11%	6 Missing and 2 partials ⚠️
..._executor/layers/moe/fused_moe_deepgemm_backend.py	0.00%	7 Missing ⚠️
...del_executor/layers/moe/fused_moe_wint2_backend.py	0.00%	5 Missing ⚠️
fastdeploy/model_executor/layers/moe/moe.py	76.19%	4 Missing and 1 partial ⚠️
...l_executor/layers/moe/fused_moe_cutlass_backend.py	42.85%	4 Missing ⚠️
fastdeploy/engine/args_utils.py	70.00%	2 Missing and 1 partial ⚠️
...el_executor/layers/moe/fused_moe_marlin_backend.py	0.00%	3 Missing ⚠️
...odel_executor/layers/moe/fused_moe_backend_base.py	50.00%	2 Missing ⚠️
fastdeploy/worker/gpu_model_runner.py	94.11%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@              Coverage Diff               @@
##             release/2.4    #5408   +/-   ##
==============================================
  Coverage               ?   59.09%           
==============================================
  Files                  ?      327           
  Lines                  ?    40560           
  Branches               ?     6154           
==============================================
  Hits                   ?    23967           
  Misses                 ?    14745           
  Partials               ?     1848

Flag	Coverage Δ
GPU	`59.09% <59.03%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

EmmonsCurse approved these changes Dec 5, 2025

View reviewed changes

Jiang-Jia-Jun merged commit 707d1a1 into PaddlePaddle:release/2.4 Dec 8, 2025
34 of 46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Cherry-Pick][New][RL] Support Rollout Routing Replay (#5405) #5408

[Cherry-Pick][New][RL] Support Rollout Routing Replay (#5405) #5408

Uh oh!

gongshaotian commented Dec 5, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Dec 5, 2025

Uh oh!

codecov-commenter commented Dec 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Cherry-Pick][New][RL] Support Rollout Routing Replay (#5405) #5408

[Cherry-Pick][New][RL] Support Rollout Routing Replay (#5405) #5408

Uh oh!

Conversation

gongshaotian commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Dec 5, 2025

Uh oh!

codecov-commenter commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gongshaotian commented Dec 5, 2025 •

edited

Loading

codecov-commenter commented Dec 5, 2025 •

edited

Loading