refactor rejection sampler #4758

realliujiaxu · 2025-12-06T06:25:46Z

What this PR does / why we need it?

Currently, spec decoding uses AscendRejectionSampler, which extends RejectionSampler. AscendRejectionSampler overrides RejectionSampler's forward solely to replace the rejection_sample function. As a result, much of RejectionSampler's code cannot be reused.
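
To illustrate the reuse problem in general terms, here is a toy sketch. All class and function names below are hypothetical stand-ins, not the actual vLLM or vllm-ascend code, and the sampling rule is a simplified greedy one. It only shows how overriding forward to swap a single function drops whatever else the base forward does.

```python
def rejection_sample(draft, target):
    """Toy greedy rule: keep target tokens up to and including the first mismatch."""
    out = []
    for d, t in zip(draft, target):
        out.append(t)
        if d != t:
            break
    return out


def device_rejection_sample(draft, target):
    """Hypothetical device-specific kernel with the same behaviour."""
    return rejection_sample(draft, target)


class BaseSampler:
    """Hypothetical upstream sampler whose forward() does more than sampling."""

    def forward(self, draft, target):
        tokens = rejection_sample(draft, target)
        # Extra upstream logic that lives inside forward().
        return {"tokens": tokens, "logprobs": [0.0 for _ in tokens]}


class OverridingSampler(BaseSampler):
    """Hypothetical subclass that overrides forward() only to swap the kernel."""

    def forward(self, draft, target):
        # The base-class logprobs step is lost unless it is copied here by hand.
        return {"tokens": device_rejection_sample(draft, target)}


base_out = BaseSampler().forward([1, 2, 3], [1, 5, 3])
override_out = OverridingSampler().forward([1, 2, 3], [1, 5, 3])
```

In this sketch both samplers return the same tokens, but only the base class returns the extra field, which is the kind of divergence the refactor is meant to avoid.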

Proposed Change:

Delete AscendRejectionSampler and use RejectionSampler directly in the model runner.
Patch RejectionSampler.expand_batch_to_tokens and RejectionSampler.rejection_sample; a better approach may be to make them custom ops.
Modify NPUModelRunner following [V1][spec decode] return logprobs for spec decoding vllm#26060
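
The patching idea can be sketched as follows. This is a minimal illustration under stated assumptions: `upstream` is a stand-in module created on the spot, and `npu_rejection_sample` and `apply_patch` are hypothetical names, not the real vLLM or vllm-ascend APIs. The point is that the sampler class stays untouched and only the function it looks up is replaced.

```python
import types

# Stand-in for an upstream module (hypothetical, not the real vLLM module).
upstream = types.ModuleType("upstream_sampler")


def _generic_rejection_sample(draft, target):
    """Toy greedy rule: keep target tokens up to and including the first mismatch."""
    out = []
    for d, t in zip(draft, target):
        out.append(t)
        if d != t:
            break
    return out


class RejectionSampler:
    """Toy sampler that resolves rejection_sample on the module at call time."""

    def forward(self, draft, target):
        tokens = upstream.rejection_sample(draft, target)
        return {"tokens": tokens, "num_accepted": len(tokens)}


upstream.rejection_sample = _generic_rejection_sample
upstream.RejectionSampler = RejectionSampler

calls = []


def npu_rejection_sample(draft, target):
    """Hypothetical device-specific replacement; records that it was called."""
    calls.append("npu")
    return _generic_rejection_sample(draft, target)


def apply_patch():
    # Swap the module-level function; the sampler class itself is unchanged.
    upstream.rejection_sample = npu_rejection_sample


apply_patch()
result = upstream.RejectionSampler().forward([1, 2], [1, 2])
```

Because the toy forward looks the function up at call time, the patch takes effect without subclassing, so the rest of forward stays shared with upstream.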

Does this PR introduce any user-facing change?

How was this patch tested?

test logits processor for spec decoding
test logprobs for spec decoding
test logprobs for spec decoding + async scheduling
vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

github-actions · 2025-12-06T06:25:55Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

github-actions · 2025-12-06T06:26:43Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

gemini-code-assist

Code Review

This pull request refactors the rejection sampler implementation to better align with the upstream vLLM project. The custom AscendRejectionSampler class is removed in favor of using the standard vllm.v1.sample.rejection_sampler.RejectionSampler and monkey-patching its dependencies with Ascend-optimized implementations. This is a solid architectural improvement that will enhance maintainability. The sample_tokens method in NPUModelRunner has also been cleanly refactored into smaller, more focused helper methods. The changes appear correct and logically sound. I have not found any issues of high or critical severity.

github-actions · 2025-12-06T06:33:34Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-12-06T06:37:30Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: realliujiaxu <realliujiaxu@163.com>

github-actions · 2025-12-06T09:20:03Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions bot added the merge-conflicts label Dec 6, 2025

realliujiaxu marked this pull request as draft December 6, 2025 06:27

gemini-code-assist bot reviewed Dec 6, 2025

github-actions bot removed the merge-conflicts label Dec 6, 2025

realliujiaxu force-pushed the refactor_rejection_sampler branch from 5f90fbc to bb549e1 Compare December 6, 2025 06:33

github-actions bot added merge-conflicts, documentation, module:tests, module:ops, module:core, module:quantization and removed merge-conflicts labels Dec 6, 2025

realliujiaxu force-pushed the refactor_rejection_sampler branch from bb549e1 to 418f847 Compare December 6, 2025 06:37

github-actions bot added the merge-conflicts label Dec 6, 2025

realliujiaxu added 2 commits December 6, 2025 14:39

refactor

081c1eb

Signed-off-by: realliujiaxu <realliujiaxu@163.com>

delete AscendRejectionSampler

95ed77f

Signed-off-by: realliujiaxu <realliujiaxu@163.com>

realliujiaxu force-pushed the refactor_rejection_sampler branch from 418f847 to 95ed77f Compare December 6, 2025 06:39

github-actions bot added merge-conflicts and removed merge-conflicts, documentation, module:tests, module:ops, module:core, module:quantization labels Dec 6, 2025

fix lint

147c6f6

Signed-off-by: realliujiaxu <realliujiaxu@163.com>

realliujiaxu force-pushed the refactor_rejection_sampler branch from 651f652 to 147c6f6 Compare December 6, 2025 06:46

fix lint

388964c

Signed-off-by: realliujiaxu <realliujiaxu@163.com>

realliujiaxu force-pushed the refactor_rejection_sampler branch from 1362dab to 388964c Compare December 6, 2025 07:16

github-actions bot added the merge-conflicts label Dec 6, 2025
