[Feature][OP] Append Attn Support CUDA-PDL #5072

ckl117 · 2025-11-16T14:31:15Z

Motivation

Before enabling PDL, there is a bubble between two kernels.

After enabling PDL, there is almost no bubbles between kernels.

Modifications

A new env FD_ENABLE_PDL, with a default value of 1, which means CUDA-PDL is enabled when sm>=90.
Modified the startup method of the kernel.
Changed some kernel names (when input's dtype is int).

Usage or Command

export FD_ENABLE_PDL=0 will disable CUDA-PDL.

Accuracy Tests

Append Attn Support CUDA-PDL

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

…into develop

paddle-bot · 2025-11-16T14:31:22Z

Thanks for your contribution!

ckl117 added 4 commits November 14, 2025 16:27

support pdl for append attn

c539a66

Merge branch 'develop' of https://github.com/PaddlePaddle/FastDeploy …

438d7d4

…into develop

encoder attn kernel

f86551c

more kernel support pdl

6b12124

ckl117 added 2 commits November 16, 2025 23:48

code check

a1ff42f

all append_attn kernel support pdl

2d8f0e6

ckl117 changed the title ~~[OP] Append Attn Support CUDA-PDL~~ [Feature][OP] Append Attn Support CUDA-PDL Nov 17, 2025

zhoutianzi666 approved these changes Nov 17, 2025

View reviewed changes

zhoutianzi666 merged commit d58c1db into PaddlePaddle:develop Nov 17, 2025
14 of 16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature][OP] Append Attn Support CUDA-PDL #5072

[Feature][OP] Append Attn Support CUDA-PDL #5072

Uh oh!

ckl117 commented Nov 16, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Nov 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Feature][OP] Append Attn Support CUDA-PDL #5072

[Feature][OP] Append Attn Support CUDA-PDL #5072

Uh oh!

Conversation

ckl117 commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Nov 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ckl117 commented Nov 16, 2025 •

edited

Loading