add fake for MLA RoPE operator #1714
base: main
Conversation
Pull request overview
This PR adds torch.compile support for the fused_qk_rope_cat_and_cache_mla function by introducing a fake tensor function that simulates tensor shapes and dtypes without actual computation. This is required for SGLang's torch.compile integration.
- Adds a fused_qk_rope_cat_and_cache_mla_fake_tensor function to generate fake tensors for torch.compile (see the sketch below)
- Updates the return type to always return 5 tensors (including q_nope_zeros_out) for consistency
- Adds type hints and improves type annotations for better code clarity
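For context, a fake (meta) implementation only mirrors the output shapes and dtypes of the real kernel. The sketch below shows this general pattern using torch.library.register_fake; the op namespace ("aiter::..."), argument names, and output shapes are assumptions made for illustration, not the actual signature from this PR, and the real custom op is assumed to be registered with the dispatcher already.

```python
import torch

# Illustrative sketch only: namespace, arguments, and shapes are assumptions,
# not the signature added in this PR.
@torch.library.register_fake("aiter::fused_qk_rope_cat_and_cache_mla")
def fused_qk_rope_cat_and_cache_mla_fake_tensor(
    q_nope, q_rope, k_nope, k_rope, kv_cache, slot_mapping, cos_sin_cache
):
    num_tokens, num_heads = q_nope.shape[0], q_nope.shape[1]
    head_dim = q_nope.shape[-1] + q_rope.shape[-1]

    # Under FakeTensorMode only shapes and dtypes matter; no RoPE math or cache
    # writes happen here, we only allocate outputs of the expected shape.
    q_out = q_nope.new_empty((num_tokens, num_heads, head_dim))
    k_out = k_nope.new_empty((num_tokens, 1, head_dim))
    q_nope_zeros_out = q_nope.new_empty(q_nope.shape)
    # Placeholders for the remaining two outputs of the 5-tensor return.
    kv_cache_out = kv_cache.new_empty(kv_cache.shape)
    q_rope_out = q_rope.new_empty(q_rope.shape)
    return q_out, k_out, q_nope_zeros_out, kv_cache_out, q_rope_out
```

With such a fake registered, FakeTensorMode (and therefore torch.compile) can trace calls to the op without executing the device kernel.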
k50112113 left a comment
Looks good! Thanks for the addition. I think we are not going to let torch.compile see inside this function in any case, so this is a pretty decent change.
Motivation
For the function fused_qk_rope_cat_and_cache_mla, SGLang needs a fake (meta) implementation of it so the op can pass through torch.compile.
Technical Details
This commit needs a corresponding SGLang commit to be merged simultaneously, because the API has changed.
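To illustrate why a paired SGLang change is needed, a hypothetical call site is sketched below: since the op now always returns 5 tensors (including q_nope_zeros_out), the caller must unpack five values. The variable and argument names are placeholders, not the actual SGLang code.

```python
# Hypothetical SGLang-side call site, shown only to illustrate the unpacking
# change; names are placeholders. A caller written against the old return
# shape would break, hence the simultaneous SGLang commit.
(
    q_out,
    k_out,
    q_nope_zeros_out,
    kv_cache_out,
    q_rope_out,
) = fused_qk_rope_cat_and_cache_mla(
    q_nope, q_rope, k_nope, k_rope, kv_cache, slot_mapping, cos_sin_cache
)
```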
Test Plan
Test Result
Submission Checklist