Enable gptoss_sink #1753

LJ-underdog · 2025-12-30T04:06:14Z

Motivation

This PR adds support for a sink_ptr parameter across the multi-head attention (MHA) forward pass implementations. The sink_ptr enables the "gptoss_sink" feature, which is a mechanism for attention sink tokens in transformer models.
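For readers unfamiliar with attention sinks, here is a minimal sketch of one common formulation. It assumes the sink is a single per-head logit that joins the softmax denominator but contributes no value vector, as in GPT-OSS-style models. This PR does not spell out the math, so treat the function name `softmax_with_sink` and the formula as illustrative assumptions rather than the kernel's actual behavior.

```python
import math


def softmax_with_sink(scores, sink_logit):
    """Softmax over `scores` with one extra sink logit in the denominator.

    Hypothetical helper for illustration only; not part of this PR.
    The sink absorbs probability mass, so the returned weights sum to
    less than 1 whenever the sink logit is finite.
    """
    # Subtract the running maximum for numerical stability.
    m = max(max(scores), sink_logit)
    exps = [math.exp(s - m) for s in scores]
    denom = sum(exps) + math.exp(sink_logit - m)
    return [e / denom for e in exps]


# Attention weights for one query row of one head.
probs = softmax_with_sink([1.0, 2.0, 3.0], sink_logit=0.0)
```

With a sink logit of negative infinity the sink term vanishes and the function reduces to an ordinary softmax.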

Technical Details

Added sink_ptr parameter to all MHA forward function signatures (regular, varlen, and batch prefill variants)
Added validation logic to ensure sink_ptr matches device and shape requirements, with automatic dtype conversion to float32
Propagated sink_ptr through the call stack from Python interfaces to C++/CUDA implementations
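The validation step described above could look roughly like the sketch below. The helper name `prepare_sink`, its arguments, and the expected shape `(num_heads,)` are assumptions made for illustration; the PR only states that device and shape are checked and that the dtype is converted to float32.

```python
from typing import Optional

import torch


def prepare_sink(
    sink: Optional[torch.Tensor], q: torch.Tensor, num_heads: int
) -> Optional[torch.Tensor]:
    """Validate an optional sink tensor and return it as float32.

    Hypothetical helper, not the code from aiter/ops/mha.py.
    Assumes one sink value per attention head.
    """
    if sink is None:
        # The sink feature is optional; callers may pass nothing.
        return None
    if sink.device != q.device:
        raise ValueError("sink must be on the same device as q")
    if tuple(sink.shape) != (num_heads,):
        raise ValueError(f"sink must have shape ({num_heads},)")
    # Convert to float32 regardless of the input dtype.
    return sink.to(torch.float32).contiguous()


# Example: a float16 sink for 4 heads is accepted and converted.
q = torch.zeros(2, 5, 4, 8)
sink = prepare_sink(torch.ones(4, dtype=torch.float16), q, num_heads=4)
```

Returning `None` unchanged mirrors the assembly paths listed in the file summary, which pass a `nullptr` sink.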

Test Plan

Test in the CK repo.

Test Result

Local test passed.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Signed-off-by: Linjun-AMD <Jun.Lin@amd.com>

Copilot

Pull request overview

This PR adds support for a sink_ptr parameter across the multi-head attention (MHA) forward pass implementations. The sink_ptr enables the "gptoss_sink" feature, which is a mechanism for attention sink tokens in transformer models.

Key changes:

Added sink_ptr parameter to all MHA forward function signatures (regular, varlen, and batch prefill variants)
Added validation logic to ensure sink_ptr matches device and shape requirements, with automatic dtype conversion to float32
Propagated sink_ptr through the call stack from Python interfaces to C++/CUDA implementations

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 5 comments.

File	Description
csrc/py_itfs_cu/asm_mha_varlen_fwd.cu	Added nullptr sink_ptr argument to mha_fwd_args constructor
csrc/py_itfs_cu/asm_mha_fwd.cu	Added nullptr sink_ptr argument to mha_fwd_args constructor
csrc/py_itfs_ck/mha_varlen_fwd_kernels.cu	Added sink_ptr parameter handling and validation in varlen forward kernels
csrc/py_itfs_ck/mha_fwd_kernels.cu	Added sink_ptr parameter to standard MHA forward kernels
csrc/py_itfs_ck/mha_batch_prefill_kernels.cu	Added sink_ptr parameter to batch prefill kernels
csrc/include/torch/mha_varlen_fwd.h	Updated header to include sink_ptr parameter in function signature
csrc/include/torch/mha_fwd.h	Updated header to include sink_ptr parameter in function signature
csrc/include/torch/mha_batch_prefill.h	Updated header to include sink_ptr parameter in function signature
csrc/include/rocm_ops.hpp	Added sink_ptr pybind argument definitions for all MHA variants
aiter/ops/mha.py	Added sink_ptr parameter to all Python MHA functions with device/shape validation

csrc/py_itfs_ck/mha_batch_prefill_kernels.cu

aiter/ops/mha.py

csrc/include/torch/mha_batch_prefill.h

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Signed-off-by: Linjun-AMD <Jun.Lin@amd.com>

enable gptoss_sink

1f4df0b

Signed-off-by: Linjun-AMD <Jun.Lin@amd.com>

LJ-underdog requested review from a team and Copilot December 30, 2025 04:06

Copilot AI reviewed Dec 30, 2025

LJ-underdog and others added 5 commits December 30, 2025 13:41

Update aiter/ops/mha.py

eb84d96

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update csrc/py_itfs_ck/mha_batch_prefill_kernels.cu

50990ca

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update aiter/ops/mha.py

e5ecc87

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update csrc/include/torch/mha_batch_prefill.h

1ce2e29

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update mha_batch_prefill_kernels.cu

544a1f6

LJ-underdog changed the title ~~enable gptoss_sink~~ Enable gptoss_sink Dec 30, 2025

update mha_bwd parameter

361f94d

Signed-off-by: Linjun-AMD <Jun.Lin@amd.com>
