Skip to content

Conversation

@shay-li77
Copy link
Contributor

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

@shay-li77 shay-li77 requested review from a team and Copilot December 24, 2025 07:08
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates the dispatch logic for the GFX950 FMHA v3 forward kernel to handle the 192x128 head dimension configuration differently. The changes rename kernel identifiers from "hd192_hd128" to "hd192x128" and introduce special-case dispatch logic for this configuration.

  • Renames kernel name strings from "hd192_hd128" to "hd192x128" for consistency
  • Adds conditional dispatch logic to handle 192-dimension heads with different grid dimensions
  • Forces tune_opt to 0 for the 192x128 configuration in both wrapper functions

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +191 to +194
if (a.hdim_q == 192 && a.hdim_v == 128)
{
tune_opt = 0;
}
Copy link

Copilot AI Dec 24, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This logic is duplicated at lines 191-194 and 258-261. Consider extracting this into a helper function or consolidating the logic to avoid duplication.

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants