Gemm a8w8 bpreshuffle api fix #682

junhaha666 · 2025-07-21T03:04:45Z

No description provided.

Copilot

Pull Request Overview

This PR fixes API inconsistencies in the GEMM A8W8 bpreshuffle operation by standardizing type annotations and updating the configuration lookup logic. The changes focus on improving type consistency and modifying the fallback behavior for unsupported configurations.

Standardizes type annotations from generic Tensor to torch.Tensor and adds return type annotation
Updates configuration caching to support multiple tuned files simultaneously
Removes fallback to ASM implementation and adds explicit assertions for supported data types

Comments suppressed due to low confidence (1)

aiter/ops/gemm_op_a8w8.py:36

Parameter name 'Out' uses inconsistent capitalization. It should be 'out' to match the original naming convention and align with the variable 'Y' used in the function body.

    Out: torch.Tensor,

aiter/ops/gemm_op_a8w8.py

valarLip

LGTM

* add check before gemm_a8w8_bpreshuffle_ck * disable gemm_a8w8_ASM in gemm_a8w8_bpreshuffle api * fix get_CKGEMM_config

junhaha666 added 3 commits July 18, 2025 06:32

add check before gemm_a8w8_bpreshuffle_ck

7b0b565

disable gemm_a8w8_ASM in gemm_a8w8_bpreshuffle api

d94149f

fix get_CKGEMM_config

eaeae41

Copilot AI review requested due to automatic review settings July 21, 2025 03:04

Merge branch 'main' into gemm_a8w8_bpreshuffle_api_fix

f121da8

Copilot AI reviewed Jul 21, 2025

View reviewed changes

aiter/ops/gemm_op_a8w8.py Show resolved Hide resolved

valarLip approved these changes Jul 21, 2025

View reviewed changes

valarLip merged commit 0439b31 into main Jul 21, 2025
13 checks passed

valarLip deleted the gemm_a8w8_bpreshuffle_api_fix branch July 21, 2025 06:34

cagrikymk pushed a commit that referenced this pull request Jul 30, 2025

Gemm a8w8 bpreshuffle api fix (#682)

f1c20db

* add check before gemm_a8w8_bpreshuffle_ck * disable gemm_a8w8_ASM in gemm_a8w8_bpreshuffle api * fix get_CKGEMM_config

tjtanaa mentioned this pull request Aug 27, 2025

[Issue]: gemm_a8w8_bpreshuffle is mixing gemm_a8w8_ASM and gemm_a8w8_bpreshuffle_ck which have different layout. #671

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Gemm a8w8 bpreshuffle api fix #682

Gemm a8w8 bpreshuffle api fix #682

Uh oh!

junhaha666 commented Jul 21, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

valarLip left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Gemm a8w8 bpreshuffle api fix #682

Gemm a8w8 bpreshuffle api fix #682

Uh oh!

Conversation

junhaha666 commented Jul 21, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

valarLip left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants