[MetaxGPU][test] adjust warp size in mhc_pre_big_fuse by yesuweiYYYY · Pull Request #4 · MetaX-MACA/TileKernels-Metax

yesuweiYYYY · 2026-04-29T06:22:11Z

fix data mismatch 20% bug in mhc_pre_big_fuse.py in maca

gemini-code-assist

Code Review

This pull request increases the thread count from 96 to 128 and updates the thread binding threshold for shared memory operations from 32 to 64. A critical race condition was identified where threads reading from shared memory might do so before the writing threads have finished, necessitating the addition of a synchronization barrier.

gemini-code-assist · 2026-04-29T06:23:31Z

                T.copy(mixes, mixes_shared, disable_tma=True)

-            if T.get_thread_binding() < 32:
+            if T.get_thread_binding() < 64:


There is a race condition between the threads writing to mixes_shared (threads 0-63) and the threads reading from it (threads 64-127 in the else block). Since these two groups belong to different warps (assuming a warp size of 32 or 64), a synchronization barrier is required to ensure that the data written to shared memory by the first group is visible to the second group before it is accessed. Adding T.syncthreads() before the second conditional block will resolve this issue and likely fix the data mismatch mentioned in the PR description.

Suggested change

if T.get_thread_binding() < 64:

T.syncthreads()

if T.get_thread_binding() < 64:

[MetaxGPU][test] adjust warp size in mhc_pre_big_fuse

de8e007

gemini-code-assist Bot reviewed Apr 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MetaxGPU][test] adjust warp size in mhc_pre_big_fuse#4

[MetaxGPU][test] adjust warp size in mhc_pre_big_fuse#4
yesuweiYYYY wants to merge 1 commit intoMetaX-MACA:devfrom
yesuweiYYYY:dev_pr2

yesuweiYYYY commented Apr 29, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	if T.get_thread_binding() < 64:
	T.syncthreads()
	if T.get_thread_binding() < 64:

Conversation

yesuweiYYYY commented Apr 29, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant