Use full wave for all kernels except monolith #24

Open
doru1004 wants to merge 1 commit into amd-integration from use-block-dim-64

@doru1004

On AMD GPUs the wave size is 64. Use it as the block dimension for all kernels except monolith, where it has no effect.
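A rough way to see why a 32-thread block leaves lanes idle on wave-64 hardware is to compute how many lanes of the allocated wavefronts a block actually fills. The sketch below is illustrative only: `lane_utilization` is a hypothetical helper, not part of Genesis or Taichi, and it models lane occupancy rather than measured performance.

```python
import math


def lane_utilization(block_dim: int, wave_size: int) -> float:
    """Fraction of wavefront lanes occupied by one thread block.

    Hypothetical helper for illustration: a block is assumed to be
    rounded up to a whole number of wavefronts of `wave_size` lanes.
    """
    waves_per_block = math.ceil(block_dim / wave_size)
    return block_dim / (waves_per_block * wave_size)


# Compare the old and new block sizes on a wave-64 device.
for block_dim in (32, 64):
    print(block_dim, lane_utilization(block_dim, wave_size=64))
# prints:
# 32 0.5
# 64 1.0
```

Under this simplified model, `block_dim=32` fills half of a 64-lane wave, while `block_dim=64` fills it completely.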

Copilot AI review requested due to automatic review settings April 24, 2026 15:47

Copilot AI left a comment

Pull request overview

Updates constraint-solver kernel launch configuration to use a 64-thread block size (full wave on AMD GPUs) across the decomposed solver path and several core solver routines, while leaving the monolithic solver unchanged.

Changes:

  • Switched multiple ti.loop_config/qd.loop_config occurrences from block_dim=32 to block_dim=64.
  • Kept the monolithic solve kernel at block_dim=32 (per PR intent).

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| genesis/engine/solvers/rigid/constraint/solver_breakdown.py | Updates decomposed-step kernels to use block_dim=64. |
| genesis/engine/solvers/rigid/constraint/solver.py | Updates several solver loops (Hessian/Cholesky + gradient update paths) to block_dim=64, leaving monolith at 32. |

Comment thread genesis/engine/solvers/rigid/constraint/solver_breakdown.py
@doru1004 force-pushed the use-block-dim-64 branch 2 times, most recently from b02257a to d4dca4e (April 24, 2026 19:49)
@jamesETsmith

Genesis tests pass locally; perf is statistically equivalent to amd-integration.

@jamesETsmith

/run-ci

@yaoliu13
Collaborator

/run-ci

@yaoliu13
Collaborator

This PR is not based on the latest amd-integration: use-block-dim-64...ROCm:Genesis:amd-integration

@doru1004
Author

/run-ci
