feat(amdgpu): per-kernel LLVM function attributes via @qd.kernel(fn_attrs=...) by kevinjosephamd · Pull Request #11 · ROCm/quadrants

kevinjosephamd · 2026-04-23T15:32:33Z

Lets users override AMDGPU function attributes per kernel without editing the JIT pipeline. Attributes must be pre-registered in quadrants/program/fn_attrs_registry.h; unknown backend or attribute names raise QuadrantsSyntaxError at decoration time. Currently registered: amdgpu-max-num-workgroups, amdgpu-agpr-alloc, amdgpu-waves-per-eu, amdgpu-flat-work-group-size.

Examples:

@qd.kernel(fn_attrs={"amdgpu": {"amdgpu-max-num-workgroups": "128,1,1"}})
def k(...): ...

@qd.kernel(fn_attrs={"amdgpu": {"amdgpu-waves-per-eu": "1,2"}})
def k(...): ...
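The decoration-time check described above can be pictured with a small pure-Python sketch. It is only an illustration: the real registry is the C++ header `quadrants/program/fn_attrs_registry.h`, and the names `REGISTERED_FN_ATTRS`, `FnAttrsError`, and `validate_fn_attrs` are hypothetical stand-ins (the actual code raises `QuadrantsSyntaxError`).

```python
# Hypothetical Python mirror of the registry; the real one lives in
# quadrants/program/fn_attrs_registry.h and may be structured differently.
REGISTERED_FN_ATTRS = {
    "amdgpu": frozenset({
        "amdgpu-max-num-workgroups",
        "amdgpu-agpr-alloc",
        "amdgpu-waves-per-eu",
        "amdgpu-flat-work-group-size",
    }),
}


class FnAttrsError(ValueError):
    """Stand-in for QuadrantsSyntaxError so the sketch runs without quadrants."""


def validate_fn_attrs(fn_attrs):
    """Reject unknown backends or attribute names before any compilation happens."""
    for backend, attrs in fn_attrs.items():
        if backend not in REGISTERED_FN_ATTRS:
            raise FnAttrsError(f"unknown fn_attrs backend: {backend!r}")
        for name in attrs:
            if name not in REGISTERED_FN_ATTRS[backend]:
                raise FnAttrsError(f"unregistered {backend} attribute: {name!r}")
    return fn_attrs
```

Failing at decoration time means a misspelled attribute name surfaces when the module is imported rather than silently having no effect at kernel launch.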

Plumbed Python decorator -> Kernel.fn_attrs -> set_fn_attrs pybind -> codegen_llvm.cpp addFnAttr -> jit_amdgpu.cpp (defaults gated by hasFnAttribute so user values win). Included in both fastcache and frontend offline cache keys so changing fn_attrs forces a rebuild.
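Two behaviours in the paragraph above, user values taking precedence over backend defaults and `fn_attrs` participating in the cache key, can be sketched in plain Python. This is an assumption-laden model, not the actual implementation: `resolve_fn_attrs` and `cache_key` are hypothetical helpers, the default values in the usage note are made up, and the real gating happens in C++ via `hasFnAttribute` in `jit_amdgpu.cpp`.

```python
import hashlib
import json


def resolve_fn_attrs(user_attrs, backend_defaults):
    """Apply a default only when the user has not already set that attribute."""
    resolved = dict(user_attrs)
    for name, value in backend_defaults.items():
        # Analogous to guarding the default with a hasFnAttribute check.
        resolved.setdefault(name, value)
    return resolved


def cache_key(kernel_source, fn_attrs):
    """Hash the kernel source together with a canonical encoding of fn_attrs."""
    canonical = json.dumps(fn_attrs, sort_keys=True)
    payload = (kernel_source + "\0" + canonical).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()
```

With hypothetical defaults such as `{"amdgpu-waves-per-eu": "4,8"}`, a user-supplied `"1,2"` survives `resolve_fn_attrs` unchanged, and two kernels with identical source but different `fn_attrs` produce different keys, which is what forces the rebuild.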

kevinjosephamd · 2026-04-23T18:01:13Z

This change should have no impact on performance; it is purely a nice-to-have feature for developers trying to optimize existing kernels (especially for occupancy-related issues). By exposing these attributes at the Python DSL layer, we can keep the JIT C++ backend relatively clean and avoid hard-coding intricate rules that conditionally apply attributes based on things like the kernel name.

yaoliu13 · 2026-04-24T07:36:10Z

/run-ci

jamesETsmith

This looks great, thanks @kevinjosephamd!

yaoliu13

pre-submit is 776,886

kevinjosephamd · 2026-04-28T06:43:04Z

> pre-submit is 776,886

@yaoliu13 I don't expect this PR in isolation to have any impact on this number; it is a new feature that can be used from Genesis. See: ROCm/Genesis#39

yaoliu13 · 2026-04-29T07:10:10Z

> > pre-submit is 776,886
>
> @yaoliu13 I don't expect this PR in isolation to have any impact on this number; it is a new feature that can be used from Genesis. See: ROCm/Genesis#39

Sounds good. Let's make sure this PR doesn't hurt pre-submit throughput.

Depends on ROCm/quadrants#11 (per-kernel fn_attrs support). Values picked from a per-kernel sweep of (min,max) occupancy hints:

- kernel_step_1: 3,4
- kernel_step_2: 1,4
- func_solve_init: 2,4
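As a rough sketch of how the swept values above could be turned into `fn_attrs` arguments, the snippet below builds the dictionaries in plain Python. The table `WAVES_PER_EU` and the helper `waves_per_eu_attrs` are hypothetical; only the kernel names, the (min,max) pairs, and the `amdgpu-waves-per-eu` attribute name come from the text.

```python
# (min, max) occupancy hints quoted in the text above.
WAVES_PER_EU = {
    "kernel_step_1": (3, 4),
    "kernel_step_2": (1, 4),
    "func_solve_init": (2, 4),
}


def waves_per_eu_attrs(kernel_name):
    """Build an fn_attrs dict in the "min,max" string form used by the examples."""
    lo, hi = WAVES_PER_EU[kernel_name]
    return {"amdgpu": {"amdgpu-waves-per-eu": f"{lo},{hi}"}}
```

Assuming the decorator accepts any dictionary of this shape, the result would be passed as `@qd.kernel(fn_attrs=waves_per_eu_attrs("kernel_step_1"))`.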

…ttrs=...)

Lets users override AMDGPU codegen attributes per kernel without editing the JIT pipeline. Attributes must be pre-registered in quadrants/program/fn_attrs_registry.h; unknown backend or attribute names raise QuadrantsSyntaxError at decoration time. Currently registered: amdgpu-max-num-workgroups, amdgpu-agpr-alloc, amdgpu-waves-per-eu, amdgpu-flat-work-group-size.

Examples:

```python
@qd.kernel(fn_attrs={"amdgpu": {"amdgpu-max-num-workgroups": "128,1,1"}})
def k(...): ...

@qd.kernel(fn_attrs={"amdgpu": {"amdgpu-waves-per-eu": "1,2"}})
def k(...): ...
```

Plumbed Python decorator -> Kernel.fn_attrs -> set_fn_attrs pybind -> codegen_llvm.cpp addFnAttr -> jit_amdgpu.cpp (defaults gated by hasFnAttribute so user values win). Included in both fastcache and frontend offline cache keys so changing fn_attrs forces a rebuild.

kevinjosephamd · 2026-04-30T05:39:10Z

/run-ci

yaoliu13

LGTM

kevinjosephamd requested review from deepsek and jamesETsmith April 23, 2026 18:08

jamesETsmith approved these changes Apr 24, 2026

kevinjosephamd force-pushed the kejoseph/feature/expose_function_attr_to_dsl branch 2 times, most recently from 10c8e42 to 077e8e6 Compare April 27, 2026 00:26

kevinjosephamd mentioned this pull request Apr 27, 2026

[PERF IMPROVEMENT] Add amdgpu-waves-per-eu fn_attrs to kernels. ROCm/Genesis#39 (Open)

yaoliu13 requested changes Apr 28, 2026

kevinjosephamd force-pushed the kejoseph/feature/expose_function_attr_to_dsl branch from 077e8e6 to 033b9c5 Compare April 30, 2026 05:38

yaoliu13 approved these changes Apr 30, 2026

yaoliu13 merged commit f71121d into amd-integration Apr 30, 2026
38 of 46 checks passed

yaoliu13 merged 1 commit into amd-integration from kejoseph/feature/expose_function_attr_to_dsl
