Skip to content

Closed: Opened against wrong base#546

Closed
deepsek wants to merge 8 commits intoGenesis-Embodied-AI:mainfrom
ROCm:refactor/deepsek/amdgpu-addrspace-at-source
Closed

Closed: Opened against wrong base#546
deepsek wants to merge 8 commits intoGenesis-Embodied-AI:mainfrom
ROCm:refactor/deepsek/amdgpu-addrspace-at-source

Conversation

@deepsek
Copy link
Copy Markdown

@deepsek deepsek commented Apr 22, 2026

Opened against wrong base by mistake. Closed.

jamesETsmith and others added 8 commits April 10, 2026 14:42
Keep the clean 0.4.5-based branch buildable in Docker without bundling the int64 backport changes into the same commit.

Made-with: Cursor
…d removes some warning logic around int32. It does not make quadrants globally indexed by int64.
Copy link
Copy Markdown

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@deepsek deepsek closed this Apr 22, 2026
@deepsek deepsek changed the title [AMDGPU] Refactor address-space tagging to source visitors + fix block_dim coercion Closed: Opened against wrong base Apr 22, 2026
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6c9889f5a3

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Comment on lines +2028 to +2031
if (kernel_argument_struct_in_kernarg()) {
context_param_type = context_ty;
} else {
context_param_type = llvm::PointerType::get(context_ty, 0);
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Use pointer context when emitting debug arithmetic calls

After switching AMDGPU kernels to take RuntimeContext by value, get_arg(0) is no longer a RuntimeContext* for those kernels. The debug integer overflow helpers (debug_add/sub/mul/shl_*) still pass get_arg(0), so with compile_config.debug=true and integral ops, codegen will hit a function-signature mismatch (the runtime helpers expect RuntimeContext*). Please route these calls through get_context() so kernarg-by-value mode still passes a pointer.

Useful? React with 👍 / 👎.

Comment on lines +82 to +84
void AMDGPUDriver::malloc_async(void **dev_ptr, size_t size, void *stream) {
if (AMDGPUContext::get_instance().supports_mem_pool()) {
malloc_async_impl(dev_ptr, size, stream);
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Verify async HIP symbols before enabling mem-pool mode

malloc_async() and mem_free_async() call async HIP entry points whenever supports_mem_pool() is true, but that flag is derived from default mem-pool query success rather than async alloc/free symbol availability. On ROCm setups where mem-pool query exists but hipMallocAsync/hipFreeAsync are unavailable, this path dereferences an unavailable driver function and aborts at runtime. Mem-pool enablement should also require async alloc/free APIs (or fall back per call).

Useful? React with 👍 / 👎.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants