Fill CUDA Cast operator opset gap: extend registration from opset 23 to 25 by Copilot · Pull Request #27744 · microsoft/onnxruntime

Copilot · 2026-03-18T18:08:47Z

Description

Extends CUDA Cast kernel registration to cover opset 25 (latest ONNX spec). The existing non-versioned opset 23 registration is capped to VERSIONED (23, 24), and a new non-versioned opset 25 registration is added for all type specializations.

cast_op.cc:

REGISTER_KERNEL_TYPED(T): opset 23 → VERSIONED (23, 24), added non-versioned opset 25
Renamed REGISTER_KERNEL_TYPED_23 → REGISTER_KERNEL_TYPED_23_TO_24 (VERSIONED)
Added REGISTER_KERNEL_TYPED_25 macro (non-versioned)
Renamed SPECIALIZE_IMPL_19_TO_23 → SPECIALIZE_IMPL_19_TO_25, covering Float8 types through opset 25
Updated Float4E2M1x2 registration to use new versioned/non-versioned macros
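As a rough illustration of the versioned versus non-versioned split listed above, the following sketch uses a toy registry. It does not use the real ONNX Runtime macros: `TOY_REGISTER_VERSIONED`, `TOY_REGISTER`, `TOY_REGISTER_CAST_TYPED`, `KernelEntry`, `Registrar`, and `Registry()` are hypothetical names invented for this example, and modeling the open-ended upper bound with `INT_MAX` is an assumption of the sketch.

```cpp
#include <climits>
#include <string>
#include <vector>

// Hypothetical stand-in for one kernel registration record.
struct KernelEntry {
  std::string op;
  std::string type;
  int since;  // first opset covered by this registration
  int until;  // last opset covered; INT_MAX models "non-versioned" (assumption)
};

// Hypothetical registry used only by this sketch.
inline std::vector<KernelEntry>& Registry() {
  static std::vector<KernelEntry> entries;
  return entries;
}

// Appends one entry to the toy registry during static initialization.
struct Registrar {
  Registrar(const char* op, const char* type, int since, int until) {
    Registry().push_back({op, type, since, until});
  }
};

// Capped registration: covers opsets [start, end].
#define TOY_REGISTER_VERSIONED(op, T, start, end) \
  static Registrar reg_##op##_##T##_##start##_##end(#op, #T, start, end);

// Open-ended registration: covers opset `start` and later.
#define TOY_REGISTER(op, T, start) \
  static Registrar reg_##op##_##T##_##start(#op, #T, start, INT_MAX);

// One typed macro emits both the capped (23, 24) entry and the
// open-ended opset 25 entry, mirroring the pattern in the description.
#define TOY_REGISTER_CAST_TYPED(T)        \
  TOY_REGISTER_VERSIONED(Cast, T, 23, 24) \
  TOY_REGISTER(Cast, T, 25)

TOY_REGISTER_CAST_TYPED(float)
TOY_REGISTER_CAST_TYPED(double)
```

After static initialization, `Registry()` holds two entries per type: one capped at (23, 24) and one starting at opset 25.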

cuda_execution_provider.cc:

Forward declarations: all opset 23 Cast entries → VERSIONED (23, 24), added opset 25 non-versioned entries (all 16 types: 13 standard + 2 Float8 + 1 Float4)
BuildKernelCreateInfo: same pattern — capped 23 to (23, 24), added opset 25 block

Motivation and Context

The CUDA Cast operator was registered only up to opset 23, but the ONNX spec defines Cast through opset 25. This gap can cause kernel lookup failures when running models exported at opset 25. Part of the broader CUDA opset gap-filling effort tracked in #27729.
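The lookup failure can be sketched with a toy resolver. Everything here is hypothetical: `KernelRange`, `ResolveSinceVersion`, `Matches`, and `HasKernel` are invented for illustration, the list of schema revisions is passed in by the caller, and the matching rule (an open-ended registration only matches the schema revision it starts at, while a closed range matches any revision inside it) is an assumption of this sketch rather than a statement of ONNX Runtime's actual lookup code.

```cpp
#include <algorithm>
#include <climits>
#include <vector>

// Hypothetical registration range; until == INT_MAX models "non-versioned".
struct KernelRange {
  int since;
  int until;
};

// Picks the latest schema revision that is <= the model's opset.
// Returns -1 when no revision applies.
inline int ResolveSinceVersion(const std::vector<int>& schema_versions,
                               int model_opset) {
  int best = -1;
  for (int v : schema_versions) {
    if (v <= model_opset) best = std::max(best, v);
  }
  return best;
}

// Assumed matching rule (see lead-in): exact start match, or a closed
// range that contains the resolved schema revision.
inline bool Matches(const KernelRange& k, int since_ver) {
  return k.since == since_ver ||
         (k.since < since_ver && k.until != INT_MAX && k.until >= since_ver);
}

// True when at least one registered range can serve the model's opset.
inline bool HasKernel(const std::vector<KernelRange>& kernels,
                      const std::vector<int>& schema_versions,
                      int model_opset) {
  const int since_ver = ResolveSinceVersion(schema_versions, model_opset);
  if (since_ver < 0) return false;
  return std::any_of(kernels.begin(), kernels.end(),
                     [since_ver](const KernelRange& k) {
                       return Matches(k, since_ver);
                     });
}
```

Under these assumptions, and with schema revisions at 23, 24, and 25 supplied for illustration, a single open-ended registration `{23, INT_MAX}` serves opset 23 but not opset 25, while the pair `{23, 24}` and `{25, INT_MAX}` serves all three.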

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

tianleiwu · 2026-03-18T18:22:12Z

/azp run Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-03-18T18:22:24Z

Azure Pipelines successfully started running 1 pipeline(s).

github-actions

You can commit the suggested changes from lintrunner.

justinchuby · 2026-05-05T19:43:49Z

I think there are also issues with Reshape: #28368

tianleiwu

Summary

Clean mechanical opset gap-fill extending CUDA Cast kernel registrations from opset 23 to opset 25. The implementation correctly versions existing opset 23 entries as (23, 24) and adds non-versioned opset 25 entries, consistent with how the CPU EP already handles Cast at these opset versions.

Positives:

Complete contiguous opset coverage from opset 6 through 25 in the REGISTER_KERNEL_TYPED macro.
All 16 type specializations (13 standard + 2 Float8 + 1 Float4) properly registered for both versioned (23,24) and non-versioned opset 25 blocks.
Good test coverage: standard types, Float8, and Float4 at opset 25 with proper CUDA-only guards and compute capability checks.
Combining opsets 23 and 24 into one versioned registration is valid since the kernel implementation is identical between them.

LGTM.

Update Cast CUDA operator to fill opset gap from 23 to 25

a792709

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

Copilot AI assigned Copilot and tianleiwu Mar 18, 2026

Copilot created this pull request from a session on behalf of tianleiwu March 18, 2026 18:08

tianleiwu mentioned this pull request Mar 18, 2026

[Feature Request] Extend CUDA ONNX Ops to latest opset version #27729

Open

Copilot AI and others added 2 commits March 18, 2026 16:40

Update docs/OperatorKernels.md: Cast 23+ → 25+ with [23,24] version range for CUDA EP

550fb8c

Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

Merge remote-tracking branch 'origin/main' into copilot/update-cast-cuda-operator

ba73ecb

tianleiwu marked this pull request as ready for review March 18, 2026 23:48

github-actions Bot reviewed Mar 19, 2026

Comment thread onnxruntime/core/providers/cuda/tensor/cast_op.cc Outdated

Comment thread onnxruntime/core/providers/cuda/tensor/cast_op.cc Outdated

tianleiwu added 4 commits April 19, 2026 14:10

Merge remote-tracking branch 'origin/main' into copilot/update-cast-cuda-operator

abac052

lintrunner

3c5a8a9

add tests

d3c79f6

fix: use C-style bool array instead of vector<bool> for gsl::span

dcf024f

std::vector<bool> is bit-packed and cannot be converted to gsl::span. Use a C-style bool array with explicit span construction, matching the pattern used by existing bool tests in the same file.
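A minimal sketch of why the commit swaps the container: `ToySpan` is a hypothetical pointer-plus-length view standing in for `gsl::span` so the example stays self-contained, and `CountTrue` and `CountTrueFromArray` are invented helpers, not code from the test file.

```cpp
#include <cstddef>
#include <type_traits>
#include <vector>

// Hypothetical minimal view type standing in for gsl::span in this sketch.
template <typename T>
struct ToySpan {
  const T* data;
  std::size_t size;
};

// std::vector<bool> hands out a proxy object instead of bool&, so it has no
// contiguous bool storage that a pointer-based view could refer to.
static_assert(!std::is_same<std::vector<bool>::reference, bool&>::value,
              "vector<bool> elements are accessed through a proxy");

// Counts the true elements visible through the view.
inline std::size_t CountTrue(ToySpan<bool> values) {
  std::size_t n = 0;
  for (std::size_t i = 0; i < values.size; ++i) {
    if (values.data[i]) ++n;
  }
  return n;
}

// A C-style bool array is contiguous, so the view can be built explicitly
// from a pointer to its first element and its element count.
inline std::size_t CountTrueFromArray() {
  const bool expected[] = {true, false, true, true};
  ToySpan<bool> view{expected, sizeof(expected) / sizeof(expected[0])};
  return CountTrue(view);
}
```

Calling `CountTrueFromArray()` walks the four array elements through the view. The same construction from a `std::vector<bool>` would have no `bool*` to start from.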

tianleiwu mentioned this pull request May 5, 2026

Fix CUDA EP: add opset 24 kernel registrations for Reshape and Cast #28368

Closed

Merge main

31a36e7

Co-authored-by: Copilot <copilot@github.com>

tianleiwu requested review from justinchuby and titaiwangms May 5, 2026 19:41

justinchuby approved these changes May 5, 2026

tianleiwu enabled auto-merge (squash) May 5, 2026 19:56

tianleiwu approved these changes May 5, 2026

Comment thread onnxruntime/core/providers/cuda/cuda_execution_provider.cc

titaiwangms approved these changes May 5, 2026

tianleiwu merged commit ee5158e into main May 5, 2026
89 checks passed

tianleiwu deleted the copilot/update-cast-cuda-operator branch May 5, 2026 22:13

tianleiwu merged 8 commits into main from copilot/update-cast-cuda-operator

4 participants
