Skip to content

[CoreML] Update Conv and Softmax ops#24594

Merged
carzh merged 27 commits intomainfrom
carzh/coreml-reshape
Jun 6, 2025
Merged

[CoreML] Update Conv and Softmax ops#24594
carzh merged 27 commits intomainfrom
carzh/coreml-reshape

Conversation

@carzh
Copy link
Contributor

@carzh carzh commented Apr 29, 2025

Description

Moved the dimension limit because it seems to only apply to conv operations (texture memory is typically used for conv operations in the GPU because it has a slow write but fast read -- ChromaDB model had a slice operation with an input > 16384 -- operation worked fine after I had moved the dim check)

Also added extra checks for Softmax on MLProgram that allows more softmax nodes to be moved to CoreML

@carzh carzh requested review from edgchen1 and skottmckay May 1, 2025 20:37
@carzh carzh changed the title [CoreML] Update Reshape op to support more nodes [CoreML] Update Conv and Softmax ops Jun 6, 2025
@carzh carzh merged commit ebcf2eb into main Jun 6, 2025
88 of 89 checks passed
@carzh carzh deleted the carzh/coreml-reshape branch June 6, 2025 17:31
}

bool SoftmaxOpBuilder::IsOpSupportedImpl(const Node& node, const OpBuilderInputParams& /*input_params*/,
bool SoftmaxOpBuilder::IsOpSupportedImpl(const Node& node, const OpBuilderInputParams& input_params,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

input_params is not used. There is build warnings.

javier-intel pushed a commit to intel/onnxruntime that referenced this pull request Jun 15, 2025
### Description
Moved the dimension limit because it seems to only apply to conv
operations (texture memory is typically used for conv operations in the
GPU because it has a slow write but fast read -- ChromaDB model had a
slice operation with an input > 16384 -- operation worked fine after I
had moved the dim check)

Also added extra checks for Softmax on MLProgram that allows more
softmax nodes to be moved to CoreML
quic-ankus pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Nov 25, 2025
### Description
Moved the dimension limit because it seems to only apply to conv
operations (texture memory is typically used for conv operations in the
GPU because it has a slow write but fast read -- ChromaDB model had a
slice operation with an input > 16384 -- operation worked fine after I
had moved the dim check)

Also added extra checks for Softmax on MLProgram that allows more
softmax nodes to be moved to CoreML
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants