[CoreML] more performace flag by wejoncy · Pull Request #22975 · microsoft/onnxruntime

wejoncy · 2024-11-29T12:05:38Z

Description

refactor unsquzee's implementation
add more flags to boost peformance.
add profile flag

Motivation and Context

This reverts commit 5249880.

onnxruntime/core/providers/coreml/builders/impl/batch_norm_op_builder.cc

onnxruntime/core/providers/coreml/builders/impl/squeeze_op_builder.cc

onnxruntime/core/providers/coreml/model/model.mm

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h

onnxruntime/core/providers/coreml/model/model.mm

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h

onnxruntime/core/providers/coreml/builders/impl/batch_norm_op_builder.cc

onnxruntime/core/providers/coreml/builders/impl/squeeze_op_builder.cc

onnxruntime/core/providers/coreml/model/model.mm

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h

…ory.h Co-authored-by: Scott McKay <skottmckay@gmail.com>

### Description refactor unsquzee's implementation add more flags to boost peformance. add profile flag ### Motivation and Context  --------- Co-authored-by: jicwen <jicwen@YiMacBook-Pro.local> Co-authored-by: wejoncy <wejoncy@.com> Co-authored-by: Scott McKay <skottmckay@gmail.com>

### Description Add int64 as a supported datatype for moving nodes to the CoreML EP. We already convert constants automatically from int64 to int32 for CoreML by calling narrow. Adding the conversion for outputs as well. ### Motivation and Context - More nodes supported on CoreML ### Note on the Unsqueeze op According to #22975 there is a bug with the Unsqueeze op with scalar inputs on x86. I was running into a bug for unsqueezes that unsqueezed a scalar input to a tensor of shape [1] since CoreML doesn't support scalar values for MLProgram. I adapted the HandleX86ArchUnsqueeze method but alternatively, can replace with an identity operator or add some additional checks. I went with adapting the HandleX86ArchUnsqueeze method since it seemed like the fastest solution.

### Description Add int64 as a supported datatype for moving nodes to the CoreML EP. We already convert constants automatically from int64 to int32 for CoreML by calling narrow. Adding the conversion for outputs as well. ### Motivation and Context - More nodes supported on CoreML ### Note on the Unsqueeze op According to microsoft#22975 there is a bug with the Unsqueeze op with scalar inputs on x86. I was running into a bug for unsqueezes that unsqueezed a scalar input to a tensor of shape [1] since CoreML doesn't support scalar values for MLProgram. I adapted the HandleX86ArchUnsqueeze method but alternatively, can replace with an identity operator or add some additional checks. I went with adapting the HandleX86ArchUnsqueeze method since it seemed like the fastest solution.

more performace flag

0d547a2

wejoncy requested review from edgchen1 and skottmckay and removed request for skottmckay November 29, 2024 12:05

wejoncy marked this pull request as ready for review November 29, 2024 12:05

wejoncy added 2 commits December 1, 2024 19:18

fix

364b897

MLComputePlan

bf095dd

wejoncy force-pushed the jicwen/coreml_flag branch from 218fe66 to bf095dd Compare December 2, 2024 05:41

debug

5249880

wejoncy force-pushed the jicwen/coreml_flag branch from d64d79c to 5249880 Compare December 2, 2024 08:07

wejoncy added 3 commits December 2, 2024 01:24

Revert "debug"

74953ca

This reverts commit 5249880.

handle x64_cpu bug

37e77a5

Update squeeze_op_builder.cc

d57d62f

skottmckay reviewed Dec 4, 2024

View reviewed changes

add comments for new flag

e641750

github-actions bot reviewed Dec 5, 2024

View reviewed changes

onnxruntime/core/providers/coreml/model/model.mm Show resolved Hide resolved

onnxruntime/core/providers/coreml/model/model.mm Show resolved Hide resolved

wejoncy force-pushed the jicwen/coreml_flag branch from 5bc1ecf to 92a7c59 Compare December 5, 2024 03:55

bypass staticanalyze

123a4f0

wejoncy force-pushed the jicwen/coreml_flag branch from 92a7c59 to 123a4f0 Compare December 5, 2024 04:00

skottmckay reviewed Dec 6, 2024

View reviewed changes

add more comments for flag

17d2c4d

wejoncy requested a review from skottmckay December 9, 2024 01:46

skottmckay previously approved these changes Dec 9, 2024

View reviewed changes

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

Add comments for clang version checks

2bc621f

wejoncy dismissed skottmckay’s stale review via 2bc621f December 9, 2024 04:13

wejoncy and others added 2 commits December 9, 2024 12:14

Add comments for clang version checks

cf3d389

Update include/onnxruntime/core/providers/coreml/coreml_provider_fact…

063341c

…ory.h Co-authored-by: Scott McKay <skottmckay@gmail.com>

skottmckay previously approved these changes Dec 9, 2024

View reviewed changes

format

49625fc

wejoncy dismissed skottmckay’s stale review via 49625fc December 9, 2024 04:38

skottmckay approved these changes Dec 9, 2024

View reviewed changes

wejoncy merged commit e12421b into main Dec 10, 2024

wejoncy deleted the jicwen/coreml_flag branch December 10, 2024 01:35

carzh mentioned this pull request Apr 17, 2025

[CoreML] Add support for int64 #24462

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CoreML] more performace flag#22975

[CoreML] more performace flag#22975
wejoncy merged 14 commits intomainfrom
jicwen/coreml_flag

wejoncy commented Nov 29, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

wejoncy commented Nov 29, 2024

Description

Motivation and Context

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants