Skip to content

Conversation

@srkreddy1238
Copy link
Contributor

Various benchmarks enabled for CLML BYOC backend for Adreno GPU.
Networks resnet-18, resnet-34, resnet-50, densenet-121, inception_v3, mobilenetv1, squeezenet_v1.0, squeezenet_v1.1 are added with FP16 and FP32 dtypes.

Fixes added to handle batch_flatten, dropout to get largest sub graph possible.
CLML subgraphs runs under OpenCL runtime context, hence CLML execution is accounted as part of OpenCL profiling.

Various benchmarks enabled for CLML BYOC backend for Adreno GPU
Networks resnet-18, resnet-34, resnet-50, densenet-121,
inception_v3, mobilenetv1, squeezenet_v1.0, squeezenet_v1.1 are
added with FP16 and FP32 dtypes.
@tvm-bot
Copy link
Collaborator

tvm-bot commented Jan 4, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

  • No users to tag found in teams: benchmarks, clml See #10317 for details

Generated by tvm-bot

@masahi masahi merged commit aec46dc into apache:main Jan 4, 2023
fzi-peccia pushed a commit to fzi-peccia/tvm that referenced this pull request Mar 27, 2023
…e#13696)

* [BENCHMARKS][CLML] Adreno benchmarks with CLML BYOC path added

Various benchmarks enabled for CLML BYOC backend for Adreno GPU
Networks resnet-18, resnet-34, resnet-50, densenet-121,
inception_v3, mobilenetv1, squeezenet_v1.0, squeezenet_v1.1 are
added with FP16 and FP32 dtypes.

* * lint error
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants