Add conversion to fp16 #7

Merged

georgepaw merged 3 commits into main from dev/georgep/f32tof16 on Dec 18, 2023
Conversation

@georgepaw

Adapt the class which stores the intermediate tracing results to insert casts which will convert the model to fp16.
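The cast-insertion idea can be sketched in plain Python. This is a hedged illustration only, not the PR's actual implementation: `insert_fp16_casts` and the `traced_ops` list of `(name, fn)` pairs are hypothetical stand-ins for the intermediate-tracing-results class mentioned above.

```python
import numpy as np

def insert_fp16_casts(traced_ops):
    """Wrap traced ops so inputs are cast to fp16, intermediates stay
    in fp16, and the final output is cast back to fp32, preserving the
    model's fp32 signature. `traced_ops` is a hypothetical list of
    (name, fn) pairs standing in for the traced intermediate results."""
    def run(x):
        x = x.astype(np.float16)              # cast the input down
        for _, fn in traced_ops:
            x = fn(x).astype(np.float16)      # keep intermediates in fp16
        return x.astype(np.float32)           # restore the fp32 signature
    return run

# Usage: a toy two-op "model"
ops = [("scale", lambda x: x * 2.0), ("shift", lambda x: x + 1.0)]
model_fp16 = insert_fp16_casts(ops)
out = model_fp16(np.ones(4, dtype=np.float32))
```

The caller still sees fp32 in and fp32 out; only the internals run in half precision.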

@georgepaw
Author

@DenisVieriu97 I've not adapted the tests in test_mps - we currently document them as a way of exporting the models.

If we parameterize the tests, then the users will no longer be able to do that.

What kind of testing do you think we should do? Perhaps we can do some (ugly!) patching?

@georgepaw georgepaw force-pushed the dev/georgep/f32tof16 branch 3 times, most recently from 0cc41a5 to 54731c4 Compare December 12, 2023 14:42
@DenisVieriu97
Owner

DenisVieriu97 commented Dec 12, 2023

> @DenisVieriu97 I've not adapted the tests in test_mps - we currently document them as a way of exporting the models.
>
> If we parameterize the tests, then the users will no longer be able to do that.
>
> What kind of testing do you think we should do? Perhaps we can do some (ugly!) patching?

@georgepaw we could use it as a command line parameter (commented in the code) - nvm, I see you are already doing that.
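The command-line toggle discussed here could look like the following `argparse` sketch. It is illustrative only: the `--fp16` flag name and its default are assumptions, not the repository's actual interface.

```python
import argparse

def parse_args(argv=None):
    """Hypothetical export-script arguments: fp16 conversion is on by
    default, and users can opt out with --no-fp16 while keeping the
    documented export flow in test_mps intact."""
    parser = argparse.ArgumentParser(description="Export an example model")
    parser.add_argument(
        "--fp16",
        action=argparse.BooleanOptionalAction,  # provides --fp16 / --no-fp16
        default=True,
        help="Convert the exported model from fp32 to fp16",
    )
    return parser.parse_args(argv)

# Usage: opting out of the conversion
args = parse_args(["--no-fp16"])
```

`BooleanOptionalAction` (Python 3.9+) gives both the positive and negative flag forms for free, so the default-on behavior stays easy to override.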

@georgepaw georgepaw force-pushed the dev/georgep/f32tof16 branch 5 times, most recently from 35a3b96 to 7825bcd Compare December 14, 2023 12:13
@georgepaw georgepaw force-pushed the dev/georgep/f32tof16 branch from a2661be to a480991 Compare December 15, 2023 16:12
@georgepaw georgepaw force-pushed the dev/georgep/f32tof16 branch from a480991 to 2f3dda4 Compare December 18, 2023 14:01
Owner

@DenisVieriu97 left a comment
Looks good to me

@georgepaw georgepaw merged commit aa5480c into main Dec 18, 2023
@georgepaw georgepaw deleted the dev/georgep/f32tof16 branch December 18, 2023 18:55
DenisVieriu97 pushed a commit that referenced this pull request Jan 19, 2024
Add an option (enabled by default) to convert the model from FP32 to FP16 (and preserve the input/output signature).
facebook-github-bot referenced this pull request in pytorch/executorch Jan 23, 2024
… (iOS15+, macOS12+) (#1655)

Summary:
This PR changes the MPS Backend runtime to support **iOS15+/macOS12+** (the previous runtime was limited to iOS17/macOS14 only). Additionally, this PR contains changes such as support for both lifted and unlifted graphs, support for the torch.export API, optimizations for FP16, and faster model loading during runtime (more information in the summary).

**Summary of changes:**
- Add support for running the models in FP16 (https://github.com/DenisVieriu97/executorch/pull/7 georgepaw)
- Replace the previous MPS runtime from ExecuTorch, which relied on iOS 17 / macOS Sonoma APIs for serialization of the MPSGraphExecutable. Instead of creating the MPSGraph nodes and serializing them during AOT, create the corresponding entries for the EdgeIR nodes in the FlatBuffer, and parse them at runtime to construct the graph. This removes any dependency on iOS 17 / macOS 14.0 APIs.
- Add support for a node visitor pattern:
  - Each node visitor class visits an op and serializes its data into MPSTensors and MPSNodes, which are appended to the FlatBuffer.
  - The entries from the FlatBuffer are parsed at runtime, and from them the MPSGraph is constructed (for more info see the `MPSGraphBuilder` class and the corresponding ops in the `operators/` folder).
  - This method removes the additional reads and writes of the MPSGraphExecutable to disk (once during AOT and once during runtime).
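The visitor/builder split described above can be sketched as follows. This is a hedged Python illustration: `NodeVisitor`, `serialize`, and `build` are stand-in names, and plain dicts stand in for the FlatBuffer MPSNode records.

```python
class NodeVisitor:
    """Each registered visitor handles one op kind and serializes it
    into a plain dict entry (standing in for a FlatBuffer MPSNode)."""
    registry = {}

    @classmethod
    def register(cls, op):
        def wrap(fn):
            cls.registry[op] = fn
            return fn
        return wrap

@NodeVisitor.register("add")
def visit_add(node):
    return {"op": "add", "inputs": node["inputs"], "output": node["output"]}

def serialize(graph):
    # AOT side: every IR node becomes a serialized entry via its visitor.
    return [NodeVisitor.registry[n["op"]](n) for n in graph]

def build(entries, inputs):
    # Runtime side: replay the serialized entries to reconstruct and run
    # the graph, so no compiled executable is ever written to disk.
    env = dict(inputs)
    for e in entries:
        if e["op"] == "add":
            a, b = (env[i] for i in e["inputs"])
            env[e["output"]] = a + b
    return env

# Usage: round-trip a one-node graph through serialization and rebuild
graph = [{"op": "add", "inputs": ["x", "y"], "output": "z"}]
env = build(serialize(graph), {"x": 2, "y": 3})
```

The key property is that only data (the entries) crosses the AOT/runtime boundary; the executable graph is reconstructed fresh at load time.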

**Models summary:**

| Model | FP16 | FP32 |
| :---: | :---: | :---: |
| mul | ✅ | ✅ |
| add_mul | ✅ | ✅ |
| linear | ✅ | ✅ |
| edsr | ✅ | ✅ |
| Mobilebert | ❌ | ✅ |
| mv2 | ✅ | ✅ |
| mv3 | ✅ | ✅ |
| vit | ✅ | ✅ |
| w2l | ✅ | ✅ |
| ic3 | ✅ | ✅ |
| ic4 | ✅ | ✅ |
| resnet18 | ✅ | ✅ |
| resnet50 | ✅ | ✅ |
| Llama2 | ❌ | ✅ |
| emformer_join | ✅ | ✅ |
| emformer_predict | ❌ | ❌ |
| emformer_transcribe | ✅ | ✅ |
| dl3 | ❌ | ❌ |

Pull Request resolved: #1655

Reviewed By: cccclai

Differential Revision: D52929916

Pulled By: shoumikhin

fbshipit-source-id: 8bd2ed124311744ebe19fc17eb0ff508621f974a
manuelcandales pushed a commit that referenced this pull request Aug 27, 2025
BNNS copy crashes the process when the dtypes differ
(pytorch#11714).

With the example from that issue, the process crashes on main. Here is
the stack trace from LLDB:

```
Process 19234 stopped
* thread #1, queue = 'com.apple.main-thread', stop reason = signal SIGABRT
    frame #0: 0x0000000190ac9388 libsystem_kernel.dylib`__pthread_kill + 8
libsystem_kernel.dylib`__pthread_kill:
->  0x190ac9388 <+8>:  b.lo   0x190ac93a8    ; <+40>
    0x190ac938c <+12>: pacibsp 
    0x190ac9390 <+16>: stp    x29, x30, [sp, #-0x10]!
    0x190ac9394 <+20>: mov    x29, sp
(lldb) bt
* thread #1, queue = 'com.apple.main-thread', stop reason = signal SIGABRT
  * frame #0: 0x0000000190ac9388 libsystem_kernel.dylib`__pthread_kill + 8
    frame #1: 0x0000000190b0288c libsystem_pthread.dylib`pthread_kill + 296
    frame #2: 0x0000000190a0bc60 libsystem_c.dylib`abort + 124
    frame #3: 0x0000000190910174 libsystem_malloc.dylib`malloc_vreport + 892
    frame #4: 0x0000000190913c90 libsystem_malloc.dylib`malloc_report + 64
    frame #5: 0x000000019091821c libsystem_malloc.dylib`___BUG_IN_CLIENT_OF_LIBMALLOC_POINTER_BEING_FREED_WAS_NOT_ALLOCATED + 32
    frame #6: 0x000000019d2f4084 libBNNS.dylib`___lldb_unnamed_symbol1620 + 564
    frame #7: 0x000000019d2f5bac libBNNS.dylib`___lldb_unnamed_symbol1628 + 680
    frame #8: 0x000000019d69ce48 libBNNS.dylib`BNNSCopy + 616
    frame #9: 0x000000030c74d950 _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy_using_bnns(executorchcoreml::MultiArray const&, executorchcoreml::MultiArray&) + 188
    frame #10: 0x000000030c74cfdc _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy(executorchcoreml::MultiArray const&, executorchcoreml::MultiArray&, executorchcoreml::MultiArray::CopyOptions) + 72
    frame #11: 0x000000030c74ceec _portable_lib.cpython-310-darwin.so`executorchcoreml::MultiArray::copy(executorchcoreml::MultiArray&, executorchcoreml::MultiArray::CopyOptions) const + 148
    frame #12: 0x000000030c7488d4 _portable_lib.cpython-310-darwin.so`invocation function for block in (anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 376
    frame #13: 0x000000030c748ac8 _portable_lib.cpython-310-darwin.so`invocation function for block in (anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 52
    frame #14: 0x000000019ad33f4c CoreML`CoreML::MultiArrayBuffer::getBytesWithHandler(void (void const*, unsigned long) block_pointer) const + 340
    frame #15: 0x000000019ad34138 CoreML`-[MLMultiArray(ScopedBufferAccess) getBytesWithHandler:] + 152
    frame #16: 0x000000030c7485ec _portable_lib.cpython-310-darwin.so`(anonymous namespace)::copy(MLMultiArray*, executorchcoreml::MultiArray&) + 296
    frame #17: 0x000000030c744f68 _portable_lib.cpython-310-darwin.so`(anonymous namespace)::set_outputs(std::__1::vector<executorchcoreml::MultiArray, std::__1::allocator<executorchcoreml::MultiArray>>&, NSArray<MLMultiArray*>*) + 180
```


With this PR, the process succeeds.
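The intent of the fix can be sketched as a dtype guard in front of the fast copy path. This is a hedged Python illustration using NumPy; `safe_copy` is a hypothetical helper, not the actual `executorchcoreml` C++ code.

```python
import numpy as np

def safe_copy(src, dst):
    """BNNSCopy-style fast paths are only safe when source and
    destination dtypes match, so fall back to an explicit converting
    copy when they differ instead of crashing."""
    if src.dtype == dst.dtype:
        np.copyto(dst, src)                     # same dtype: raw copy is fine
    else:
        np.copyto(dst, src.astype(dst.dtype))   # differing dtype: convert first

# Usage: fp32 source into an fp16 destination (the crashing case above)
src = np.array([1.5, 2.5], dtype=np.float32)
dst = np.empty(2, dtype=np.float16)
safe_copy(src, dst)
```

The values 1.5 and 2.5 are exactly representable in fp16, so the converting path is lossless here; in general the conversion may round.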