make bundled_executor_runner only for bp by Gasoonjia · Pull Request #532 · pytorch/executorch

Gasoonjia · 2023-09-29T02:53:55Z

Summary: bundled executor runner should only focus on bundled program. remove the support for normal ExecuTorch Program.

Differential Revision: D49761307

netlify · 2023-09-29T02:53:59Z

✅ Deploy Preview for resplendent-gnome-14e531 ready!

Name	Link
🔨 Latest commit	`4ac3bd1`
🔍 Latest deploy log	https://app.netlify.com/sites/resplendent-gnome-14e531/deploys/651e9a3bb24c6a00085a99e1
😎 Deploy Preview	https://deploy-preview-532--resplendent-gnome-14e531.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2023-09-29T02:54:14Z

This pull request was exported from Phabricator. Differential Revision: D49761307

facebook-github-bot · 2023-10-02T21:25:03Z

This pull request was exported from Phabricator. Differential Revision: D49761307

facebook-github-bot · 2023-10-02T23:40:45Z

This pull request was exported from Phabricator. Differential Revision: D49761307

Summary: Update the export example for bundled program, including api update, comments and file name suffix. Differential Revision: https://internalfb.com/D49849371 fbshipit-source-id: fc8044dd2cbfeec87992264f18a2d0b976c5f7eb

Summary: Pull Request resolved: pytorch/executorch#532 bundled executor runner should only focus on bundled program. remove the support for normal ExecuTorch Program. Reviewed By: tarun292 Differential Revision: D49761307 fbshipit-source-id: 73edc68cede7b9a4b499dc0638703d62daf85821

facebook-github-bot · 2023-10-05T11:12:53Z

This pull request was exported from Phabricator. Differential Revision: D49761307

facebook-github-bot · 2023-10-06T02:13:26Z

This pull request has been merged in 7c64f01.

* clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo

* make --device fast the default * Update iOS.md (#517) * Update iOS.md * Update iOS.md * Pip to pip3 (#504) * remove macos-12 test * pip to pip3 * break aoti CI jobs separately (#500) * init * fixes * more fixes * fixes * fix * fix * bug fix * add objcopy update * suppress int8 * undefined variable --------- Co-authored-by: Michael Gschwind <mikekg@meta.com> * Support llama3 in chat in run.cpp (#486) * refactor chat runner in preparation for llama3 * add sketch for llama3 prompt template and move to returning tokens * fix tiktoken * fixes to chat * add default llama_ver * Add tests for quantize json, add cuda device specification and precision to cuda.json (#519) * remove code for no KV Cache path (#527) * Update ADVANCED-USERS.md (#529) Update Advanced Users description to reflect changes in the repo since the description was initially created. * runner-aoti on cuda (#531) * runner-aoti on cuda * transfer results back to CPU * transfer results back to CPU * runner-aoti on cuda * Update runner_build.md (#530) Update description of runner and build process in runner_build.md * clean up runner code a little (#532) * clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo * move int8 linear class and function into qops.py (#534) * add dtype tests for runner-aoti + runner-et (#539) * add dtype tests for runner-aoti + runner-et * typo * Quantized embedding (#536) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * Move Linear int4 to qops (#537) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * move int4 linear to qops * Revert "add dtype tests for runner-aoti + runner-et (#539)" (#548) This reverts commit a7a24577a65be67ac9ae4dc05452f35d9c49e5d1. * fix generate for llama3 (#538) * fix generate for llama3 * switch more things to C * remove C++ header * add delegation visualization instructions (#551) * Add dtype runner aoti (#552) * add dtype tests for runner-aoti + runner-et * typo * add dtype test runner-aoti * test sdpa with fp16 (#553) * test sdpa with fp16 * kv cache fp32 * typo * update (#560) * Only support newest versions of lm-eval (#556) Summary: remove support for lm-eval 0.3 to reduce the options we have Test Plan: CI Reviewers: Subscribers: Tasks: Tags: * split cpu eval CI by dtype (#554) * split cpu eval CI by dtype * fix * differentiate names with checks * keep one name the same as old * fix * Removing duplicate HF issue message from README (#559) Co-authored-by: Michael Gschwind <61328285+mikekgfb@users.noreply.github.com> * doc updates (#567) * Add VM-safe MPS check --------- Co-authored-by: Anthony Shoumikhin <anthony@shoumikh.in> Co-authored-by: metascroy <161522778+metascroy@users.noreply.github.com> Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com> Co-authored-by: lucylq <lfq@meta.com> Co-authored-by: Jerry Zhang <jerryzh168@gmail.com> Co-authored-by: Jack-Khuu <jack.khuu.7@gmail.com>

* code beautification * code beautification, move functions together * make --device fast the default (#515) * make --device fast the default * Update iOS.md (#517) * Update iOS.md * Update iOS.md * Pip to pip3 (#504) * remove macos-12 test * pip to pip3 * break aoti CI jobs separately (#500) * init * fixes * more fixes * fixes * fix * fix * bug fix * add objcopy update * suppress int8 * undefined variable --------- Co-authored-by: Michael Gschwind <mikekg@meta.com> * Support llama3 in chat in run.cpp (#486) * refactor chat runner in preparation for llama3 * add sketch for llama3 prompt template and move to returning tokens * fix tiktoken * fixes to chat * add default llama_ver * Add tests for quantize json, add cuda device specification and precision to cuda.json (#519) * remove code for no KV Cache path (#527) * Update ADVANCED-USERS.md (#529) Update Advanced Users description to reflect changes in the repo since the description was initially created. * runner-aoti on cuda (#531) * runner-aoti on cuda * transfer results back to CPU * transfer results back to CPU * runner-aoti on cuda * Update runner_build.md (#530) Update description of runner and build process in runner_build.md * clean up runner code a little (#532) * clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo * move int8 linear class and function into qops.py (#534) * add dtype tests for runner-aoti + runner-et (#539) * add dtype tests for runner-aoti + runner-et * typo * Quantized embedding (#536) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * Move Linear int4 to qops (#537) * move int8 linear class and function into qops.py * move Quantized Embedding to qops.py * move int4 linear to qops * Revert "add dtype tests for runner-aoti + runner-et (#539)" (#548) This reverts commit a7a24577a65be67ac9ae4dc05452f35d9c49e5d1. * fix generate for llama3 (#538) * fix generate for llama3 * switch more things to C * remove C++ header * add delegation visualization instructions (#551) * Add dtype runner aoti (#552) * add dtype tests for runner-aoti + runner-et * typo * add dtype test runner-aoti * test sdpa with fp16 (#553) * test sdpa with fp16 * kv cache fp32 * typo * update (#560) * Only support newest versions of lm-eval (#556) Summary: remove support for lm-eval 0.3 to reduce the options we have Test Plan: CI Reviewers: Subscribers: Tasks: Tags: * split cpu eval CI by dtype (#554) * split cpu eval CI by dtype * fix * differentiate names with checks * keep one name the same as old * fix * Removing duplicate HF issue message from README (#559) Co-authored-by: Michael Gschwind <61328285+mikekgfb@users.noreply.github.com> * doc updates (#567) * Add VM-safe MPS check --------- Co-authored-by: Anthony Shoumikhin <anthony@shoumikh.in> Co-authored-by: metascroy <161522778+metascroy@users.noreply.github.com> Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com> Co-authored-by: lucylq <lfq@meta.com> Co-authored-by: Jerry Zhang <jerryzh168@gmail.com> Co-authored-by: Jack-Khuu <jack.khuu.7@gmail.com> * add unpacking support (#525) * add unpacking support * fix typos and linter * perform parallel prefill when possible (#568) * perform parallel prefill when possible * typo * disable hack * remove print * remove debug messages which prevent export * fixes * stream results in generate.py (#571) * remove logging interfering with export --------- Co-authored-by: Anthony Shoumikhin <anthony@shoumikh.in> Co-authored-by: metascroy <161522778+metascroy@users.noreply.github.com> Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com> Co-authored-by: lucylq <lfq@meta.com> Co-authored-by: Jerry Zhang <jerryzh168@gmail.com> Co-authored-by: Jack-Khuu <jack.khuu.7@gmail.com>

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 29, 2023

facebook-github-bot added the fb-exported label Sep 29, 2023

Gasoonjia force-pushed the export-D49761307 branch from 3dd6576 to b9baa46 Compare October 2, 2023 21:24

Gasoonjia force-pushed the export-D49761307 branch from b9baa46 to 2cd8da8 Compare October 2, 2023 23:40

Songhao Jia and others added 2 commits October 5, 2023 04:11

Update bp export example

c2e7bb0

Summary: Update the export example for bundled program, including api update, comments and file name suffix. Differential Revision: https://internalfb.com/D49849371 fbshipit-source-id: fc8044dd2cbfeec87992264f18a2d0b976c5f7eb

Gasoonjia force-pushed the export-D49761307 branch from 2cd8da8 to 4ac3bd1 Compare October 5, 2023 11:12

facebook-github-bot closed this in 7c64f01 Oct 6, 2023

facebook-github-bot added the Merged label Oct 6, 2023

Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024

clean up runner code a little (#532)

f0e7895

* clean up runner code a little * update * update * pull out generate loop in chat * updates * edit docs * typo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make bundled_executor_runner only for bp#532

make bundled_executor_runner only for bp#532
Gasoonjia wants to merge 2 commits intopytorch:mainfrom
Gasoonjia:export-D49761307

Gasoonjia commented Sep 29, 2023

Uh oh!

netlify bot commented Sep 29, 2023 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 29, 2023

Uh oh!

facebook-github-bot commented Oct 2, 2023

Uh oh!

facebook-github-bot commented Oct 2, 2023

Uh oh!

facebook-github-bot commented Oct 5, 2023

Uh oh!

facebook-github-bot commented Oct 6, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Gasoonjia commented Sep 29, 2023

Uh oh!

netlify bot commented Sep 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for resplendent-gnome-14e531 ready!

Uh oh!

facebook-github-bot commented Sep 29, 2023

Uh oh!

facebook-github-bot commented Oct 2, 2023

Uh oh!

facebook-github-bot commented Oct 2, 2023

Uh oh!

facebook-github-bot commented Oct 5, 2023

Uh oh!

facebook-github-bot commented Oct 6, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

netlify bot commented Sep 29, 2023 •

edited

Loading