Skip to content

make extract_delegate_segments=True by default#3405

Closed
davidlin54 wants to merge 1 commit intopytorch:mainfrom
davidlin54:export-D56712292
Closed

make extract_delegate_segments=True by default#3405
davidlin54 wants to merge 1 commit intopytorch:mainfrom
davidlin54:export-D56712292

Conversation

@davidlin54
Copy link
Contributor

Summary: updated all existing callsites to use the previous default value of False

Differential Revision: D56712292

@pytorch-bot
Copy link

pytorch-bot bot commented Apr 29, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3405

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 7cf9f49 with merge base dd81fc7 (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 29, 2024
@davidlin54 davidlin54 force-pushed the export-D56712292 branch 5 times, most recently from 309c6b1 to cc9fdff Compare April 30, 2024 15:30
@huydhn
Copy link
Contributor

huydhn commented May 2, 2024

@pytorchbot drci

(please ignore this, I'm testing Dr.CI)

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56712292

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56712292

davidlin54 pushed a commit to davidlin54/executorch that referenced this pull request May 8, 2024
Summary:
Pull Request resolved: pytorch#3405

updated all existing callsites to use the previous default value of False.

when extract_delegate_segments is set to False (previous behavior), the backend blob data is part of the flatbuffer serialized program. this leads to higher memory consumption, as backends may not need the input blob post initialization, but cannot free the memory as it's part of the flatbuffer.

when extract_delegate_segments is set to True, the backend blob data is extracted into separate segments. this way, each backend can choose to free the memory after initialization if it is no longer needed. this reduces peak memory consumption as a result. the con is that this leads to an increased program size due to internal padding between the flatbuffer program and the extracted segments

Reviewed By: JacobSzwejbka, cccclai, dbort, zonglinpengmeta

Differential Revision: D56712292
davidlin54 pushed a commit to davidlin54/executorch that referenced this pull request May 9, 2024
Summary:

updated all existing callsites to use the previous default value of False.

when extract_delegate_segments is set to False (previous behavior), the backend blob data is part of the flatbuffer serialized program. this leads to higher memory consumption, as backends may not need the input blob post initialization, but cannot free the memory as it's part of the flatbuffer.

when extract_delegate_segments is set to True, the backend blob data is extracted into separate segments. this way, each backend can choose to free the memory after initialization if it is no longer needed. this reduces peak memory consumption as a result. the con is that this leads to an increased program size due to internal padding between the flatbuffer program and the extracted segments

Reviewed By: JacobSzwejbka, cccclai, dbort, zonglinpengmeta

Differential Revision: D56712292
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56712292

Summary:

updated all existing callsites to use the previous default value of False.

when extract_delegate_segments is set to False (previous behavior), the backend blob data is part of the flatbuffer serialized program. this leads to higher memory consumption, as backends may not need the input blob post initialization, but cannot free the memory as it's part of the flatbuffer.

when extract_delegate_segments is set to True, the backend blob data is extracted into separate segments. this way, each backend can choose to free the memory after initialization if it is no longer needed. this reduces peak memory consumption as a result. the con is that this leads to an increased program size due to internal padding between the flatbuffer program and the extracted segments

Reviewed By: JacobSzwejbka, cccclai, dbort, zonglinpengmeta

Differential Revision: D56712292
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D56712292

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in cc11541.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants