support skip atten in export #16104
Conversation
@jackzhxng has exported this pull request. If you are a Meta employee, you can view the originating Diff in D88399533.
Summary:
Support export for llama model variants with attention layer skipping. We only need to specify the attention skip pattern in the `layer_types` field of config.json, e.g.:
"layer_types": [
"full_attention",
"full_attention",
"full_attention",
"skip_attention",
"skip_attention",
"skip_attention"
]
Differential Revision: D88399533
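Below is a minimal, hypothetical sketch of how such a `layer_types` list from config.json could drive per-layer attention selection. The `FullAttention`/`SkipAttention` classes and the `build_layers` helper are placeholders invented for illustration, not the actual ExecuTorch APIs; the real wiring lives under examples/models/llama.

```python
# Illustrative sketch only: reads "layer_types" from config.json and picks a
# per-layer attention module. Class and helper names are hypothetical.
import json
from typing import List

import torch
import torch.nn as nn


class FullAttention(nn.Module):
    """Placeholder standing in for a real self-attention block."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x  # real attention math would go here


class SkipAttention(nn.Module):
    """Placeholder for a block that bypasses the attention computation."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x  # pass hidden states through unchanged


def build_layers(layer_types: List[str]) -> nn.ModuleList:
    layers = []
    for layer_type in layer_types:
        if layer_type == "skip_attention":
            layers.append(SkipAttention())
        else:
            layers.append(FullAttention())
    return nn.ModuleList(layers)


# Assumes a config.json like the example above, containing "layer_types".
with open("config.json") as f:
    config = json.load(f)

layers = build_layers(config["layer_types"])
```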
Force-pushed dca1f2c to 375d689.
Force-pushed 375d689 to 72fd468.
)
elif (
    model_args.layer_types
    and model_args.layer_types[layer_id] == "skip_attention"
Is 'skip_attention' standard?
Yes, it is a standard name if we want to call https://github.com/pytorch/executorch/blob/main/examples/models/llama/attention.py#L525
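For context on the check in the diff above, here is a minimal sketch, assuming a hypothetical `ModelArgs`-style dataclass, of how a layer's `layer_types` entry selects the skip variant. The names `ModelArgs` and `attention_kind` are illustrative only and not taken from the ExecuTorch codebase.

```python
# Illustrative sketch of the branching shown in the diff: when layer_types is
# present and a layer's entry is "skip_attention", the skip variant is chosen.
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ModelArgs:
    n_layers: int = 6
    layer_types: Optional[List[str]] = None


def attention_kind(model_args: ModelArgs, layer_id: int) -> str:
    if (
        model_args.layer_types
        and model_args.layer_types[layer_id] == "skip_attention"
    ):
        return "skip"
    return "full"


args = ModelArgs(layer_types=["full_attention"] * 3 + ["skip_attention"] * 3)
print([attention_kind(args, i) for i in range(args.n_layers)])
# ['full', 'full', 'full', 'skip', 'skip', 'skip']
```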