[PEFT] Fix prefix tuning #41696
Conversation
run-slow: bert,bart

This comment contains run-slow, running the specified jobs: models: ['models/bart', 'models/bert']
BenjaminBossan left a comment:
Thanks a lot for taking quick care of this regression.
I also tested this PR with the original PEFT unit tests that were failing and they pass now!
Ping @ArthurZucker, could we please get this in before the next release?
Cyrilvallez left a comment:
All right, let's go for it even if it's quite a weird pattern, as both should always be the same in general 🥲
Removed after more offline discussion
In transformers, the bidirectional mask creation (as used e.g. by BERT) does not take into account possible virtual tokens inserted by prefix tuning. This results in an attention mask of the wrong shape. This PR monkey patches the `_preprocess_mask_arguments` function from transformers so that the virtual tokens are taken into account. This is far from ideal, but there are currently no plans to fix this on the transformers side (see huggingface/transformers#41696).
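For illustration only, here is a minimal sketch of the idea behind the patch. The helper name and signature below are hypothetical and do not reproduce the actual `_preprocess_mask_arguments` signature or the PEFT wrapper; it only shows how the 2D padding mask would be extended so its key/value dimension also covers the virtual tokens.

```python
import torch
from typing import Optional


def extend_mask_for_virtual_tokens(
    attention_mask: Optional[torch.Tensor], num_virtual_tokens: int
) -> Optional[torch.Tensor]:
    # Hypothetical helper (not the real PEFT/transformers code): prepend ones so
    # the 2D padding mask also covers the prefix-tuning virtual tokens, which sit
    # at the start of the key/value sequence and are always attended to.
    if attention_mask is None or num_virtual_tokens == 0:
        return attention_mask
    prefix = torch.ones(
        attention_mask.shape[0],
        num_virtual_tokens,
        dtype=attention_mask.dtype,
        device=attention_mask.device,
    )
    return torch.cat([prefix, attention_mask], dim=-1)
```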
Cyrilvallez left a comment:
Alright, after all the discussions it's a bit too hard to fix in peft directly, so let's go for it!
* fix
* simplify
* add my 2 cents
Prefix tuning is a bit of a special case where the q and kv lengths differ even though the attention is bidirectional (encoder-style). We now rely on the attention mask for the kv length (if provided). The previous assumption held because causal masks always have q == kv length. Fixes #38301 (comment)
cc @BenjaminBossan
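For illustration, a minimal sketch of what "rely on the mask for kv length" means. The helper name and signature are hypothetical, not the transformers implementation; it only shows taking the key/value length from the 2D padding mask instead of assuming it equals the query length.

```python
import torch
from typing import Optional


def build_bidirectional_mask(
    attention_mask: Optional[torch.Tensor], batch_size: int, q_length: int
) -> torch.Tensor:
    # Hypothetical helper: derive kv_length from the 2D padding mask when one is
    # provided instead of assuming kv == q. With prefix tuning,
    # kv_length = q_length + num_virtual_tokens, so the old assumption breaks.
    if attention_mask is not None:
        kv_length = attention_mask.shape[-1]
        padding = attention_mask[:, None, None, :].bool()
    else:
        kv_length = q_length  # old assumption, only valid without virtual tokens
        padding = torch.ones(batch_size, 1, 1, kv_length, dtype=torch.bool)
    # Bidirectional (BERT-style) attention: every query position may attend to
    # every non-padded key/value position, so the 4D mask is just the padding
    # mask broadcast over the query dimension.
    return padding.expand(batch_size, 1, q_length, kv_length)
```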