Skip to content

[Mistral] Fix query scaling for Mistral4 and Ministral3#44860

Merged
Cyrilvallez merged 1 commit intomainfrom
fix-mistral
Mar 19, 2026
Merged

[Mistral] Fix query scaling for Mistral4 and Ministral3#44860
Cyrilvallez merged 1 commit intomainfrom
fix-mistral

Conversation

@Cyrilvallez
Copy link
Copy Markdown
Member

What does this PR do?

As per the title. As discussed offline with the Mistral team, the scaling applied to the query should not be the absolute one (old cache_position), but the actual position_ids, taking into account padding, packed format in case of continuous batching etc

@github-actions
Copy link
Copy Markdown
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: ministral3, mistral4

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Copy Markdown
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you run integration tests please ? 😉

@Cyrilvallez
Copy link
Copy Markdown
Member Author

Well, I expect this to break (at least some of) the integration tests, so let's wait to hear back from @juliendenize and @patrickvonplaten if it's correct instead!

@Cyrilvallez
Copy link
Copy Markdown
Member Author

run-slow: ministral3, mistral4

@github-actions
Copy link
Copy Markdown
Contributor

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/ministral3", "models/mistral4"]
quantizations: []

@juliendenize
Copy link
Copy Markdown
Contributor

Thanks !!

@github-actions
Copy link
Copy Markdown
Contributor

CI Results

Workflow Run ⚙️

Commit Info

Context Commit Description
RUN 809c6612 workflow commit (merge commit)
PR c396876d branch commit (from PR)
main 88433336 base commit (on main)

✅ No failing test specific to this PR 🎉 👏 !

@Cyrilvallez Cyrilvallez merged commit b96f8a9 into main Mar 19, 2026
24 checks passed
@Cyrilvallez Cyrilvallez deleted the fix-mistral branch March 19, 2026 18:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants