Skip to content

[Auto] Add Mistral tokenizer regex regression test (cluster-45081-3): merged 1 of 2 PRs#32

Open
evalstate wants to merge 3 commits intomainfrom
merge-cluster-cluster-45081-3-20260424123400
Open

[Auto] Add Mistral tokenizer regex regression test (cluster-45081-3): merged 1 of 2 PRs#32
evalstate wants to merge 3 commits intomainfrom
merge-cluster-cluster-45081-3-20260424123400

Conversation

@evalstate
Copy link
Copy Markdown
Owner

Cluster: cluster-45081-3
Base: origin/main

Merged:

Skipped:

Failed:

  • None

Notes:

Next steps:

  • Run the targeted tokenizer test once pytest/test dependencies are available.
  • Optionally run make style before any PR-ready handoff.

Mohd Faour and others added 3 commits April 8, 2026 16:26
The existing test only checks that passing fix_mistral_regex=True doesn't
error, but the hub model's config version causes early return so the
patching logic is never exercised. This new test creates a local config
with an old transformers_version to force the patching code path, verifying
that the pre_tokenizer is correctly patched to a Sequence without
AttributeError.
@evalstate
Copy link
Copy Markdown
Owner Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant