Add universal phone recognition model - PhoneticXeus by Shikhar-S · Pull Request #45355 · huggingface/transformers

Shikhar-S · 2026-04-10T04:26:05Z

What does this PR do?

This PR introduces PhoneticXeus, which is the state-of-the-art universal phone recognizer trained on 70+ languages and evaluated on ~100 languages. The model should have high utility for the linguistic, phonological and multilingual research community.
Since this is currently the best-performing multilingual phone recognition model available, I expect the integration here will also attract new users to HF.

The E-Branchformer encoder used here is architecturally distinct from existing models, and has been employed in top performing speech models (XEUS, OWSM v2+ etc). The implementation here (ported from espnet) could also serve as a foundation for future speech models in HF.

Code Agent Policy

I have read CONTRIBUTING.md.

I confirm that this is not a pure code agent PR. I used coding agent for initial draft of this PR. Then I tested the output with original implementations, and manually reduced verbosity of the code generated.

Before submitting

Did you read the contributor guideline,
Pull Request section?
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

…to pxeus

Rocketknight1 · 2026-04-10T12:39:54Z

Audio model I guess, albeit an unusual one, so cc @ebezzam @eustlb! Transcription to IPA is very niche (but very cool) though, so we may want to keep this as a remote code model unless we expect significant usage.

github-actions · 2026-04-11T02:31:44Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, phoneticxeus

Shikhar-S · 2026-04-11T02:43:17Z

Thanks @Rocketknight1 for the initial review!
I have added motivation for this PR to the description above. Additionally, integrating the model into HF ecosystem will lower the barrier for people to easily reproduce results and build on top.

eustlb · 2026-04-13T12:24:23Z

Hey @Shikhar-S, thanks a lot for your PR, really cool contribution!

I’d like to confirm what @Rocketknight1 mentioned above: this model is a better candidate for remote code, which will allow you and users to fully leverage the HF ecosystem. If it gains strong usage, it could become a great candidate for native support 🤗

Thanks a lot for your understanding! I’m happy to help with the remote code option if you need it

Shikhar-S · 2026-04-14T15:02:02Z

Hi @eustlb thanks for taking a look!
For adding it via remote code option, do I need to move the modeling code to my hf repo, and just keep the auto registrations here? Is there an example I can look at/or some documentation for this option? I will change the PR accordingly. Thanks!

edit: adding @Rocketknight1 to advise next action for me.

Shikhar added 5 commits April 9, 2026 22:26

add pxeus modeling

c76381e

Merge branch 'main' of https://github.com/huggingface/transformers in…

db0facd

…to pxeus

fix issue with return

d123259

add documentation

98ea62c

add tests for intergratinon

37651ed

Shikhar-S marked this pull request as draft April 10, 2026 04:27

Shikhar Bharadwaj added 2 commits April 10, 2026 20:47

simplify ckpt loading

ec4313f

register phoneticxeus for auto loading

1788c0b

Shikhar-S changed the title ~~Pxeus~~ Add universal phone recognition model - PhoneticXeus Apr 11, 2026

Merge branch 'main' into pxeus

e8c0fb1

Shikhar-S marked this pull request as ready for review April 11, 2026 02:43

github-actions Bot requested review from ArthurZucker and Rocketknight1 April 11, 2026 02:44

ebezzam added the Audio label Apr 13, 2026

eustlb added the New model label Apr 13, 2026

evalstate mentioned this pull request Apr 28, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#41

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add universal phone recognition model - PhoneticXeus#45355

Add universal phone recognition model - PhoneticXeus#45355
Shikhar-S wants to merge 8 commits intohuggingface:mainfrom
Shikhar-S:pxeus

Shikhar-S commented Apr 10, 2026 •

edited

Loading

Uh oh!

Rocketknight1 commented Apr 10, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

Shikhar-S commented Apr 11, 2026

Uh oh!

eustlb commented Apr 13, 2026

Uh oh!

Shikhar-S commented Apr 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Shikhar-S commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Code Agent Policy

Before submitting

Uh oh!

Rocketknight1 commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Apr 11, 2026

Uh oh!

Shikhar-S commented Apr 11, 2026

Uh oh!

eustlb commented Apr 13, 2026

Uh oh!

Shikhar-S commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Shikhar-S commented Apr 10, 2026 •

edited

Loading

Rocketknight1 commented Apr 10, 2026 •

edited

Loading

Shikhar-S commented Apr 14, 2026 •

edited

Loading