Add universal phone recognition model - PhoneticXeus#45355
Add universal phone recognition model - PhoneticXeus#45355Shikhar-S wants to merge 8 commits intohuggingface:mainfrom
Conversation
|
[For maintainers] Suggested jobs to run (before merge) run-slow: auto, phoneticxeus |
|
Thanks @Rocketknight1 for the initial review! |
|
Hey @Shikhar-S, thanks a lot for your PR, really cool contribution! I’d like to confirm what @Rocketknight1 mentioned above: this model is a better candidate for remote code, which will allow you and users to fully leverage the HF ecosystem. If it gains strong usage, it could become a great candidate for native support 🤗 Thanks a lot for your understanding! I’m happy to help with the remote code option if you need it |
|
Hi @eustlb thanks for taking a look! edit: adding @Rocketknight1 to advise next action for me. |
What does this PR do?
This PR introduces PhoneticXeus, which is the state-of-the-art universal phone recognizer trained on 70+ languages and evaluated on ~100 languages. The model should have high utility for the linguistic, phonological and multilingual research community.
Since this is currently the best-performing multilingual phone recognition model available, I expect the integration here will also attract new users to HF.
The E-Branchformer encoder used here is architecturally distinct from existing models, and has been employed in top performing speech models (XEUS, OWSM v2+ etc). The implementation here (ported from espnet) could also serve as a foundation for future speech models in HF.
Code Agent Policy
I have read
CONTRIBUTING.md.Before submitting
Pull Request section?
documentation guidelines, and
here are tips on formatting docstrings.