Production First and Production Ready End-to-End Speech Recognition Toolkit
-
Updated
Dec 4, 2025 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
OpenAI Whisper ASR Webservice API
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Open STT
On-device streaming speech-to-text engine powered by deep learning
End-to-end ASR/LM implementation with PyTorch
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
On-device speech-to-text engine powered by deep learning
[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Processing
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
SOVA ASR (Automatic Speech Recognition)
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."