automatic-speech-recognition

Star

Here are 198 public repositories matching this topic...

wenet-e2e / wenet

Star

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated Dec 4, 2025
Python

ahmetoner / whisper-asr-webservice

Sponsor

Star

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated Nov 23, 2025
Python

zzw922cn / Automatic_Speech_Recognition

Star

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Mar 24, 2023
Python

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.

open-source transformer speech-recognition automatic-speech-recognition asr conformer llm industrial-grade multimodal-llm speechllm

Updated Sep 22, 2025
Python

kakaobrain / pororo

Star

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

natural-language-processing deep-learning speech-synthesis automatic-speech-recognition neural-models

Updated Mar 23, 2022
Python

TensorSpeech / TensorFlowASR

Star

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

tensorflow speech-recognition jasper automatic-speech-recognition speech-to-text ctc conformer deepspeech2 tflite rnn-transducer end2end tensorflow2 contextnet tflite-model tflite-convertion subword-speech-recognition streaming-transducer

Updated Jun 11, 2025
Python

jitsi / jiwer

Star

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

python3 automatic-speech-recognition speech-to-text evaluation-metrics wer word-error-rate

Updated Feb 15, 2025
Python

snakers4 / open_stt

Star

Open STT

dataset russian automatic-speech-recognition speech-to-text stt asr

Updated Mar 11, 2022
Python

shirayu / whispering

Sponsor

Star

Streaming transcriber with whisper

automatic-speech-recognition whisper

Updated May 1, 2023
Python

Picovoice / cheetah

Star

On-device streaming speech-to-text engine powered by deep learning

voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr online-speech-recognition streaming-speech-to-text

Updated Nov 27, 2025
Python

hirofumi0810 / neural_sp

Star

End-to-end ASR/LM implementation with PyTorch

streaming speech language-modeling pytorch transformer speech-recognition seq2seq attention automatic-speech-recognition sequence-to-sequence language-model attention-mechanism asr ctc rnn-transducer transformer-xl

Updated Aug 30, 2021
Python

Z-yq / TensorflowASR

Star

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1

cpp transformer transducers automatic-speech-recognition bert ctc state-of-the-art listen-attend-and-spell tensorflow-cpp tensorflow2

Updated Mar 13, 2025
Python

jonatasgrosman / huggingsound

Sponsor

Star

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

audio speech transformers speech-recognition automatic-speech-recognition speech-to-text asr

Updated Sep 20, 2023
Python

Picovoice / leopard

Star

On-device speech-to-text engine powered by deep learning

voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr voice-to-text on-device

Updated Dec 4, 2025
Python

leduckhai / MultiMed

Star

[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Processing

machine-learning natural-language-processing deep-learning artificial-intelligence named-entity-recognition automatic-speech-recognition speech-summarization

Updated Jun 19, 2025
Python

hirofumi0810 / tensorflow_end2end_speech_recognition

Star

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

tensorflow end-to-end speech-recognition beam-search automatic-speech-recognition speech-to-text attention-mechanism asr timit-dataset ctc timit end-to-end-learning csj librispeech joint-ctc-attention

Updated Jan 23, 2018
Python

NavodPeiris / speechlib

Sponsor

Star

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.

ai automatic-speech-recognition transcription speaker-recognition speaker-verification speaker-diarization whisper-ai faster-whisper