Repository containing experimentation platform on how to train, infer on wav2vec2 models.
-
Updated
Sep 22, 2022 - Python
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
🎤 Enhance speech recognition by detecting emotions in spoken language, combining OpenAI's Whisper and emotion analysis for deeper insights.
An intelligent speech recognition system that combines OpenAI's Whisper for accurate transcription with dual emotion detection models. Analyzes both audio characteristics (tone, pitch, intensity) and textual content to provide comprehensive emotional context alongside transcriptions.
Sara :- The Personal Voice Assistant
🤖 A Python-based AI Voice Assistant (Jarvis) with OpenAI GPT, NewsAPI, and music playback.
Add a description, image, and links to the speech-recognition-model topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition-model topic, visit your repo's landing page and select "manage topics."