Build software better, together

innovatorved / subtitle

Open-source subtitle generation for seamless content translation.

subtitle innovatorved whisper-cpp whisper-model

Updated Nov 30, 2025
Python

shhossain / BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

machine-learning deep-learning speech pytorch transformer voice-recognition speech-recognition bangla speech-to-text hacktoberfest whisper bangla-asr bangla-speech-recognition bangla-speech-to-text bangla-automatic-speech-recognition whisper-model bangla-voice-recognition

Updated Mar 1, 2025
Python

jim-schwoebel / nala_assistant

Star

🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.

Updated Jan 15, 2024
JavaScript

thc1006 / whisper-colab-tpu-transcriber

Star

High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.

python machine-learning natural-language-processing deep-learning ffmpeg jupyter-notebook pytorch speech-recognition ipywidgets voice-to-text tpu google-colab audio-transcription huggingface-transformers pytorch-xla openai-whisper whisper-model multilingual-asr

Updated Jun 8, 2025
Jupyter Notebook

hemangjoshi37a / French_audio_transcription_using_gradio

Star

French audio transcription using gradio

machine-learning speech-recognition gradio audio-processing french-language audio-transcription audio-to-text transcription-tool whisper-model french-audio-transcription

Updated Sep 22, 2024
Jupyter Notebook

JulianFP / project-W

Star

A self-hostable platform on which users can create transcripts of their audio files (speech-to-text) using Whisper AI

docker svelte speech-to-text transcription whisper audio-processing transcribe fastapi-framework whisper-model

Updated Dec 1, 2025
Python

seccanj / generate-subtitle-llm

Star

Generates subtitles from a video speech (Whisper OpenAI LLM) or extracts existing subtitles, translates them into a different language using Mistral LLM and adds them to the video. Uses ffmpeg for extracting and encoding

machine-learning video ai ffmpeg python3 video-processing subtitles-generator llms whisper-model mistral-7b subtitles-translator mistral-ai

Updated Jan 28, 2025
Python

furkanksl / FreeWhisper

Star

free macOS whisper dictation app

speech-to-text transcription whisper whisper-model

Updated Jun 22, 2025
Swift

krithicswaroopan / AI-Voice-Assistance-Pipeline

Star

A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an LLM, and Edge-TTS with tunable parameters for low-latency audio processing and response generation.

python natural-language-processing text-to-speech speech-recognition speech-to-text real-time-processing conversational-ai voice-activity-detection ai-ml hugging-face-transformers large-language-models whisper-model edge-tts

Updated Sep 24, 2024
Python

umitkacar / transformer-asr-transcription

Star

Real-time transformer-based ASR supporting 100+ languages - Google Cloud integration with noise cancellation & low-latency optimization

Updated Nov 10, 2025
Python

franckferman / Whisper_Transcriber

Star

📝 Turn audio into text effortlessly. Audio transcription powered by OpenAI's Whisper API.

Updated Mar 15, 2025
Python

Avinraj01 / SHL-Grammar-Scoring-Engine-for-Voice-Samples

Star

This model predicts grammar scores (1–5) from audio files. It uses Whisper to transcribe speech to text, cleans the text, and extracts features with TF-IDF. A Random Forest Regressor is trained to learn grammar score patterns. Evaluation via Pearson Correlation showed good results.

machine-learning random-forest speech-recognition tf-idf nlp-machine-learning model-evaluation pearson-correlation text-preprocessing regression-model audio-to-text whisper-model grammar-scoring submission-pipeline

Updated Jun 20, 2025
Jupyter Notebook

dvorobiev / subtitles_project

Star

Subtitles Generator: Автоматический генератор субтитров для видео с поддержкой перевода на различные языки, использующий модель Whisper от OpenAI.

python machine-learning subtitles video-processing audio-transcription whisper-model

Updated Mar 19, 2025
Python

sushant1827 / CrewAI-Agents-MinutesOfMeeting-Gmail

Star

MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.

chunking google-cloud-platform gmail-api audio-segmentation google-auth-library whisper-model llm-tools crewai agentic-workflow gpt-4o-mini agent-ops crewai-flow

Updated Jan 18, 2025
Python

arthurtran04 / english-audio-transcription

Star

An English Audio Transcription Application.

python3 gradio whisper-model

Updated Jul 14, 2025
Python

13shivam / yt-agent

Star

Offline-friendly backend POC to transcribe YouTube videos and chat with video content using Whisper (no cloud required) and local LLMs via Ollama like Mistral or LLaMA2. Built with Flask and PostgreSQL, fully open source with Swagger APIs. Easily connect any frontend. ⚠️ Use Submit API to download one video at a time to avoid YouTube throttling.

postgresql speech-recognition flask-api dockerised audiototext llm whisper-model whisper-ai ollama mistral-ai mistral-7b-instruct

Updated May 6, 2025
Python

Xza85hrf / Whisper-Subtitle-Generator

Star

The Whisper Subtitle Generator leverages OpenAI's Whisper model to generate subtitles from audio and video files. This Python-based tool supports multiple languages and employs advanced audio processing techniques to ensure high accuracy in transcription.

python ffmpeg speech-recognition openai gpu-acceleration noise-reduction audio-processing subtitle-generator audio-to-text video-subtitles transcription-tool whisper-model multilingual-transcription srt-output vtt-output

Updated Apr 23, 2024
Python

otonomee / youtube-to-transcript

Star

Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?

python machine-learning openai youtube-downloader speech-to-text transcription pytube video-to-text audio-transcription whisper-model

Updated Jul 30, 2024
Python

yu3325363946 / ai-education-video-analyzer

Star

自动生成字幕，内容总结，章节划分 | AI-driven education video analysis using Whisper, BLIP-2, and DeepSeek

nlp open-source education machine-learning ai spring-boot computer-vision vue deep-learning core edtech subtitles whisper-model whisper-ai blip-2 deepseek deepseek-api blip-model deepseek-r1

Updated Oct 16, 2025
Java

LandoNikko / Transcribe-Panel

Star

Transcribe audio and video files locally or try different cloud models. Compare, explore and control everything through the web interface.

transcript transcription whisper transcribe themes-css whisper-model bring-your-own-ai-api-key

Updated Oct 27, 2025
HTML

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

whisper-model

Here are 27 public repositories matching this topic...

innovatorved / subtitle

shhossain / BanglaSpeech2Text

jim-schwoebel / nala_assistant

thc1006 / whisper-colab-tpu-transcriber

hemangjoshi37a / French_audio_transcription_using_gradio

JulianFP / project-W

seccanj / generate-subtitle-llm

furkanksl / FreeWhisper

krithicswaroopan / AI-Voice-Assistance-Pipeline

umitkacar / transformer-asr-transcription

franckferman / Whisper_Transcriber

Avinraj01 / SHL-Grammar-Scoring-Engine-for-Voice-Samples

dvorobiev / subtitles_project

sushant1827 / CrewAI-Agents-MinutesOfMeeting-Gmail

arthurtran04 / english-audio-transcription

13shivam / yt-agent

Xza85hrf / Whisper-Subtitle-Generator

otonomee / youtube-to-transcript

yu3325363946 / ai-education-video-analyzer

LandoNikko / Transcribe-Panel

Improve this page

Add this topic to your repo