δΈζ | English
β οΈ Development Status: This project is under active development. Features may change, and bugs are expected. Use at your own risk.
π€ AI-Generated Code: This project was entirely written by AI coding assistants including Claude Code, Clawdbot, and other AI tools. Human involvement was limited to product direction and testing.
All-in-one video translation and dubbing pipeline. Download, transcribe, translate, dub, and upload - fully automated.
- Video Download - YouTube, TikTok, Bilibili and other platforms via yt-dlp
- Speech Transcription - Multiple backends:
- Whisper (OpenAI)
- Faster Whisper (4-8x faster, GPU optimized)
- WhisperX (word-level alignment, speaker diarization)
- Smart Translation - Multiple engines:
- Google Translate (free)
- OpenAI GPT-4/GPT-4o
- Anthropic Claude
- DeepSeek
- AI Dubbing - Text-to-speech with multiple engines:
- Edge TTS (Microsoft, free, 400+ voices)
- CosyVoice (voice cloning)
- Index TTS (voice cloning)
- Qwen3 TTS (voice cloning)
- Subtitle Processing - Dual subtitles, custom styles, ASS/SRT export, hardcoded burn-in
- One-click Upload - Bilibili, Douyin (ζι³), Xiaohongshu (ε°ηΊ’δΉ¦) with multi-account support
| Mode | Description | Use Case |
|---|---|---|
| Full | Transcribe β Translate β Dub β Compose | YouTube β Bilibili translation |
| Subtitle | Transcribe β Translate β Embed subtitles | Videos with dialogue, no dubbing needed |
| Direct | Download β Upload directly | Viral clips, no dialogue |
| Auto | AI analyzes content and decides | Uncertain content type |
- Subscribe to YouTube/TikTok channels
- Auto-detect new videos with configurable intervals
- Auto-processing pipeline (download β process β upload)
- Batch import historical videos
- AI Proofreading: Grammar check, terminology consistency, timing optimization
- AI Metadata: Auto-generate platform-specific titles, descriptions, tags
- Different styles for Douyin (short, trendy), Bilibili (detailed), Xiaohongshu (lifestyle)
- Custom Glossary: Maintain terminology consistency across translations
- Modern React UI with dark theme
- Multi-language interface (Chinese/English)
- Processing presets for quick setup
- Trending video discovery (YouTube)
- Task management with progress tracking
- Multi-account support for all upload platforms
Backend
- Python 3.10+ / FastAPI / SQLAlchemy + SQLite
- yt-dlp (video download)
- FFmpeg (video processing)
- Whisper/Faster-Whisper/WhisperX (transcription)
- Playwright (browser automation for TikTok, Douyin)
Frontend
- React 18 + TypeScript
- Tailwind CSS
- Vite
- React Query
- i18next
- Python 3.10+
- Node.js 18+
- FFmpeg
- GPU (optional, recommended for Whisper)
# Clone
git clone https://github.com/vmansus/iDubb.git
cd iDubb
# Backend
cd backend
pip install -r requirements.txt
playwright install chromium # For TikTok/Douyin support
# Start backend
uvicorn api.main:app --host 0.0.0.0 --port 8888
# Frontend (new terminal)
cd frontend
npm install
npm run devVisit http://localhost:5173
Configure in Settings page:
| Service | Required For | Notes |
|---|---|---|
| OpenAI API Key | GPT translation, AI proofreading | Optional if using Google Translate |
| Anthropic API Key | Claude translation | Optional |
| DeepSeek API Key | DeepSeek translation | Optional, cost-effective |
| YouTube Data API | Trending videos | Optional |
For auto-upload features, configure platform credentials in Settings:
- Bilibili: Login via QR code scan
- Douyin: Login via QR code or cookies
- Xiaohongshu: Login via QR code or cookies
# .env (optional)
WHISPER_MODEL=small # tiny, base, small, medium, large-v3
WHISPER_DEVICE=auto # auto, cpu, cuda, mps- Create Task - Paste video URL or upload local file
- Choose Mode - Full translation, subtitle only, or direct upload
- Configure - Select languages, TTS voice, subtitle style
- Process - One click to start, monitor progress in real-time
- Review - Preview results, edit subtitles if needed
- Upload - One-click upload to multiple platforms
- Go to "Subscriptions" page
- Click "Add Subscription"
- Paste YouTube/TikTok channel URL
- Configure check interval and processing options
- Enable auto-process for hands-free operation
iDubb/
βββ backend/
β βββ api/ # FastAPI routes
β βββ database/ # SQLAlchemy models
β βββ downloaders/ # Video download (yt-dlp)
β βββ transcription/ # Whisper backends
β βββ translation/ # Translation engines
β βββ tts/ # TTS engines
β βββ dubbing/ # Audio/video composition
β βββ subtitles/ # Subtitle processing
β βββ uploaders/ # Platform uploaders
β βββ subscriptions/ # Subscription scheduler
β βββ metadata/ # AI metadata generation
β βββ proofreading/ # AI proofreading
β βββ pipeline.py # Main processing pipeline
βββ frontend/
β βββ src/
β β βββ components/ # React components
β β βββ pages/ # Page components
β β βββ services/ # API client
β β βββ locales/ # i18n translations
β βββ public/
βββ docs/ # Documentation
# Install CosyVoice to external/cosyvoice
git clone https://github.com/FunAudioLLM/CosyVoice external/cosyvoice
cd external/cosyvoice && pip install -r requirements.txt# Install IndexTTS to external/indextts
git clone https://github.com/indexteam/IndexTTS external/indextts
cd external/indextts && pip install -r requirements.txtContributions welcome! Please read the existing code style and submit PRs.
MIT
Made with β€οΈ by vmansus & Chad π