Professional local-first AI production pipeline for long-form narration. Clone voices and generate studio-grade audiobooks (M4B/MP3) using Coqui XTTS-v2 and support for Voxtral (cloud)
text-to-speech privacy offline self-hosted audiobook tts speech-synthesis narration m4b voice-synthesis voice-cloning fastapi privacy-focused local-first coqui-tts audiobook-production local-ai xtts-v2 voxtral long-form-narration
-
Updated
Apr 21, 2026 - Python