AI-powered transcription SaaS that converts video into structured, searchable text with timestamps and speaker identification.
Live demo: https://wordwave.app
WordWave is an AI-powered transcription service that converts video content into structured, searchable text with timestamps and speaker identification.
It processes video uploads asynchronously and delivers the final transcript as a plain-text (.txt) file to the user’s inbox.
The system follows a modular micro-SaaS architecture that separates the interface, API, and processing layers for reliability and scalability.
Built with a privacy-first approach, WordWave operates without exposing proprietary source code or infrastructure details.
Frontend: Vue 3 · Nuxt (SSR)
Backend: Fastify (Node.js)
AI & Processing: Python FastAPI · WhisperX
Infrastructure: Docker · Object Storage (S3-compatible) · PostgreSQL (task queue)
Integrations: Stripe (billing) · Mailgun (email delivery)
sequenceDiagram
participant Client as Client (Vue + Nuxt SSR)
participant API as Public API (Fastify, Node.js)
participant Queue as Task Queue (PostgreSQL)
participant Proc as Processing Service (FastAPI + WhisperX)
participant Store as Object Storage (S3-compatible)
participant Mail as Mail Delivery
Client->>API: Upload media for transcription
API->>Store: Store media object
API->>Queue: Enqueue transcription task (with callback URL)
Proc->>Queue: Poll task
Proc->>Store: Retrieve media object
Proc->>Store: Upload transcript
Proc->>API: Callback with result metadata
API->>Mail: Deliver transcript to user via email
The architecture is fully asynchronous, horizontally scalable, and deployable through containerized CI/CD pipelines.
- Extracts text from video files using AI models
- Speaker detection and timestamp alignment
- Automatic language detection across supported languages
- Downloadable transcripts in plain-text (.txt) format
- Automatic transcript delivery via email
- Asynchronous task-queue processing
- Designed for cloud or self-hosted deployment
Interface and output examples of WordWave.
Example of transcript output with timestamps, speaker identification, and automatic language detection.
Upload interface for video transcription.
Active / Production MVP
Source code is private. A demo is available upon request.
- CargoMagic – desktop application for cargo volume planning and optimization
- RouteMagic – mobile application for route and delivery operations
Maintained by WebMagic Agency
Custom web and AI systems for logistics, e-commerce, and SaaS.
This repository serves as a public showcase for demonstration purposes only.
No source code, credentials, or proprietary infrastructure details are shared.


