WordWave

AI-powered transcription SaaS that converts video into structured, searchable text with timestamps and speaker identification.

Live demo: https://wordwave.app

Overview

WordWave is an AI-powered transcription service that converts video content into structured, searchable text with timestamps and speaker identification.
It processes video uploads asynchronously and delivers the final transcript as a plain-text (.txt) file to the user’s inbox.
The system follows a modular micro-SaaS architecture that separates the interface, API, and processing layers for reliability and scalability.
Built with a privacy-first approach, WordWave operates without exposing proprietary source code or infrastructure details.

Technical Overview

Frontend: Vue 3 · Nuxt (SSR)
Backend: Fastify (Node.js)
AI & Processing: Python FastAPI · WhisperX
Infrastructure: Docker · Object Storage (S3-compatible) · PostgreSQL (task queue)
Integrations: Stripe (billing) · Mailgun (email delivery)

System Architecture

sequenceDiagram
    participant Client as Client (Vue + Nuxt SSR)
    participant API as Public API (Fastify, Node.js)
    participant Queue as Task Queue (PostgreSQL)
    participant Proc as Processing Service (FastAPI + WhisperX)
    participant Store as Object Storage (S3-compatible)
    participant Mail as Mail Delivery

    Client->>API: Upload media for transcription
    API->>Store: Store media object
    API->>Queue: Enqueue transcription task (with callback URL)
    Proc->>Queue: Poll task
    Proc->>Store: Retrieve media object
    Proc->>Store: Upload transcript
    Proc->>API: Callback with result metadata
    API->>Mail: Deliver transcript to user via email

The architecture is fully asynchronous, horizontally scalable, and deployable through containerized CI/CD pipelines.

Key Features

Extracts text from video files using AI models
Speaker detection and timestamp alignment
Automatic language detection across supported languages
Downloadable transcripts in plain-text (.txt) format
Automatic transcript delivery via email
Asynchronous task-queue processing
Designed for cloud or self-hosted deployment

Product Preview

Interface and output examples of WordWave.

Example of transcript output with timestamps, speaker identification, and automatic language detection.

Upload interface for video transcription.

Status

Active / Production MVP
Source code is private. A demo is available upon request.

Related Projects

CargoMagic – desktop application for cargo volume planning and optimization
RouteMagic – mobile application for route and delivery operations

Maintained by WebMagic Agency
Custom web and AI systems for logistics, e-commerce, and SaaS.

Security Disclaimer

This repository serves as a public showcase for demonstration purposes only.
No source code, credentials, or proprietary infrastructure details are shared.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WordWave

Overview

Technical Overview

System Architecture

Key Features

Product Preview

Status

Related Projects

Security Disclaimer

About

Uh oh!

Releases

Packages

web-magic/wordwave

Folders and files

Latest commit

History

Repository files navigation

WordWave

Overview

Technical Overview

System Architecture

Key Features

Product Preview

Status

Related Projects

Security Disclaimer

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages