GitHub - EmZod/speak: A fast CLI tool for Agents to convert their text output to speech using Chatterbox TTS on Apple Silicon. Agent SKILL files included.

A fast CLI tool for AI agents to convert their text output to speech using Chatterbox TTS on Apple Silicon.

Quick Start

git clone https://github.com/EmZod/speak.git
cd speak
bun install

# First run auto-installs Python dependencies
bun run src/index.ts "Hello, world!" --play

Create an alias for easier access:

alias speak="bun run $(pwd)/src/index.ts"

Requirements

macOS with Apple Silicon (M Series)
Bun
Python 3.10+
sox (for long documents): brew install sox

Basic Usage

speak "Hello, world!" --play        # Generate and play
speak article.md --stream           # Stream long content
speak --clipboard --play            # Read from clipboard
speak document.md --output out.wav  # Save to file

Key Features

# Long documents - auto-chunk for reliability
speak book.md --auto-chunk --output book.wav

# Resume interrupted generation
speak --resume manifest.json

# Batch processing
speak *.md --output-dir ~/Audio/

# Estimate duration before generating
speak --estimate document.md

# Concatenate audio files
speak concat part1.wav part2.wav --out combined.wav

Commands

Command	Description
`speak <text\|file>`	Generate speech
`speak health`	Check system status
`speak models`	List available models
`speak concat <files>`	Combine audio files
`speak daemon kill`	Stop TTS server

Common Options

Option	Description
`--play`	Play after generation
`--stream`	Stream as it generates
`--output <path>`	Output file or directory
`--auto-chunk`	Chunk long documents
`--estimate`	Show duration estimate
`--dry-run`	Preview without generating

Documentation

docs/usage.md - Complete usage guide
docs/configuration.md - Config file, environment variables, shell setup
docs/troubleshooting.md - Common issues and fixes
SKILL.md - Agent-optimized reference
CHANGELOG.md - Version history
.agentic/ - Agentic engineering artifacts (optimization reports, focus group tests)

Development

bun install          # Install dependencies
bun test             # Run tests
bun run typecheck    # Type check

For AI Agents

Copy SKILL.md to your agent's skills directory:

cp SKILL.md ~/.claude/skills/speak-tts/SKILL.md

See AGENTS.md for setup details.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.agentic		.agentic
.github/workflows		.github/workflows
assets		assets
dev		dev
docs		docs
src		src
test		test
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
README.md		README.md
SKILL.md		SKILL.md
bun.lock		bun.lock
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Quick Start

Requirements

Basic Usage

Key Features

Commands

Common Options

Documentation

Development

For AI Agents

License

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

EmZod/speak

Folders and files

Latest commit

History

Repository files navigation

Quick Start

Requirements

Basic Usage

Key Features

Commands

Common Options

Documentation

Development

For AI Agents

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages