Verbose

Press a key. Speak. Press again. Text appears.

Voice-to-text for Linux. 100% local, no subscriptions.

# 1. Install system dependencies
sudo apt install python3-evdev python3-pyaudio python3-yaml python3-gi ydotool

# 2. Get whisper.cpp (OPTION A - Easiest: Download pre-compiled binary)
#    Download from: https://github.com/ggerganov/whisper.cpp/releases
#    Extract to ./whisper.cpp/
#    Download a model: cd whisper.cpp && bash models/download-ggml-model.sh base

# 2. Get whisper.cpp (OPTION B - Compile for better performance)
git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp && mkdir build && cd build && cmake .. && make -j$(nproc) && cd ..
bash models/download-ggml-model.sh base && cd ..

# 3. Run it
python3 verbose.py

# 4. Use it - Press F9, speak, press F9 again

What It Does

Local voice-to-text using whisper.cpp. Works in any application - terminals, browsers, IDEs.

No cloud dependencies. No API keys needed.

Features

Multiple configs with different hotkeys (F9 for coding, F10 for emails)
Auto-correct common mistakes ("cloud code" → "Claude Code")
Phrase shortcuts ("my email" → expands to full address)
Starts on login, lives in the system tray
Cancel anytime with ESC
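
To illustrate how auto-correct and phrase shortcuts could work on a transcript, here is a minimal sketch. The function name `apply_replacements` and the matching rules (case-insensitive, whole words only, longest phrase first) are assumptions made for illustration, not necessarily what verbose.py does.

```python
import re


def apply_replacements(text, replacements):
    """Replace each spoken phrase with its target text.

    Hypothetical helper: matches whole words, ignoring case.
    """
    # Try longer phrases first so a short phrase cannot pre-empt a longer one.
    for phrase in sorted(replacements, key=len, reverse=True):
        pattern = re.compile(r"\b" + re.escape(phrase) + r"\b", re.IGNORECASE)
        target = replacements[phrase]
        # A function replacement keeps backslashes in the target literal.
        text = pattern.sub(lambda match, target=target: target, text)
    return text


dictionary = {"cloud code": "Claude Code"}
shortcuts = {"my email": "you@example.com"}

print(apply_replacements("open cloud code and send it to my email",
                         {**dictionary, **shortcuts}))
```

Replacements are applied one after another, so a target that itself contains another phrase would be rewritten again. A real implementation may want to guard against that.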

How It Works

Press F9 → Red icon appears
Speak → "echo hello world"
Press F9 → Orange icon while processing
Watch it type → Text appears exactly where your cursor was

Press ESC anytime to cancel.
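
The press, speak, press cycle above can be read as a small state machine. The sketch below is one way to model it; the class and method names are hypothetical, and ignoring the hotkey while a transcription is in progress is an assumption rather than documented behavior.

```python
from enum import Enum


class State(Enum):
    IDLE = "idle"
    RECORDING = "recording"    # red icon
    PROCESSING = "processing"  # orange icon


class Dictation:
    """Hypothetical model of the hotkey cycle, not the actual verbose.py code."""

    def __init__(self):
        self.state = State.IDLE

    def on_hotkey(self):
        if self.state is State.IDLE:
            self.state = State.RECORDING    # first press: start recording
        elif self.state is State.RECORDING:
            self.state = State.PROCESSING   # second press: stop and transcribe
        # Assumption: a press during processing is ignored.
        return self.state

    def on_transcribed(self):
        if self.state is State.PROCESSING:
            self.state = State.IDLE         # text has been typed, ready again
        return self.state

    def on_escape(self):
        self.state = State.IDLE             # ESC cancels from any state
        return self.state
```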

Setup

Quick Start (Pre-compiled Binary - Recommended)

Install system dependencies:

sudo apt install python3-evdev python3-pyaudio python3-yaml python3-gi ydotool portaudio19-dev
sudo usermod -a -G input $USER  # Required for keyboard/ydotool access
# Log out and back in for group change to take effect

Set up uinput permissions (required for ydotool):

echo 'KERNEL=="uinput", GROUP="input", MODE="0660"' | sudo tee /etc/udev/rules.d/80-uinput.rules
sudo udevadm control --reload-rules && sudo udevadm trigger
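
After logging back in, a quick check like the following can confirm that the group change and the udev rule took effect. This is a sketch using only the Python standard library; the function names are made up for illustration, and /dev/uinput is assumed to be the device node ydotool needs.

```python
import grp
import os


def can_use_uinput(path="/dev/uinput"):
    """True if the device node exists and is readable and writable by this user."""
    return os.path.exists(path) and os.access(path, os.R_OK | os.W_OK)


def in_group(name="input"):
    """True if the current process belongs to the named group."""
    try:
        gid = grp.getgrnam(name).gr_gid
    except KeyError:
        return False  # the group does not exist on this system
    return gid == os.getgid() or gid in os.getgroups()


print("member of 'input' group:", in_group())
print("/dev/uinput accessible: ", can_use_uinput())
```

If the first line prints False right after running usermod, the current session most likely predates the group change.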

Get whisper.cpp:
- Download the latest release from whisper.cpp releases (https://github.com/ggerganov/whisper.cpp/releases)
- Extract to ./whisper.cpp/ in the verbose directory
- Make sure the binary is at ./whisper.cpp/build/bin/whisper-cli

Download a model:

cd whisper.cpp
bash models/download-ggml-model.sh base  # or tiny/small/medium/large-v1
cd ..
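
To verify that the binary and model ended up where expected, a sketch like the one below can help. It assumes the download script saves the model as models/ggml-<name>.bin and that `whisper-cli` accepts `-m` (model), `-f` (audio file) and `-nt` (no timestamps); check `whisper-cli --help` for your build. The function name and `recording.wav` are hypothetical.

```python
from pathlib import Path


def whisper_command(root, audio, model="base"):
    """Build a whisper-cli command line, failing early if files are missing."""
    root = Path(root)
    binary = root / "build" / "bin" / "whisper-cli"
    model_file = root / "models" / f"ggml-{model}.bin"  # assumed file name
    missing = [str(p) for p in (binary, model_file) if not p.exists()]
    if missing:
        raise FileNotFoundError("missing: " + ", ".join(missing))
    return [str(binary), "-m", str(model_file), "-f", str(audio), "-nt"]


try:
    # recording.wav is a placeholder for a 16 kHz WAV file of your own.
    print(" ".join(whisper_command("./whisper.cpp", "recording.wav")))
except FileNotFoundError as err:
    print(err)
```

The returned list can be passed to `subprocess.run` to transcribe the file.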

Run it:

python3 verbose.py

Alternative: Compile for Better Performance

If pre-compiled binaries don't work or you want CPU-optimized builds:

# Install build dependencies
sudo apt install cmake build-essential

# Clone and build
git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp
mkdir build && cd build
cmake ..
make -j$(nproc)
cd ..

# Download model
bash models/download-ggml-model.sh base
cd ..

Auto-start on Login

./install-service.sh

This installs Verbose as a systemd user service. See Auto-start section for details.

Multiple Configs Example

# configs/coding.yaml - F9 for CLI safety
hotkey: "<f9>"
avoid_newlines: true  # Prevents accidental command execution
dictionary:
  "cloud code": "Claude Code"
  "postgres": "PostgreSQL"

# configs/writing.yaml - F10 for natural text
hotkey: "<f10>"
avoid_newlines: false  # Keeps paragraph breaks
shortcuts:
  "my email": "you@example.com"

Each hotkey loads its own config. Press F9 for coding, F10 for emails. Simple.
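
As a rough picture of what per-hotkey configs mean in practice, the sketch below picks a config by hotkey and applies `avoid_newlines` to a transcript. The configs are shown as already-parsed dictionaries, and `prepare_text` is a hypothetical helper; how verbose.py actually handles line breaks is an assumption here.

```python
import re

# Parsed equivalents of configs/coding.yaml and configs/writing.yaml.
CONFIGS = {
    "<f9>": {"name": "coding", "avoid_newlines": True},
    "<f10>": {"name": "writing", "avoid_newlines": False},
}


def prepare_text(hotkey, transcript, configs=CONFIGS):
    """Select the config bound to the hotkey and post-process the transcript."""
    config = configs[hotkey]
    if config.get("avoid_newlines"):
        # Collapse each run of line breaks (and the spaces around it) into
        # a single space so a typed newline cannot submit a shell command.
        transcript = re.sub(r"\s*\n\s*", " ", transcript).strip()
    return transcript


print(repr(prepare_text("<f9>", "ls -la\nrm file\n")))
print(repr(prepare_text("<f10>", "Hi team,\n\nThanks.")))
```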

Requirements

Ubuntu 22.04+ (or any Linux with evdev and ydotool). ~200MB disk space. That's it.

Contributing

Contributions welcome! Here's where help is needed:

Packaging & Distribution

The project currently requires manual installation. I don't know how to package for Linux distributions - if you do, help would be appreciated:

Snap package
Flatpak
AUR (Arch)
Debian/Ubuntu .deb
AppImage

Open an issue if you'd like to help with any of these.

Other Contributions

Bug fixes and improvements
Better error handling
Documentation improvements
Multi-language support

Read CLAUDE.md for architecture details. Keep changes simple and focused.

Development

Single Python file. ~500 lines. Built with Claude Code.

License

MIT - Do whatever you want with it.

Credits

Built with whisper.cpp by @ggerganov
Inspired by Talon Voice and Wispr Flow (but free and local)
Designed with Claude Code

Support

Issues? Check docs/TROUBLESHOOTING.md.

Still stuck? Open an issue.
