🎙️ Vozes: Professional Voice Dictation for Linux

Vozes is a high-performance, privacy-focused voice dictation system for Linux. Powered by a native C++ implementation of OpenAI's Whisper, it allows you to type with your voice anywhere—from professional IDEs to simple text editors—with zero latency and 100% offline processing.

✨ Key Features

🚀 Blazing Fast: Powered by whisper.cpp for native performance.
🔒 100% Private: Everything stays on your machine. No cloud, no APIs, no tracking.
⌨️ Global Typing: Works like a virtual keyboard. Dictate directly into any active window.
🐕 Wake-Word Support: Start dictating hands-free with "Hey Jarvis" (OpenWakeWord integration).
🛠️ Optimized for Linux: Native GTK4/Adwaita interface, Udev rules for hotkeys, and seamless system integration.
📦 Multi-Arch: Native support for both Intel/AMD (x64) and ARM (Raspberry Pi, Apple Silicon/Asahi).

🛠️ Installation

1. Download the latest release

Grab the .deb package for your architecture from the releases section.

Note that the .deb contained in every release are built in different devices such as Proxmox or Ubuntu and the experience may be different in some architectures.

2. Install using APT

sudo apt install ./vozes_1.5.0_amd64.deb

Note: This will automatically set up a dedicated Python virtual environment and system dependencies to keep your OS clean.

Known errors:

PyAudio:

Run this

sudo apt-get install portaudio19-dev python3-dev

3. Permissions (First time only)

To allow the app to listen to global hotkeys and type on your behalf, ensure your user is in the input group:

sudo usermod -aG input $USER
# Log out and log back in for changes to take effect

Requirements:

PyAudio==0.2.14
numpy>=2.1.0
webrtcvad==2.0.10
onnxruntime>=1.17.0
scipy>=1.13.0
scikit-learn>=1.4.0
tqdm>=4.66.0
requests==2.31.0
evdev==1.7.0
PyGObject==3.48.2

🚀 How to Use

Launch: Open "Vozes" from your applications menu.
Select Model: Choose between tiny, base, or small depending on your CPU power.
Dictate:
- Push-to-Talk: Set a global hotkey in settings.
- Wake-Word: Just say "Hey Jarvis" and start speaking.
Automatic Typing: Your speech will be converted to text and typed instantly at your cursor location.

🏗️ Building from Source

If you want to build the package yourself:

# Clone the repo with submodules
git clone --recursive https://github.com/InledGroup/vozes.git
cd vozes

# Build the whisper-cli binary
cd bin/whisper.cpp
mkdir build && cd build
cmake .. -DWHISPER_SDL2=OFF -DWHISPER_ALL_EXTRAS=OFF -DWHISPER_BUILD_EXAMPLES=ON
make -j$(nproc) whisper-cli
cd ../../../

# Create the .deb package
./build_deb.sh

🔧 Requirements

OS: Ubuntu 22.04+, Debian 12+, or any Debian-based distro.
Python: 3.10 or higher.
Libraries: libgirepository1.0-dev, libportaudio2, libevdev2.

🤝 Contributing

Contributions are welcome! Whether it's a bug report, a new feature, or a translation, feel free to open an Issue or a Pull Request.

📄 License

Vozes is released under the GNU GPLv3. See LICENSE for more details.

Built with ❤️ by JaimeGH.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data		data
src		src
vozes_1.1.0_amd64		vozes_1.1.0_amd64
vozes_1.1.0_arm64		vozes_1.1.0_arm64
vozes_1.5.0_arm64		vozes_1.5.0_arm64
vozes_1.6.0_arm64		vozes_1.6.0_arm64
.gitignore		.gitignore
README.md		README.md
build_deb.sh		build_deb.sh
org.vozes.Vozes.yaml		org.vozes.Vozes.yaml
run_local.sh		run_local.sh
test_flatpak.sh		test_flatpak.sh
vozes.png		vozes.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Vozes: Professional Voice Dictation for Linux

✨ Key Features

🛠️ Installation

1. Download the latest release

2. Install using APT

Known errors:

PyAudio:

3. Permissions (First time only)

Requirements:

🚀 How to Use

🏗️ Building from Source

🔧 Requirements

🤝 Contributing

📄 License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎙️ Vozes: Professional Voice Dictation for Linux

✨ Key Features

🛠️ Installation

1. Download the latest release

2. Install using APT

Known errors:

PyAudio:

3. Permissions (First time only)

Requirements:

🚀 How to Use

🏗️ Building from Source

🔧 Requirements

🤝 Contributing

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages