🎧 DeepEcho - Real-time Voice AI Assistant

DeepEcho is a comprehensive real-time voice transcription and AI assistant system that supports multiple AI providers. It captures both microphone and speaker audio, provides live transcription, and generates intelligent response suggestions using various AI models including DeepSeek, OpenAI GPT, Claude, Grok, and more.

✨ Key Features

🎤 Real-time Audio Capture: Simultaneous microphone and speaker audio recording
📝 Live Transcription: Real-time speech-to-text with local and API modes
🤖 Multi-AI Provider Support: DeepSeek, OpenAI, Claude, Grok, Volcano Engine, and GLM
🎨 Modern UI: New integrated interface with AI provider selection
⚙️ Flexible Configuration: JSON-based configuration with multiple presets
🔧 Cross-platform: Windows and macOS support
📊 System Monitoring: Built-in diagnostics and resource optimization
🛡️ Error Recovery: Comprehensive error handling and retry mechanisms

🚀 Quick Start

📋 Prerequisites

Python >=3.8.0
FFmpeg (for audio processing)
At least one AI provider API key (see API Setup Guide)
Windows 10/11 or macOS 10.14+

🔧 Installation

Clone the repository:

```
git clone https://github.com/zemochen/deep_echo.git
cd deep_echo
```

Install dependencies:
```
pip install -r requirements.txt
```
Set up security (Recommended):
```
# Linux/macOS
chmod +x setup_security.sh
./setup_security.sh

# Windows
setup_security.bat
```
This will configure git hooks to prevent accidental API key commits.

Set up API keys (choose one method):

Method 1: Configuration File (Recommended)

```
cp resources/config.example.json config.json
# Edit config.json and add your API key
```

Method 2: Environment Variable

```
export DEEPSEEK_API_KEY="sk-your-key-here"
# or
export OPENAI_API_KEY="sk-your-key-here"
```

Method 3: Legacy keys.py (Secure)

```
# Copy the template file
cp keys.example.py keys.py
# Edit keys.py and add your actual API keys
# Note: keys.py is in .gitignore and will NOT be committed to git
```

⚠️ Security Note: Never commit your actual API keys to version control. The keys.py file is automatically excluded from git commits. See SECURITY.md for detailed security guidelines.
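
To illustrate how the configuration file and environment variable methods above could fit together, here is a minimal sketch of a key lookup. The helper name `resolve_api_key` and the precedence (config file first, then environment variables) are assumptions for illustration, not DeepEcho's actual loading logic; the `ai_provider.api_key` field and the variable names come from this README.

```python
import json
import os
from pathlib import Path
from typing import Optional

# Environment variable names from Method 2 above.
ENV_VARS = ("DEEPSEEK_API_KEY", "OPENAI_API_KEY")

# Placeholder value used in this README's example configuration.
PLACEHOLDER = "your-key-here"


def resolve_api_key(config_path: str = "config.json") -> Optional[str]:
    """Return the first API key found, or None (hypothetical helper)."""
    # 1. Configuration file: look for ai_provider.api_key.
    path = Path(config_path)
    if path.is_file():
        try:
            config = json.loads(path.read_text(encoding="utf-8"))
        except json.JSONDecodeError:
            config = {}
        if isinstance(config, dict):
            provider = config.get("ai_provider", {})
            key = provider.get("api_key", "") if isinstance(provider, dict) else ""
            if key and key != PLACEHOLDER:
                return key
    # 2. Environment variables, checked in the order listed above.
    for name in ENV_VARS:
        value = os.environ.get(name)
        if value:
            return value
    return None
```

Returning `None` rather than raising lets the caller decide how to report a missing key.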

Run DeepEcho:
```
python main.py
```

🎯 Usage Modes

Default Mode (Integrated Application)

```
python main.py
```

Uses new integrated architecture
Automatic AI provider detection
Modern UI with provider selection
Comprehensive error handling

API Transcription Mode

```
python main.py --api
```

Uses OpenAI Whisper API for transcription
Higher accuracy and multi-language support
Requires internet connection

Legacy Mode

```
python main.py --legacy
```

Uses original application architecture
Backward compatibility mode
Legacy UI interface

Verbose Logging

```
python main.py --verbose
```

Detailed logging for troubleshooting
System diagnostics information
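
The flags above could be wired up roughly as follows. This is a sketch using the standard `argparse` module, not the contents of `main.py`; the function names `build_parser` and `select_mode`, and the choice to let `--legacy` take precedence over `--api`, are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the three flags documented in this README."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--api", action="store_true",
                        help="use API transcription instead of local transcription")
    parser.add_argument("--legacy", action="store_true",
                        help="run the original application architecture")
    parser.add_argument("--verbose", action="store_true",
                        help="enable detailed logging")
    return parser


def select_mode(args: argparse.Namespace) -> str:
    """Map parsed flags to a mode name (assumed precedence: legacy > api > default)."""
    if args.legacy:
        return "legacy"
    return "api" if args.api else "default"
```

For example, `select_mode(build_parser().parse_args(["--api", "--verbose"]))` returns `"api"`, and the `verbose` attribute on the parsed arguments can then be used to set the logging level.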

🤖 Supported AI Providers

| Provider | Models | Setup Guide |
| --- | --- | --- |
| DeepSeek | deepseek-chat, deepseek-coder | DeepSeek Setup |
| OpenAI | gpt-3.5-turbo, gpt-4, gpt-4o | OpenAI Setup |
| Claude | claude-3-haiku, claude-3-sonnet, claude-3-opus | Claude Setup |
| Grok | grok-beta, grok-2 | Grok Setup |
| Volcano Engine | doubao-pro, doubao-lite | Volcano Setup |
| GLM | qwen-turbo, qwen-plus, qwen-max | GLM Setup |

⚙️ Configuration

Configuration Files

DeepEcho supports multiple configuration presets:

resources/config.example.json - Template configuration
resources/config.deepseek.json - DeepSeek optimized settings
resources/config.openai.json - OpenAI optimized settings

Configuration Options

The `//` comments below are annotations for this README. Standard JSON does not allow comments, so remove them from a real config.json unless your loader accepts them.

```
{
  "audio": {
    "use_api_mode": true,          // true: API transcription, false: local transcription
    "record_timeout": 3,           // Recording timeout (seconds)
    "energy_threshold": 1000       // Audio sensitivity
  },
  "ai_provider": {
    "provider_type": "deepseek",   // AI provider to use
    "api_key": "your-key-here",    // API key
    "model": "deepseek-chat",      // Model name
    "response_interval": 5         // Response update interval
  },
  "ui": {
    "use_new_ui": true,           // Use new integrated UI
    "theme": "dark",              // UI theme
    "window_width": 1200          // Window width
  }
}
```
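
Because the example above is annotated with `//` comments, Python's `json` module will reject it as written. One way to read such a file, sketched here under the assumption that comments only ever start with `//`, is to strip them while leaving string values (which may themselves contain `//`, as in URLs) untouched. The helper names `strip_line_comments` and `load_config` are hypothetical and are not part of DeepEcho.

```python
import json
import re

# Matches either a double-quoted JSON string (kept as-is) or a // comment (dropped).
_TOKEN = re.compile(r'"(?:\\.|[^"\\])*"|//[^\n]*')


def strip_line_comments(text: str) -> str:
    """Remove // comments that sit outside string literals."""
    return _TOKEN.sub(
        lambda m: m.group(0) if m.group(0).startswith('"') else "", text
    )


def load_config(text: str) -> dict:
    """Parse annotated JSON text into a dictionary."""
    return json.loads(strip_line_comments(text))


EXAMPLE = """
{
  "audio": {"record_timeout": 3},              // Recording timeout (seconds)
  "ai_provider": {"provider_type": "deepseek"} // AI provider to use
}
"""

config = load_config(EXAMPLE)
print(config["audio"]["record_timeout"])  # prints 3
```

Matching strings and comments in a single pattern means a `//` inside a quoted value is consumed as part of the string and never mistaken for a comment.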

🖥️ System Requirements

Windows

Windows 10/11
Python 3.8+
PyAudioWPatch (auto-installed)
FFmpeg

macOS

macOS 10.14+
Python 3.8+
BlackHole virtual audio device
FFmpeg, PortAudio

macOS Setup:

```
brew install ffmpeg portaudio python-tk blackhole-2ch
```

🔧 Advanced Features

System Monitoring

Real-time resource usage tracking
Thread health monitoring
Queue size optimization
Memory usage alerts

Error Recovery

Automatic retry with exponential backoff
Graceful degradation on failures
Component health checks
Network failure handling
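
As an illustration of the "automatic retry with exponential backoff" item above, here is a generic sketch of the technique. It is not DeepEcho's implementation; the function name `retry_with_backoff` and all default values are assumptions chosen for the example.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    func: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    max_delay: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func until it succeeds or max_attempts is reached."""
    for attempt in range(max_attempts):
        try:
            return func()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            # The delay doubles on each attempt (0.5, 1, 2, ...) up to max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # A little random jitter spreads out retries from concurrent callers.
            sleep(delay + random.uniform(0, delay / 10))
    raise RuntimeError("max_attempts must be at least 1")
```

A call such as `retry_with_backoff(lambda: client.send(request))` would wrap a network request, where `client` and `request` stand in for whatever the caller uses. The `sleep` parameter exists so tests can replace the real delay.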

Performance Optimization

Multi-threaded architecture
Resource usage optimization
Queue management
Memory leak prevention
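
One common way to keep queue sizes and memory bounded in a multi-threaded audio pipeline is a fixed-size queue that discards the oldest entry when full. The sketch below shows that general idea only; whether DeepEcho manages its queues this way is not stated here, and the helper name `put_drop_oldest` is hypothetical.

```python
import queue


def put_drop_oldest(q: queue.Queue, item) -> bool:
    """Put item on a bounded queue, discarding the oldest entry if it is full.

    Returns True if an older entry had to be dropped.
    """
    dropped = False
    while True:
        try:
            q.put_nowait(item)
            return dropped
        except queue.Full:
            try:
                q.get_nowait()  # make room by removing the oldest entry
                dropped = True
            except queue.Empty:
                pass  # a consumer emptied the queue in the meantime; retry the put


# Usage: a queue that holds at most two audio chunks.
chunks = queue.Queue(maxsize=2)
for chunk in (b"a", b"b", b"c"):
    put_drop_oldest(chunks, chunk)
```

After the loop, the queue holds `b"b"` and `b"c"`. Dropping the oldest chunk favors fresh audio over completeness, which suits live transcription but would be the wrong trade-off for archival recording.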

🐛 Troubleshooting

Common Issues

"FFmpeg not found"

```
# Windows (with Chocolatey)
choco install ffmpeg

# macOS
brew install ffmpeg
```
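
To confirm that the installation worked, you can check whether `ffmpeg` is on your `PATH`. The snippet below is a small standalone check using the standard library's `shutil.which`; the function name `find_ffmpeg` is made up for this example, and the snippet is separate from DeepEcho's own diagnostics.

```python
import shutil
from typing import Optional


def find_ffmpeg() -> Optional[str]:
    """Return the full path to the ffmpeg executable if it is on PATH, else None."""
    return shutil.which("ffmpeg")


path = find_ffmpeg()
if path:
    print(f"FFmpeg found at {path}")
else:
    print("FFmpeg not found on PATH")
```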

"No AI provider configured"

Check your API key in configuration
Verify the key format and permissions
See API Setup Guide

Audio device issues

Ensure default audio devices are properly set
On macOS, configure BlackHole for speaker capture
Check system audio permissions

High memory usage

Enable resource optimization in config
Reduce queue size limits
Use local transcription mode

Getting Help

Run with verbose logging: python main.py --verbose
Check the system diagnostics output
Review configuration file syntax
Consult the API Setup Guide

📊 Performance Tips

For best accuracy: Use --api mode with internet connection
For lowest latency: Use local mode with tiny Whisper model
For cost efficiency: Use DeepSeek or Claude Haiku
For best quality: Use OpenAI GPT-4 or Claude Opus

🔄 Migration from Legacy Version

If upgrading from an older version:

Backup your keys.py file

Create new configuration file:

```
cp resources/config.example.json config.json
```

Update your API keys in the new format
Test with: python main.py --legacy (fallback mode)

📖 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please follow these guidelines:

Code Style: Follow PEP 8 guidelines
Testing: Add tests for new features
Documentation: Update documentation for changes
Commits: Use clear, descriptive commit messages

Development Setup

```
pip install -r requirements-dev.txt
pytest tests/
```

🙏 Acknowledgments

OpenAI for Whisper and GPT models
Anthropic for Claude models
DeepSeek for accessible AI APIs
All contributors and users of DeepEcho
