Summary
Add selectable voice input modes for the chat composer so users can choose between assistant voice, browser-local dictation, and ElevenLabs realtime transcription.
Motivation
The current voice path is optimized for the assistant conversation flow. For text entry, users need a lower-friction dictation option and a cloud-backed realtime transcription option.
Proposed scope
- add a voice mode setting in the web app
- support browser-native local dictation for low-latency text entry
- support ElevenLabs realtime transcription for streaming cloud dictation
- add the hub endpoint needed to mint single-use realtime scribe tokens
- normalize language selection between browser locales and ElevenLabs transcription codes
Notes
This is intentionally focused on realtime dictation for the text composer, not on changing the existing voice-assistant conversation flow.
Summary
Add selectable voice input modes for the chat composer so users can choose between assistant voice, browser-local dictation, and ElevenLabs realtime transcription.
Motivation
The current voice path is optimized for the assistant conversation flow. For text entry, users need a lower-friction dictation option and a cloud-backed realtime transcription option.
Proposed scope
Notes
This is intentionally focused on realtime dictation for the text composer, not on changing the existing voice-assistant conversation flow.