Voice Input
Last updated
Kaizen supports voice input for hands-free interaction. Speak your tasks instead of typing them.
Go to Settings > Voice (/settings/voice)
Enable Voice Input
Optionally configure the transcription model and keyboard shortcut
Choose which model handles speech-to-text conversion. The default is google/gemini-2.5-flash, which provides fast and accurate transcription.
Customize the system prompt used for audio-to-text conversion. The default prompt is optimized for clean, accurate transcription that preserves your intent.
Set a keyboard shortcut to start/stop voice recording. When you press the shortcut, Kaizen listens for your speech, transcribes it, and places the text in the chat input.
Press the dictation shortcut or click the voice input button
Speak your message
The audio is sent to the configured transcription model via OpenRouter
The transcribed text appears in the chat input
Review and send the message
Last updated