Product Guide

Voice Transcription

Recording lifecycle, device management, and streaming behaviour for voice-driven prompts.

September 22, 2025

Voice transcription is available anywhere the desktop app exposes dictation controls, including the plan terminal and prompt editors. The feature records audio locally, sends chunks to the transcription service, and inserts recognised text into the active input field without blocking manual typing.

Recording workflow

The recording hook maintains a state machine with four states: idle, recording, processing, and error. It tracks duration, runs silence detection, and stops recordings automatically after ten minutes. Audio chunks are buffered and forwarded to the transcription action, which returns the recognised text for insertion.
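
The four states and the ten-minute cap can be illustrated with a small transition function. This is a minimal sketch, not the hook's actual code: the event names, the `transition` function, and the `MAX_DURATION_MS` constant are all hypothetical, and the real hook may handle more transitions than shown here.

```typescript
// Hypothetical sketch of the recording state machine described above.
// State names come from the guide; event names are assumptions.
type RecordingState = "idle" | "recording" | "processing" | "error";

type RecordingEvent =
  | { type: "start" }
  | { type: "stop" }
  | { type: "tick"; elapsedMs: number }
  | { type: "transcribed" }
  | { type: "fail" }
  | { type: "reset" };

// Ten-minute automatic stop, as stated in the guide.
const MAX_DURATION_MS = 10 * 60 * 1000;

function transition(state: RecordingState, event: RecordingEvent): RecordingState {
  switch (state) {
    case "idle":
      return event.type === "start" ? "recording" : state;
    case "recording":
      if (event.type === "stop") return "processing";
      // Stop automatically once the duration cap is reached.
      if (event.type === "tick" && event.elapsedMs >= MAX_DURATION_MS) return "processing";
      if (event.type === "fail") return "error";
      return state;
    case "processing":
      if (event.type === "transcribed") return "idle";
      if (event.type === "fail") return "error";
      return state;
    case "error":
      // Assumption: the user must explicitly reset before recording again.
      return event.type === "reset" ? "idle" : state;
  }
}
```

Keeping the transitions in a pure function like this makes the automatic stop easy to verify without a microphone: a `tick` event carrying `elapsedMs` at or above the cap moves the machine from recording to processing.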

Project-aware settings

When a recording session starts, the hook looks up the active project’s transcription configuration. Language codes, preferred models, and other settings are retrieved before capturing audio so recordings follow the project’s preferences.
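
As an illustration of resolving settings before capture begins, the sketch below looks up a project's configuration and fills any missing fields from defaults. The `TranscriptionSettings` shape, the in-memory `Map` store, the default values, and the fallback behaviour itself are assumptions made for this example; the guide only states that language codes and preferred models are retrieved per project.

```typescript
// Hypothetical settings shape; the guide mentions language codes and preferred models.
interface TranscriptionSettings {
  language: string;
  model: string;
}

// Assumed defaults, used when a project does not specify a value.
const DEFAULT_SETTINGS: TranscriptionSettings = { language: "en", model: "default" };

// Stand-in for wherever project configuration is actually stored.
type ProjectConfigStore = Map<string, Partial<TranscriptionSettings>>;

function resolveSettings(
  store: ProjectConfigStore,
  projectId: string | null,
): TranscriptionSettings {
  // No active project, or no stored configuration: use an empty override set.
  const overrides: Partial<TranscriptionSettings> =
    projectId !== null ? store.get(projectId) ?? {} : {};
  return {
    language: overrides.language ?? DEFAULT_SETTINGS.language,
    model: overrides.model ?? DEFAULT_SETTINGS.model,
  };
}

// Usage: resolve once when the session starts, before any audio is captured.
const exampleStore: ProjectConfigStore = new Map();
exampleStore.set("proj-a", { language: "de" });
const sessionSettings = resolveSettings(exampleStore, "proj-a");
```

Resolving the settings once at session start, rather than per chunk, keeps every chunk in a recording on the same language and model.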

Device management

The feature requests microphone permission, enumerates available audio inputs, and lets users switch devices during a session. Audio levels are monitored live so the UI can surface silence warnings if the microphone is muted or disconnected.
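
One way to turn live audio levels into a silence warning is to compute the root-mean-square (RMS) level of each audio frame and warn once the level has stayed below a threshold for long enough. The sketch below assumes that approach; the `SilenceMonitor` class, the 0.01 threshold, and the 3000 ms window are hypothetical values chosen for illustration, not the app's actual parameters.

```typescript
// RMS level of one frame of samples in the range [-1, 1].
function rms(samples: Float32Array): number {
  if (samples.length === 0) return 0;
  let sumSquares = 0;
  for (let i = 0; i < samples.length; i++) {
    sumSquares += samples[i] * samples[i];
  }
  return Math.sqrt(sumSquares / samples.length);
}

// Hypothetical monitor: tracks how long the input has been continuously quiet.
class SilenceMonitor {
  private silentMs = 0;
  private readonly threshold: number;
  private readonly warnAfterMs: number;

  constructor(threshold: number = 0.01, warnAfterMs: number = 3000) {
    this.threshold = threshold;
    this.warnAfterMs = warnAfterMs;
  }

  // Feed one frame; returns true when the UI should show a silence warning.
  update(samples: Float32Array, frameMs: number): boolean {
    if (rms(samples) < this.threshold) {
      this.silentMs += frameMs;
    } else {
      this.silentMs = 0; // any audible frame resets the silence window
    }
    return this.silentMs >= this.warnAfterMs;
  }
}
```

If the app captures audio through the Web Audio API (an assumption), frames like these could be read from an `AnalyserNode` with `getFloatTimeDomainData`. A muted or disconnected microphone typically yields frames of zeros, which would trigger the warning once the window elapses.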