List of STT Providers

Configuring transcription and voice recognition.

STT (Speech-to-Text) is the "ears" of the agent: it transcribes user audio into text for the LLM. Low latency and high accuracy are vital, especially for detecting when the user interrupts the agent.

Supported Providers
