Transcriber

Deepgram

Plug Deepgram speech-to-text into Bolna for fast, accurate phone call transcription. Supports low latency streaming, high accuracy, and custom vocabulary.

Book a Demo Bolna Docs

stttranscriberdeepgramspeech-to-textvoice-recognition

Official Documentation

At a glance

How Deepgram fits in the stack

Best for

Turning noisy phone audio into text the agent can reason over.

Use this layer when

Recognition accuracy, language coverage, and latency matter most.

Connects to

Telephony audio upstream and your LLM downstream.

Voice Stack

Speech-to-text is the listening layer

This provider converts raw audio into text in real time. It shapes how accurately the agent hears the caller and how natural the back-and-forth feels.

TelephonyPhone Network

→

STTListener

→

LLMReasoning

→

TTSVoice

→

ToolsActions

This page focuses on where Deepgram fits in a production voice stack. For full setup steps, credentials, and API details, use the documentation link above.

Overview

Deepgram is the leading speech-to-text (STT) provider purpose-built for real-time voice applications. With industry-leading accuracy and ultra-low latency, Deepgram powers the listening capability of your Bolna voice agents, converting spoken words into text that your LLM can understand and respond to.

Deepgram's AI models are trained on diverse datasets including phone conversations, making it the ideal choice for voice agent applications where every word counts.

Features & Use Cases

Highly Accurate (Nova-2 Model)
Deepgram gets it right over 95% of the time, easily understanding different accents, background noise, and varying phone call qualities.

Lightning-Fast Speed
It turns speech into text in under 300 milliseconds. This real-time speed prevents awkward pauses and keeps conversations smooth.

Custom Words & Jargon
You can teach the AI your specific industry terms, product names, or technical jargon so it always recognizes them correctly.

Speaks Many Languages
Understands over 30 languages - including English, Spanish, French, German, and Hindi - and accurately handles native accents.

Clean, Readable Text
Automatically adds punctuation, capitalizes the right words, and formats numbers so the final text is clean and ready to use.

Optimized for Phone Calls
Specially trained to handle typical phone call audio perfectly, making it ideal for call centers and customer support.

Knows Who is Speaking
Can automatically figure out and separate who is talking at any given moment, which is perfect for distinguishing the agent from the customer.

Other providers in this layer

Transcriber

Sarvam AI (STT)

Full Indian language AI stack for Bolna voice agents. LLM, speech-to-text, and text-to-speech tuned for Hindi, Tamil, Telugu, Kannada, and 10+ languages.

Browse this layer

Keep exploring the voice stack

Browse

Speech-to-Text

Speech-to-text converts what callers say into text that your LLM can process. Transcription accuracy and latency directly affect how natural a conversation feels. Bolna supports streaming STT providers optimized for telephony audio, including specialized models for Indian languages.

Browse

Telephony

Telephony providers connect your voice agents to the phone network so they can make and receive real calls. Bolna supports managed integrations with major carriers as well as bring-your-own-carrier via SIP trunking, giving you full control over call routing, number provisioning, and cost.

Browse

Large Language Models

The LLM is the brain of your voice agent. It understands what callers say and decides how to respond. Bolna lets you swap between models like GPT-4o, Claude, and DeepSeek without changing your agent configuration, so you can optimize for speed, cost, or reasoning depth.

Browse

Text-to-Speech

Text-to-speech turns your agent responses into spoken audio. Voice quality shapes how callers perceive your brand. Flat, robotic speech kills trust while natural, expressive voices build it. Bolna integrates with the fastest TTS providers so responses sound human and arrive without awkward pauses.

Browse

Tools & Workflows

Tools let your voice agents take action during a call, not just talk. Book a calendar slot, look up an order in Shopify, push a lead into your CRM, or trigger a multi-step automation in Zapier. These integrations turn voice agents from answering machines into workflow engines.

See where Deepgram fits in your production workflow

Use the demo to walk through provider selection, stack tradeoffs, and the exact workflow you want Bolna to automate.

Book a Demo Read Bolna Docs