Skip to main content

What is the Voice Tab?

The Voice Tab is where you configure how your AI agent sounds. Select from multiple voice synthesis providers, choose specific voices, and fine-tune audio settings like buffer size and ambient noise to create the perfect voice experience for your users.
Access Bolna playground from https://platform.bolna.ai/.
Voice configuration tab in Bolna Playground showing text-to-speech provider options, voice selection, buffer size settings, ambient noise controls, and audio quality parameters for Voice AI agents

Voice Tab on Bolna Playground

Voice configuration options

  1. Choose your TTS Provider and Voice
  • ElevenLabs is the most realistic and costliest voice
  • Deepgram and Azure TTS are the quickest and cheapest providers.
  1. Play around with more voices from each provider in Voice Labs before finalising on the voice you want. Pressing the play button will enable your selected voice to speak out the Welcome Message that you have set
  2. Increasing buffer size enables agent to speak long responses fluently, but increases latency. Buffer sizes of ~250 are ideal for most conversations
  3. Ambient noise removes the pin-drop silence between a conversation and makes it more realistic. However, be careful not to let the background noise be a distraction
  4. Agent will check if user is still active in the call after a fixed time that you can decide. You can customise the message the user will use to ask

Next steps

Ready to perfect your agent’s voice? Explore related features: Return to Agent Tab to configure prompts or LLM Tab to select your language model.
I