Skip to content

Configuring Voice Settings

Choose voices, enable expressive mode, adjust silence detection, interruption sensitivity, and more.

3 min readConfiguration

Overview

Voice settings control how your AI sounds and behaves during calls. Fine-tuning these settings helps create a natural, professional experience for your callers.

Choosing Between Standard and Ultra Voices

AIVO offers two tiers of voice quality:

Standard Voices

  • Available on all plans.
  • Clear, professional speech.
  • Multiple male and female options with various accents.
  • Lower latency (faster response time).

Ultra Voices

  • Available on Professional and Enterprise plans.
  • More natural intonation and rhythm.
  • Better handling of complex sentences.
  • Slightly higher latency (adds roughly 200ms).

How to switch:

  1. Go to Voice & AI > Voice Settings.
  2. Toggle Ultra Quality on or off.
  3. Browse the voice library - voices marked with a star icon are Ultra.
  4. Click Preview to hear a sample, then Select to apply.

Expressive Mode Explained

Expressive mode adds natural emotional variation to the AI's speech. Instead of a flat, monotone delivery, the AI adjusts its tone based on context:

  • Warm and friendly for greetings.
  • Empathetic for apologies or issues.
  • Upbeat when confirming appointments.

Enable it: Voice & AI > Voice Settings > toggle Expressive Mode.

Note: Expressive mode works best with Ultra voices. Standard voices support a limited version.

Setting Silence Detection Timeout

Silence detection controls how long the AI waits for the caller to speak before prompting them.

  • Default: 5 seconds.
  • Short (3 seconds): Good for fast-paced interactions.
  • Long (8 seconds): Better for callers who may need extra time (elderly, ESL speakers).

Adjust it: Voice & AI > Advanced > Silence Timeout.

Adjusting Interruption Sensitivity

This controls how easily a caller can interrupt the AI while it is speaking.

  • High sensitivity: The AI stops speaking as soon as it detects the caller's voice. Best for conversational, back-and-forth interactions.
  • Medium sensitivity (default): The AI pauses after a short burst of caller speech. Balances responsiveness with avoiding false triggers from background noise.
  • Low sensitivity: The AI finishes its current sentence before pausing. Best for noisy environments or when the AI is delivering important information.

Adjust it: Voice & AI > Advanced > Interruption Sensitivity.

Max Call Duration Settings

Set a maximum length for calls to prevent unusually long sessions:

  • Default: 15 minutes.
  • Range: 1 to 60 minutes.
  • When the limit is reached, the AI politely wraps up: "I want to make sure I've been helpful. Is there anything else before we end the call?"

Adjust it: Voice & AI > Advanced > Max Duration.

TTS and STT Provider Options

AIVO supports multiple text-to-speech (TTS) and speech-to-text (STT) engines:

TTS Providers

  • AIVO Default - Optimized for low latency and natural speech.
  • ElevenLabs - Premium voices with superior expressiveness (Enterprise only).

STT Providers

  • AIVO Default - Fast, accurate transcription.
  • Deepgram - Enhanced accuracy for noisy environments (Enterprise only).

Change providers: Voice & AI > Advanced > TTS Provider / STT Provider.

Most businesses get excellent results with the default providers. Only switch if you have a specific need.

Was this article helpful?