๐๏ธ Give Your Assistant a Voice
Transform text into natural, human-like speech with our integrated TTS providers. Each provider offers unique advantages for different use cases.
Quick Provider Comparison
โก ElevenLabs
Premium Quality & Customization70+ languages, voice cloning, advanced controlsBest for: High-quality customer interactions
๐ Deepgram Aura
Ultra-Low Latency3x faster than competitors, phone-optimizedBest for: Real-time conversations
๐ญ Inworld.ai
AI-Powered EmotionsMultilingual, emotional markup, voice cloningBest for: Expressive, contextual responses
๐ฏ Resemble AI
Custom Voice CreationWebSocket streaming, personalized voicesBest for: Brand-specific voice identity
Feature Matrix
Provider | Latency | Languages | Voice Cloning | Streaming | Best For |
---|---|---|---|---|---|
ElevenLabs | ~250ms | 70+ | โ Advanced | WebSocket | Premium quality |
Deepgram | ~75ms | English | โ | WebSocket | Speed & phone calls |
Inworld | ~200ms | 11 | โ Zero-shot | HTTP/WS | Emotional expression |
Resemble | ~300ms | English | โ Custom | WebSocket | Brand voices |
Setup Overview
All providers follow the same basic setup pattern:1
Get API Credentials
Sign up with your chosen provider and obtain API keys
2
Configure in Burki
Add your credentials in the assistantโs AI Configuration โ TTS tab
3
Select Voice & Model
Choose from available voices and models for your use case
4
Fine-tune Settings
Adjust speed, stability, and other provider-specific options
Provider Deep Dives
๐ญ ElevenLabs - Premium Voice Quality
Latest Models: Flash v2.5 (75ms), v3 (70+ languages), Turbo v2.5Key Features: Advanced voice controls, multilingual support, custom voice creationPerfect For: Customer service, content creation, multilingual applicationsโ Complete ElevenLabs Guide
โก Deepgram Aura - Ultra-Fast TTS
Latest Models: Aura-2 (next-gen), Aura (proven)Key Features: Industry-leading speed, phone optimization, ยต-law encodingPerfect For: Real-time phone calls, live chat, interactive applicationsโ Complete Deepgram Guide
๐ช Inworld.ai - AI-Powered Expression
Latest Models: TTS-1 (flagship), TTS-1-Max (experimental)Key Features: Emotional markup, context awareness, 11 languagesPerfect For: Gaming, entertainment, emotional customer supportโ Complete Inworld Guide
๐ฏ Resemble AI - Custom Brand Voices
Key Features: WebSocket streaming, unlimited voice creation, business plansPerfect For: Brand consistency, personalized experiences, enterpriseโ Complete Resemble Guide
Advanced Topics
๐๏ธ Voice Tuning
Master stability, similarity, and style controls across all providers
๐ง Troubleshooting
Common issues and solutions with step-by-step fixes
๐ Best Practices
Performance optimization, cost reduction, and production tips
Related Documentation
๐ See Also
Configuration: Learn how to configure TTS in your AI Configuration settings.Integration: Understand how TTS fits into the overall Architecture of Burki Voice AI.Call Management: Discover how TTS works with Call Management features.
Quick Start Guide
Recommended Setup:
- Provider: ElevenLabs or Deepgram
- Model: Flash v2.5 or Aura-2
- Voice: Professional (Rachel, Asteria)
- Settings: Stability 0.5, Speaker Boost ON
๐ Ready to Get Started?
Choose your provider and dive into the detailed setup guides, or check out our Best Practices for optimization tips.