ποΈ Give Your Assistant a Voice
Transform text into natural, human-like speech with our integrated TTS providers. Each provider offers unique advantages for different use cases.
Quick Provider Comparison
β‘ ElevenLabs
Premium Quality & Customization70+ languages, voice cloning, advanced controlsBest for: High-quality customer interactions
π Deepgram Aura
Ultra-Low Latency3x faster than competitors, phone-optimizedBest for: Real-time conversations
π Inworld.ai
AI-Powered EmotionsMultilingual, emotional markup, voice cloningBest for: Expressive, contextual responses
π― Resemble AI
Custom Voice CreationWebSocket streaming, personalized voicesBest for: Brand-specific voice identity
Feature Matrix
| Provider | Latency | Languages | Voice Cloning | Streaming | Best For |
|---|---|---|---|---|---|
| ElevenLabs | ~250ms | 70+ | β Advanced | WebSocket | Premium quality |
| Deepgram | ~75ms | English | β | WebSocket | Speed & phone calls |
| Inworld | ~200ms | 11 | β Zero-shot | HTTP/WS | Emotional expression |
| Resemble | ~300ms | English | β Custom | WebSocket | Brand voices |
Setup Overview
All providers follow the same basic setup pattern:1
Get API Credentials
Sign up with your chosen provider and obtain API keys
2
Configure in Burki
Add your credentials in the assistantβs AI Configuration β TTS tab
3
Select Voice & Model
Choose from available voices and models for your use case
4
Fine-tune Settings
Adjust speed, stability, and other provider-specific options
Provider Deep Dives
π ElevenLabs - Premium Voice Quality
Latest Models: Flash v2.5 (75ms), v3 (70+ languages), Turbo v2.5Key Features: Advanced voice controls, multilingual support, custom voice creationPerfect For: Customer service, content creation, multilingual applicationsβ Complete ElevenLabs Guide
β‘ Deepgram Aura - Ultra-Fast TTS
Latest Models: Aura-2 (next-gen), Aura (proven)Key Features: Industry-leading speed, phone optimization, Β΅-law encodingPerfect For: Real-time phone calls, live chat, interactive applicationsβ Complete Deepgram Guide
πͺ Inworld.ai - AI-Powered Expression
Latest Models: TTS-1 (flagship), TTS-1-Max (experimental)Key Features: Emotional markup, context awareness, 11 languagesPerfect For: Gaming, entertainment, emotional customer supportβ Complete Inworld Guide
π― Resemble AI - Custom Brand Voices
Key Features: WebSocket streaming, unlimited voice creation, business plansPerfect For: Brand consistency, personalized experiences, enterpriseβ Complete Resemble Guide
Advanced Topics
ποΈ Voice Tuning
Master stability, similarity, and style controls across all providers
π§ Troubleshooting
Common issues and solutions with step-by-step fixes
π Best Practices
Performance optimization, cost reduction, and production tips
Related Documentation
π See Also
Configuration: Learn how to configure TTS in your AI Configuration settings.Integration: Understand how TTS fits into the overall Architecture of Burki Voice AI.Call Management: Discover how TTS works with Call Management features.
Quick Start Guide
- Business Calls
- Customer Support
- Multilingual
- Real-time Apps
Recommended Setup:
- Provider: ElevenLabs or Deepgram
- Model: Flash v2.5 or Aura-2
- Voice: Professional (Rachel, Asteria)
- Settings: Stability 0.5, Speaker Boost ON
π Ready to Get Started?
Choose your provider and dive into the detailed setup guides, or check out our Best Practices for optimization tips.