🎭 Inworld.ai: AI-Powered Expression
Advanced AI-driven TTS with emotional markup, context awareness, and support for 11 languages. Perfect for gaming, entertainment, and expressive customer interactions.
Quick Setup
1
Get API Key
- Visit Inworld Studio and create an account
- Navigate to Integrations → API Keys
- Generate a new API key for TTS
- Copy your Bearer token
2
Configure in Burki
- Go to AI Configuration → TTS tab
- Select Inworld.ai as provider
- Paste your Bearer token in the TTS API Key field
3
Choose Model & Voice
Select TTS-1 or TTS-1-Max model and your preferred voice
Available Models
🎪 TTS-1
~200ms latencyFlagship model with realistic, context-aware synthesisLanguages: 11 supported languages
Best for: Production applications, customer service
🔬 TTS-1-Max
~250ms latencyLarger, more expressive model (experimental)Languages: 11 supported languages
Best for: Creative content, gaming, entertainment
Multilingual Voice Library
- English
- Spanish
- French
- Other Languages
Hades
Deep and commandingPerfect for authoritative characters
Voice ID: HadesAlex
Clear and naturalGreat for professional applications
Voice ID: AlexAshley
Warm and friendlyIdeal for customer service
Voice ID: AshleyAria
Professional and articulatePerfect for business communications
Voice ID: AriaEmotional Markup System
🎭 Express Emotions in Speech
Inworld’s unique emotional markup allows you to add feelings and speaking styles directly in your text.
Emotional Tags
- Basic Emotions
- Speaking Styles
- Advanced Examples
Zero-Shot Voice Cloning
🎯 Custom Voice Creation
Create custom voices without training data. Just provide a voice ID and Inworld handles the rest.
1
Enable Custom Voice
In your assistant configuration, select “Custom” voice option
2
Enter Voice ID
Provide your custom voice identifier in the Custom Voice ID field
3
Test & Refine
Test with sample text and adjust based on results
Language Support & Quality
🌍 Production-Ready Languages
Inworld provides native-quality voices for multiple languages with varying production readiness.
| Language | Status | Voices Available | Quality Rating |
|---|---|---|---|
| English | 🟢 Production | 4+ voices | ⭐⭐⭐⭐⭐ |
| Spanish | 🟢 Production | 4+ voices | ⭐⭐⭐⭐⭐ |
| French | 🟢 Production | 4+ voices | ⭐⭐⭐⭐ |
| German | 🟡 Beta | 2+ voices | ⭐⭐⭐⭐ |
| Chinese | 🟡 Beta | 4+ voices | ⭐⭐⭐⭐ |
| Japanese | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
| Italian | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
| Portuguese | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
| Dutch | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
| Korean | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
| Polish | 🟡 Beta | 2+ voices | ⭐⭐⭐ |
API Integration
Use Case Examples
- Customer Support
- Gaming/Entertainment
- Educational Content
- Multilingual Business
Configuration Examples
🎛️ Optimal Settings
Recommended configurations for different use cases.
Phone Call Setup
Phone Call Setup
- Phone-compatible audio format
- Friendly, professional tone
- Emotional warmth in greeting
Multilingual Application
Multilingual Application
- Native Spanish voice
- Cultural appropriate emotions
- Higher quality audio format
Gaming/Interactive
Gaming/Interactive
- Maximum expression model
- Character-appropriate voice
- High-quality audio for immersion
Best Practices
🎯 Maximize Inworld's Emotional AI
Get the most out of Inworld’s unique features with these proven strategies.
Emotional Markup Guidelines
Emotional Markup Guidelines
Do’s:
- Use emotions that match the content context
- Place tags at natural speech boundaries
- Mix emotions sparingly for realistic conversations
- Test different voices with the same emotions
- Overuse emotional tags (max 2-3 per sentence)
- Use conflicting emotions close together
- Rely only on markup - the text itself should convey meaning
- Mix languages and emotions in complex ways
Voice Selection Strategy
Voice Selection Strategy
For Business Applications:
- English: Ashley, Alex, Aria
- Spanish: Lupita, Rafael
- French: Hélène, Mathieu
- Dramatic: Hades, Aria
- Friendly: Ashley, Diego
- Professional: Alex, Johanna
- Heroes: Aria, Alex
- Villains: Hades
- NPCs: Ashley, Diego
Performance Optimization
Performance Optimization
Pricing
💰 Flexible Pricing Model
Pay-per-character with volume discounts and free tier for testing.
| Plan | Characters/Month | Price | Features |
|---|---|---|---|
| Free | 25,000 | $0 | All voices, emotional markup |
| Starter | 100,000 | $9 | Priority processing |
| Professional | 500,000 | $29 | Custom voice support |
| Enterprise | Custom | Custom | Dedicated support, SLA |
Troubleshooting
Emotional Markup Not Working
Emotional Markup Not Working
Problem: Emotions don’t seem to affect the voiceSolutions:
- Verify emotion tag spelling (case-sensitive)
- Check if the voice supports the specific emotion
- Try with TTS-1-Max model for stronger expression
- Ensure tags are properly formatted:
[emotion]
Language Detection Issues
Language Detection Issues
Problem: Wrong language pronunciationSolutions:
- Explicitly set language parameter in API call
- Use native voices for each language
- Avoid mixing languages in single requests
- Verify voice ID supports the target language
Custom Voice Problems
Custom Voice Problems
Problem: Custom voice ID not workingSolutions:
- Verify custom voice ID with Inworld support
- Check if voice is properly trained and available
- Use fallback to standard voices during setup
- Contact support for voice activation status
Migration Guide
- From Other Providers
- Voice Mapping
Key Advantages of Switching:
- Emotional markup for better user engagement
- Native multilingual support (11 languages)
- Zero-shot voice cloning capabilities
- AI-powered context awareness
- Map existing voice preferences to Inworld voices
- Add emotional markup to enhance user experience
- Test multilingual capabilities if applicable
- Optimize for Inworld’s strengths (emotions, languages)
🎭 Ready to Add Emotion to Your AI?
Configure Inworld.ai in your assistant settings and start creating more engaging, expressive conversations!