π Inworld.ai: AI-Powered Expression
Advanced AI-driven TTS with emotional markup, context awareness, and support for 11 languages. Perfect for gaming, entertainment, and expressive customer interactions.
Quick Setup
1
Get API Key
- Visit Inworld Studio and create an account
- Navigate to Integrations β API Keys
- Generate a new API key for TTS
- Copy your Bearer token
2
Configure in Burki
- Go to AI Configuration β TTS tab
- Select Inworld.ai as provider
- Paste your Bearer token in the TTS API Key field
3
Choose Model & Voice
Select TTS-1 or TTS-1-Max model and your preferred voice
Available Models
πͺ TTS-1
~200ms latencyFlagship model with realistic, context-aware synthesisLanguages: 11 supported languages
Best for: Production applications, customer service
π¬ TTS-1-Max
~250ms latencyLarger, more expressive model (experimental)Languages: 11 supported languages
Best for: Creative content, gaming, entertainment
Multilingual Voice Library
Hades
Deep and commandingPerfect for authoritative characters
Voice ID: Hades
Alex
Clear and naturalGreat for professional applications
Voice ID: Alex
Ashley
Warm and friendlyIdeal for customer service
Voice ID: Ashley
Aria
Professional and articulatePerfect for business communications
Voice ID: Aria
Emotional Markup System
π Express Emotions in Speech
Inworldβs unique emotional markup allows you to add feelings and speaking styles directly in your text.
Emotional Tags
Zero-Shot Voice Cloning
π― Custom Voice Creation
Create custom voices without training data. Just provide a voice ID and Inworld handles the rest.
1
Enable Custom Voice
In your assistant configuration, select βCustomβ voice option
2
Enter Voice ID
Provide your custom voice identifier in the Custom Voice ID field
3
Test & Refine
Test with sample text and adjust based on results
Language Support & Quality
π Production-Ready Languages
Inworld provides native-quality voices for multiple languages with varying production readiness.
Language | Status | Voices Available | Quality Rating |
---|---|---|---|
English | π’ Production | 4+ voices | βββββ |
Spanish | π’ Production | 4+ voices | βββββ |
French | π’ Production | 4+ voices | ββββ |
German | π‘ Beta | 2+ voices | ββββ |
Chinese | π‘ Beta | 4+ voices | ββββ |
Japanese | π‘ Beta | 2+ voices | βββ |
Italian | π‘ Beta | 2+ voices | βββ |
Portuguese | π‘ Beta | 2+ voices | βββ |
Dutch | π‘ Beta | 2+ voices | βββ |
Korean | π‘ Beta | 2+ voices | βββ |
Polish | π‘ Beta | 2+ voices | βββ |
API Integration
Use Case Examples
Configuration Examples
ποΈ Optimal Settings
Recommended configurations for different use cases.
Phone Call Setup
Phone Call Setup
- Phone-compatible audio format
- Friendly, professional tone
- Emotional warmth in greeting
Multilingual Application
Multilingual Application
- Native Spanish voice
- Cultural appropriate emotions
- Higher quality audio format
Gaming/Interactive
Gaming/Interactive
- Maximum expression model
- Character-appropriate voice
- High-quality audio for immersion
Best Practices
π― Maximize Inworld's Emotional AI
Get the most out of Inworldβs unique features with these proven strategies.
Emotional Markup Guidelines
Emotional Markup Guidelines
Doβs:
- Use emotions that match the content context
- Place tags at natural speech boundaries
- Mix emotions sparingly for realistic conversations
- Test different voices with the same emotions
- Overuse emotional tags (max 2-3 per sentence)
- Use conflicting emotions close together
- Rely only on markup - the text itself should convey meaning
- Mix languages and emotions in complex ways
Voice Selection Strategy
Voice Selection Strategy
For Business Applications:
- English: Ashley, Alex, Aria
- Spanish: Lupita, Rafael
- French: Hélène, Mathieu
- Dramatic: Hades, Aria
- Friendly: Ashley, Diego
- Professional: Alex, Johanna
- Heroes: Aria, Alex
- Villains: Hades
- NPCs: Ashley, Diego
Performance Optimization
Performance Optimization
Pricing
π° Flexible Pricing Model
Pay-per-character with volume discounts and free tier for testing.
Plan | Characters/Month | Price | Features |
---|---|---|---|
Free | 25,000 | $0 | All voices, emotional markup |
Starter | 100,000 | $9 | Priority processing |
Professional | 500,000 | $29 | Custom voice support |
Enterprise | Custom | Custom | Dedicated support, SLA |
Troubleshooting
Emotional Markup Not Working
Emotional Markup Not Working
Problem: Emotions donβt seem to affect the voiceSolutions:
- Verify emotion tag spelling (case-sensitive)
- Check if the voice supports the specific emotion
- Try with TTS-1-Max model for stronger expression
- Ensure tags are properly formatted:
[emotion]
Language Detection Issues
Language Detection Issues
Problem: Wrong language pronunciationSolutions:
- Explicitly set language parameter in API call
- Use native voices for each language
- Avoid mixing languages in single requests
- Verify voice ID supports the target language
Custom Voice Problems
Custom Voice Problems
Problem: Custom voice ID not workingSolutions:
- Verify custom voice ID with Inworld support
- Check if voice is properly trained and available
- Use fallback to standard voices during setup
- Contact support for voice activation status
Migration Guide
Key Advantages of Switching:
- Emotional markup for better user engagement
- Native multilingual support (11 languages)
- Zero-shot voice cloning capabilities
- AI-powered context awareness
- Map existing voice preferences to Inworld voices
- Add emotional markup to enhance user experience
- Test multilingual capabilities if applicable
- Optimize for Inworldβs strengths (emotions, languages)
π Ready to Add Emotion to Your AI?
Configure Inworld.ai in your assistant settings and start creating more engaging, expressive conversations!