Path Parameters
assistant_id (integer, required): The unique identifier of the assistant to associate the cloned voice with.
Request Body
The request body is a JSON object containing the cloning configuration.
- voice_sample_id (integer, required): ID of the uploaded voice sample to clone from
- provider (string, required): TTS provider to use for voice cloning
  - Allowed values: elevenlabs, resemble, inworld
- name (string, required): Display name for the cloned voice
- description (string, optional): Description of the cloned voice
- language (string, optional): Language code (e.g., en, es, fr)
- enhance_quality (boolean, optional): Whether to apply quality enhancement. Default: true
- tags (array of strings, optional): Tags for organizing cloned voices
- metadata (object, optional): Additional provider-specific metadata
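The parameter list above can be checked on the client before a request is sent. The following is a minimal sketch, assuming a hypothetical helper named build_clone_payload; its checks mirror the documented fields and allowed values only, and are not a description of the server's own validation.

```python
import json

# Allowed provider values as listed in the request body parameters.
ALLOWED_PROVIDERS = {"elevenlabs", "resemble", "inworld"}


def build_clone_payload(voice_sample_id, provider, name, *, description=None,
                        language=None, enhance_quality=True, tags=None,
                        metadata=None):
    """Hypothetical helper: validate documented fields, return a JSON-ready dict."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(voice_sample_id, bool) or not isinstance(voice_sample_id, int):
        raise ValueError("voice_sample_id must be an integer")
    if provider not in ALLOWED_PROVIDERS:
        raise ValueError(f"provider must be one of {sorted(ALLOWED_PROVIDERS)}")
    if not isinstance(name, str) or not name.strip():
        raise ValueError("name must be a non-empty string")
    if tags is not None and not all(isinstance(t, str) for t in tags):
        raise ValueError("tags must be an array of strings")

    payload = {
        "voice_sample_id": voice_sample_id,
        "provider": provider,
        "name": name,
        "enhance_quality": bool(enhance_quality),  # documented default: true
    }
    # Optional fields are only included when the caller supplies them.
    optional = {"description": description, "language": language,
                "tags": tags, "metadata": metadata}
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


if __name__ == "__main__":
    # Illustrative values only.
    body = build_clone_payload(42, "elevenlabs", "Warm narrator",
                               language="en", tags=["demo"])
    print(json.dumps(body, indent=2))
```

Rejecting an unknown provider locally avoids a round trip that would likely end in a 400 Bad Request.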
Example Request
Request
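As an illustration of how the path parameter and JSON body fit together, the sketch below builds (but does not send) an HTTP request with Python's standard library. The host, the path template, the POST method, and all field values are assumptions made for this example; they are not taken from this page, so substitute the real endpoint and any authentication headers your account requires.

```python
import json
import urllib.request

# Hypothetical placeholders: replace with the real base URL and endpoint path.
BASE_URL = "https://api.example.com"
CLONE_PATH = "/assistants/{assistant_id}/clone-voice"


def make_clone_request(assistant_id, payload):
    """Build a request object carrying the cloning configuration as JSON."""
    url = BASE_URL + CLONE_PATH.format(assistant_id=assistant_id)
    data = json.dumps(payload).encode("utf-8")
    # POST is assumed here because the call creates a new cloned voice.
    return urllib.request.Request(
        url,
        data=data,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    # Illustrative values only.
    req = make_clone_request(123, {
        "voice_sample_id": 42,
        "provider": "elevenlabs",
        "name": "Warm narrator",
    })
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Passing the returned object to urllib.request.urlopen would send it; that step is left out so the sketch runs without network access.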
Response
A successful request returns the cloned voice object with its initial status.
Response Fields
- id (integer): Unique identifier for the cloned voice
- name (string): Display name of the cloned voice
- provider (string): TTS provider used for cloning
- provider_voice_id (string): Provider-specific voice identifier
- status (string): Current cloning status
- created_at (string): ISO 8601 timestamp of creation
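A client may want to turn the response fields into a typed object. The following is a minimal sketch, assuming a hypothetical ClonedVoice dataclass and parse_cloned_voice helper; the sample values are invented for illustration and are not an actual API response.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class ClonedVoice:
    """Hypothetical container mirroring the documented response fields."""
    id: int
    name: str
    provider: str
    provider_voice_id: str
    status: str
    created_at: datetime


def parse_cloned_voice(obj):
    """Convert a decoded JSON response (a dict) into a ClonedVoice."""
    raw = obj["created_at"]
    # datetime.fromisoformat rejects a trailing "Z" before Python 3.11,
    # so normalize it to an explicit UTC offset first.
    if raw.endswith("Z"):
        raw = raw[:-1] + "+00:00"
    return ClonedVoice(
        id=int(obj["id"]),
        name=obj["name"],
        provider=obj["provider"],
        provider_voice_id=obj["provider_voice_id"],
        status=obj["status"],
        created_at=datetime.fromisoformat(raw),
    )


if __name__ == "__main__":
    # Illustrative values only.
    voice = parse_cloned_voice({
        "id": 7,
        "name": "Warm narrator",
        "provider": "elevenlabs",
        "provider_voice_id": "abc123",
        "status": "processing",
        "created_at": "2024-01-15T10:30:00Z",
    })
    print(voice)
```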
Cloning Status
- processing: Voice cloning is in progress
- ready: Voice is successfully cloned and ready for use
- failed: Voice cloning failed; check the error details
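Because a new voice starts out in an initial status and later becomes ready or failed, a client typically polls until it sees one of those two terminal values. The following is a minimal sketch: wait_for_clone is a hypothetical helper, and fetch_status stands in for whatever call retrieves the current status, which this page does not describe.

```python
import time

# "ready" and "failed" are the two terminal statuses listed above.
TERMINAL_STATUSES = {"ready", "failed"}


def wait_for_clone(fetch_status, *, interval=2.0, timeout=60.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll fetch_status() until it returns a terminal status or time runs out.

    fetch_status is a caller-supplied function returning the status string.
    sleep and clock are injectable so the loop can be tested without waiting.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in TERMINAL_STATUSES:
            return status
        if status != "processing":
            raise ValueError(f"unexpected status: {status!r}")
        if clock() >= deadline:
            raise TimeoutError("voice cloning still processing after timeout")
        sleep(interval)


if __name__ == "__main__":
    # Simulated status sequence instead of a real API call.
    statuses = iter(["processing", "processing", "ready"])
    print(wait_for_clone(lambda: next(statuses), interval=0.01))
```

A returned value of failed is a normal result here, not an exception, so the caller can inspect the error details before deciding what to do.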
Provider-Specific Behavior
ElevenLabs
- Processing Time: Usually 5-30 seconds
- Instant Preview: Voice ready for immediate testing
- Voice ID: Returns ElevenLabs voice ID for direct API usage
- Quality: High-quality instant cloning from short samples
Resemble AI
- Processing Time: 2-10 minutes for training
- Professional Quality: Advanced training algorithms
- Voice UUID: Returns Resemble voice UUID
- Customization: Supports advanced voice customization options
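The processing times above suggest different waiting budgets per provider. The following is a minimal sketch, assuming a hypothetical poll_plan helper and an arbitrary safety factor of 2; only the two ranges stated above are encoded, and no figure is given here for inworld, so it is treated as unknown.

```python
# Typical processing ranges in seconds, taken from the provider notes above:
# ElevenLabs usually 5-30 seconds, Resemble AI 2-10 minutes.
EXPECTED_SECONDS = {
    "elevenlabs": (5, 30),
    "resemble": (120, 600),
}


def poll_plan(provider, safety_factor=2.0):
    """Return (first_check_delay, timeout) in seconds for a provider.

    The first status check is delayed until the low end of the range, and
    the timeout is the high end times safety_factor (an assumed margin).
    """
    try:
        low, high = EXPECTED_SECONDS[provider]
    except KeyError:
        raise ValueError(
            f"no documented processing time for {provider!r}") from None
    return float(low), high * safety_factor


if __name__ == "__main__":
    for provider in ("elevenlabs", "resemble"):
        print(provider, poll_plan(provider))
```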
Example Requests
ElevenLabs Voice Cloning
Resemble AI Voice Cloning
Error Responses
400 Bad Request - Invalid parameters:
Best Practices
- Provider Selection: Choose ElevenLabs for quick prototyping, Resemble for production quality
- Naming Convention: Use descriptive names that indicate voice characteristics
- Quality Settings: Enable quality enhancement for better results
- Monitoring: Check cloning status regularly for large-scale operations
- Error Handling: Implement retry logic for transient provider failures
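The retry advice above can be sketched as exponential backoff with jitter. TransientProviderError and with_retries are hypothetical names; which failures count as transient depends on the error responses you actually receive, so the caller decides when to raise that exception.

```python
import random
import time


class TransientProviderError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. provider timeouts)."""


def with_retries(operation, *, attempts=4, base_delay=1.0, max_delay=30.0,
                 sleep=time.sleep, jitter=random.random):
    """Call operation(), retrying on TransientProviderError with backoff.

    The delay doubles after each failed attempt, is capped at max_delay,
    and is scaled by a random factor in [0.5, 1.0) to spread out retries.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return operation()
        except TransientProviderError:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            delay = min(max_delay, base_delay * 2 ** attempt)
            sleep(delay * (0.5 + 0.5 * jitter()))


if __name__ == "__main__":
    calls = {"n": 0}

    def flaky_clone():
        # Simulated operation: fails twice, then succeeds.
        calls["n"] += 1
        if calls["n"] < 3:
            raise TransientProviderError("provider busy")
        return "ok"

    print(with_retries(flaky_clone, base_delay=0.01))
```

Errors that are not transient, such as a 400 Bad Request for invalid parameters, should not be wrapped in TransientProviderError, since repeating the same request will fail the same way.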