Working with speakers and voices
Learn how to add, modify, or remove speakers in your project, and how to change the voices assigned to them.
Once you are finished editing, use generate project endpoint to apply changes to voiceover (re-dub project).
Minutes for re-dubbing are charged only for the modified and added segments, based on their combined length.
For example, if you modified two segments (text, timestamp, speaker, or speaker’s voice assigned to segment) that are 15 and 20 seconds long, and added a new segment that is 10 seconds long, their durations will be combined (15+20+10 = 45 seconds in total) before calculating the total minutes charged when regenerating the project.
The combined duration in seconds is then rounded up to the closest full minute for charges, for example:
- 0:01 to 0:59 = 1 minute
- 1:01 to 1:29 = 1 minute
- 1:30 to 1:59 = 2 minutes
This calculation is performed every time when generate project method is called.
You can select from variety of voice options for speakers in your project. In projects with destination languages, that support voice cloning, you can select between Instant Voice clone, Custom voice clone and AI voice presets. In other languages, you can choose any available AI voice preset for your Speaker.
Get available voice options for your project
Use your project_id
and get project voices endpoint to pull the list of all voices available for specified project
From the successful response, identify and obtain the voice_id
for the voices you wish to assign.
Change voice assigned to Speaker(s)
Use your project_id
with the Patch Project endpoint. Pass the voice array with key-value entries where:
- Key: The speaker identifier (e.g.,
SPEAKER_00
,SPEAKER_01
, etc.). - Value: The corresponding
voice_id
you want to assign.
Ensure that you pass all speakers’ voice assignments, including both modified and non-modified ones. If a speaker’s voice assignment is not included:
- The speaker will be removed if no segments are assigned to it.
- The request will return an error if segments are assigned but no voice assignment is provided.
Can I create Custom Voice clone via API?
Currently, creating a Custom voice clone is only available through our web platform. Once you’ve created the clone in the platform, you can retrieve its voice_id
using the API and assign it to a speaker.
How do I distinguish between Instant Voice clone, Custom Voice clones, and AI voice presets?
In languages where all these options are available, you can identify them based on the following attributes:
- Instant Voice clone
Instant Voice Clone option always has the following attributes:
- Custom voice clone
If you have created Custom voice clone in your platform, it will also be available for selection in supported languages. The parameters typically look like this:
- AI voice presets
AI Voice Presets are available in all languages and always include a specified gender: