Working with speakers and voices

Once you are finished editing, use generate project endpoint to apply changes to voiceover (re-dub project).

Minutes for re-dubbing are charged only for the modified and added segments, based on their combined length.For example, if you modified two segments (text, timestamp, speaker, or speaker’s voice assigned to segment) that are 15 and 20 seconds long, and added a new segment that is 10 seconds long, their durations will be combined (15+20+10 = 45 seconds in total) before calculating the total minutes charged when regenerating the project.The combined duration in seconds is then rounded up to the closest full minute for charges, for example:

0:01 to 0:59 = 1 minute
1:01 to 1:29 = 1 minute
1:30 to 1:59 = 2 minutes

This calculation is performed every time when generate project method is called.

You can select from variety of voice options for speakers in your project. In projects with destination languages, that support voice cloning, you can select between Instant Voice clone, Custom voice clone and AI voice presets. In other languages, you can choose any available AI voice preset for your Speaker.

Get available voice options for your project

Use your project_id and get project voices endpoint to pull the list of all voices available for specified projectFrom the successful response, identify and obtain the voice_id for the voices you wish to assign.

Change voice assigned to Speaker(s)

Use your project_id with the Patch Project endpoint. Pass the voice array with key-value entries where:

Key: The speaker identifier (e.g., SPEAKER_00, SPEAKER_01, etc.).
Value: The corresponding voice_id you want to assign.

Ensure that you pass all speakers’ voice assignments, including both modified and non-modified ones. If a speaker’s voice assignment is not included:

The speaker will be removed if no segments are assigned to it.
The request will return an error if segments are assigned but no voice assignment is provided.

Can I create Custom Voice clone via API?

Currently, creating a Custom voice clone is only available through our web platform. Once you’ve created the clone in the platform, you can retrieve its voice_id using the API and assign it to a speaker.

How do I distinguish between Instant Voice clone, Custom Voice clones, and AI voice presets?

In languages where all these options are available, you can identify them based on the following attributes:

Instant Voice clone

Instant Voice Clone option always has the following attributes:

"label": "Voice Clone",
"sample_src": null,
"gender": "auto"

Custom voice clone

If you have created Custom voice clone in your platform, it will also be available for selection in supported languages. The parameters typically look like this:

"label": "Your name",
"sample_src": "https://sample_url",
"gender": "auto"

AI voice presets

AI Voice Presets are available in all languages and always include a specified gender:

"label": "Preset name",
"sample_src": "https://sample_url",
"gender": "(fe)male"

Get Started

Languages list

Workflow

API documentation

Resources

Media

Project

Transcription

Glossary

Lipsync

User

Working with speakers and voices

Can I create Custom Voice clone via API?

How do I distinguish between Instant Voice clone, Custom Voice clones, and AI voice presets?

Get Started

Languages list

Workflow

API documentation

Resources

Media

Project

Transcription

Glossary

Lipsync

User

​Can I create Custom Voice clone via API?

​How do I distinguish between Instant Voice clone, Custom Voice clones, and AI voice presets?

Can I create Custom Voice clone via API?

How do I distinguish between Instant Voice clone, Custom Voice clones, and AI voice presets?