ByteDance Volcano Voice Synthesis
Voice synthesis, or converting text into speech, has many excellent open-source solutions like GPT-SoVITS and ChatTTS, as well as free options like edge-tts. There are also commercial-grade services, such as ByteDance's Volcano voice synthesis. For free options, open-source tools are the best choice, but for higher quality, commercial services are more suitable. With the advancement of large models, prices are becoming increasingly affordable, making commercial APIs a great option for dubbing.
Starting from version 2.88, ByteDance's Volcano Engine voice synthesis service has been added. It supports dubbing in 8 languages: Chinese, English, Japanese, Portuguese, Spanish, Thai, Vietnamese, and Indonesian. For Chinese, it also supports various dialects like Northeastern and Sichuan accents. It offers 20,000 free requests, approximately enough to synthesize 10 hours of speech.
Supported Chinese Voice Styles
Only some Chinese voice styles are shown here. For the other 7 languages, check: https://www.volcengine.com/docs/6561/97465
There are many supported Chinese voice styles, including various dialects and popular voices like those used in Douyin for movie commentary, such as Xiao Shuai and Xiao Mei.
| Voice Name | voice_type |
|---|---|
| Can Can 2.0 | BV700_V2_streaming |
| Yang Yang | BV705_streaming |
| Sunny Youth | BV123_streaming |
| Anti-Roll Youth | BV120_streaming |
| General Son-in-Law | BV119_streaming |
| Ancient Elegant Lady | BV115_streaming |
| Dominant Uncle | BV107_streaming |
| Simple Youth | BV100_streaming |
| Gentle Lady | BV104_streaming |
| Cheerful Youth | BV004_streaming |
| Sweet Young Lady | BV113_streaming |
| Elegant Youth | BV102_streaming |
| Sweet Xiao Yuan | BV405_streaming |
| Friendly Female Voice | BV007_streaming |
| Intellectual Female Voice | BV009_streaming |
| Cheng Cheng | BV419_streaming |
| Tong Tong | BV415_streaming |
| Friendly Male Voice | BV008_streaming |
| Dubbed Film Male Voice | BV408_streaming |
| Lazy Little Sheep | BV426_streaming |
| Fresh Literary Female Voice | BV428_streaming |
| Inspirational Female Voice | BV403_streaming |
| Wise Elder | BV158_streaming |
| Loving Grandma | BV157_streaming |
| Rap Guy | BR001_streaming |
| Energetic Male Narrator | BV410_streaming |
| Movie Narrator Xiao Shuai | BV411_streaming |
| Xiao Shuai - Multi-Emotion | BV437_streaming |
| Movie Narrator Xiao Mei | BV412_streaming |
| Playboy Youth | BV159_streaming |
| Live Stream Queen | BV418_streaming |
| Anti-Roll Youth | BV120_streaming |
| Calm Male Narrator | BV142_streaming |
| Free-spirited Youth | BV143_streaming |
How to Activate
- First, register, log in, and complete real-name verification.
https://console.volcengine.com/
Open the link to register, log in, and complete the verification.
- After entering the console, open the Speech Technology page as shown below.

Alternatively, click this link to go directly: https://console.volcengine.com/speech/app
Then, create an application as shown below. Fill in the name and description as desired, but make sure to select "Voice Synthesis Service" and confirm to complete.

- Next, go to the voice synthesis page to activate the free trial.
Navigate to: https://console.volcengine.com/speech/service/8
At the top, select the application you just created and click "Trial" to activate.

- Copy the three parameters and fill them into the video translation software for use.
First is the cluster id. Copy the name under cluster id as shown.

Second is the App id. Scroll down on the same page to find it.

Third is the Access Token, located to the right of App id. Copy it.

- Fill these into the video translation software. Open Menu - TTS Settings - ByteDance Volcano Voice Synthesis window, enter the details, test to ensure no issues, and save.

Using It in Video Translation Software
After filling in and testing without issues, select the target language in the software, then choose ByteDance Volcano Voice Synthesis under the dubbing channel. You can click to preview each voice style.

Choose a satisfactory voice style to start the dubbing process.
Important Notes
If you activate the official version, only the "General Male" and "General Female" voices are available by default. Other voice styles need to be purchased and activated separately in the ByteDance Volcano backend.
