Skip to content

ByteDance Volcano Engine Text-to-Speech

Text-to-speech (TTS) converts text into audio. There are many excellent open-source solutions, such as GPT-SoVITS and ChatTTS, as well as the free edge-tts. Of course, commercial services like ByteDance Volcano Engine's TTS are also available. If you want to use a free option, open-source solutions are the first choice, but for better quality, commercial services are more suitable. Especially with the development of large models, the price is getting lower and lower, making commercial APIs a good option for dubbing.

Version 2.88 and later include ByteDance Volcano Engine's TTS service, which supports dubbing in 8 languages including Chinese, English, Japanese, Portuguese, Spanish, Thai, Vietnamese, and Indonesian. In Chinese, it also supports dialects such as Northeastern Mandarin and Sichuanese. It offers 20,000 free requests, which can synthesize approximately 10 hours of speech.

Supported Chinese Voices

Only some Chinese voices are displayed. See https://www.volcengine.com/docs/6561/97465 for voices in the other 7 languages.

There are many supported Chinese voices, including various dialects and popular Douyin (TikTok) movie commentary voices.

Voice Namevoice_type
Can Can 2.0BV700_V2_streaming
Yang YangBV705_streaming
Sunny YouthBV123_streaming
Anti-Involution YouthBV120_streaming
Common Son-in-lawBV119_streaming
Ancient Style Young LadyBV115_streaming
Dominant Middle-Aged ManBV107_streaming
Simple YouthBV100_streaming
Gentle LadyBV104_streaming
Cheerful YouthBV004_streaming
Sweet Young LadyBV113_streaming
Refined YouthBV102_streaming
Sweet Xiao YuanBV405_streaming
Kind Female VoiceBV007_streaming
Intellectual Female VoiceBV009_streaming
Cheng ChengBV419_streaming
Tong TongBV415_streaming
Kind Male VoiceBV008_streaming
Dubbing Male VoiceBV408_streaming
Lazy SheepBV426_streaming
Fresh Literary Female VoiceBV428_streaming
Chicken Soup Female VoiceBV403_streaming
Wise Old ManBV158_streaming
Loving GrandmaBV157_streaming
Rapping GuyBR001_streaming
Energetic Commentary MaleBV410_streaming
Movie Commentary Xiao ShuaiBV411_streaming
Commentary Xiao Shuai - Multi-EmotionalBV437_streaming
Movie Commentary Xiao MeiBV412_streaming
纨绔青年BV159_streaming
直播一姐BV418_streaming
Anti-Involution YouthBV120_streaming
Calm Commentary MaleBV142_streaming
Handsome YouthBV143_streaming

How to Enable

  1. Of course, you must first register, log in, and complete real-name authentication.

https://console.volcengine.com/

Open this address to register, log in, and complete real-name authentication.

  1. After entering the console, as shown in the figure below, open the Speech Technology page.

image.png

You can also click this address to directly enter https://console.volcengine.com/speech/app

Then, as shown in the figure below, create an application. Fill in the name and introduction as you like, but be sure to select "Text-to-Speech Service" and then confirm.

image.png

  1. Next, enter the text-to-speech page to enable the free trial.

Go to https://console.volcengine.com/speech/service/8

Select the application you just created at the top, and click "Try" to enable it.

image.png

  1. Copy the 3 parameters, which can be filled in the video translation software.

The first is cluster id. Copy the name under cluster id as shown.

image.png

The second is App id. Scroll down on this page to see it.

image.png

The third is Access Token on the right side of App id. Copy it.

image.png

  1. Fill it into the video translation software. Open the Menu - TTS Settings - ByteDance Volcano Engine Text-to-Speech window to fill it in. Save it after testing without problems.

image.png

Using it in Video Translation Software

After filling in the information and testing without problems, first select the target language in the software, and then select ByteDance Volcano Engine Text-to-Speech in the dubbing channel. You can click to listen to each voice.

image.png

Select a satisfactory voice to start the dubbing operation.

Special Note

If you have enabled the official version, only the General Male and General Female roles are available by default. Other roles need to be purchased and enabled separately in the ByteDance Volcano Engine backend.