FunASR Chinese Speech Recognition:
FunASR is an open-source speech recognition toolkit from Alibaba. It performs better than the Whisper series in Chinese speech scenarios. Video translation software supports FunASR via HTTP calls using the zh_recogn
and SenseVoice
projects. Simply deploy the corresponding zh_recogn
and SenseVoice
integration packages, start them, and enter the API address in the video translation software to use it.
However, many users are still confused about this process. Therefore, starting from version v2.97, this functionality has been integrated into the video translation software. This means you no longer need to additionally deploy and start the zh_recogn
and SenseVoice
projects. You can directly select FunASR Chinese Recognition in the software to use it.
Select FunASR Chinese in Speech Recognition
After selecting FunASR Chinese Recognition in Speech Recognition, you can choose to use either the paraformer-zh model or the SenseVoiceSmall model. It is recommended to choose the former, as it offers better performance and speed than the latter.
Online Download of FunASR Chinese Recognition Model on First Use
To avoid an excessively large package size, the FunASR models are not integrated within the software package. The first time you use it, it will automatically download from modelscope.cn. After downloading, it will be saved in the models
folder under the software directory, inside the hub
folder. Depending on network conditions, this may take a few minutes to tens of minutes or even longer. As long as there are no red error messages, patiently wait for the download to complete.