From 7fce76b5048aae1d1689b6c5241f80593917cdc8 Mon Sep 17 00:00:00 2001 From: XXXXRT666 <157766680+XXXXRT666@users.noreply.github.com> Date: Wed, 20 Mar 2024 15:48:15 +0000 Subject: [PATCH] Update README.md --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index c53ab766..f02b78ca 100644 --- a/README.md +++ b/README.md @@ -143,6 +143,8 @@ Users in China region can download these two models by entering the links below For Chinese ASR (additionally), download models from [Damo ASR Model](https://modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/files), [Damo VAD Model](https://modelscope.cn/models/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch/files), and [Damo Punc Model](https://modelscope.cn/models/damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch/files) and place them in `tools/damo_asr/models`. +For English or Japanese ASR (additionally), download models from [Faster Whisper Large V3 model](https://huggingface.co/Systran/faster-whisper-large-v3) and place them in `tools/damo_asr/models`. Also, [other models](https://huggingface.co/Systran) may have the similar effect with smaller disk footprint. + ## Dataset Format The TTS annotation .list file format: