1037 Commits

Author SHA1 Message Date
Kakaru
4be73d35d5
Merge 2b64032cdad29befe8ebcff078a0029c36d56b24 into 9ec3a60f30d228719e5ec6cd6796c5b2d888dd1a 2025-12-03 00:49:16 -08:00
RVC-Boss
9ec3a60f30
Update config.py 2025-12-01 20:23:49 +08:00
RVC-Boss
fc533b6fb7
Update fasterwhisper_asr.py 2025-12-01 11:38:37 +08:00
XXXXRT666
857799276c
Fix Modelscope (#2679) 2025-12-01 11:13:15 +08:00
Spr_Aachen
92d2d337fd
Fix training error caused by float type of default_batch_size parameter (#2662) 2025-11-28 22:53:43 +08:00
ChasonJiang
6fb441f65e
更友好的流模式选项 (#2678) 2025-11-28 22:13:48 +08:00
XXXXRT666
c85c54eca9
Add ModelScope Snapshot Download For ASR (#2627)
* Add ModelScope Snapshot Download For ASR

* Typo Fix

* Remove YUE in whisper

* Remove HF ENDPOINT

* Add FunASR Download
2025-11-28 22:10:49 +08:00
RVC-Boss
cb00840c4e
Add files via upload 2025-11-28 22:02:03 +08:00
wzy3650
60a4a214af
vq distributed training support (#2577)
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
2025-11-28 21:57:13 +08:00
zzz
6375bbe316
尝试 stream infer (#2469)
* 尝试 stream infer

* 在 stream_infer 脚本中绘制生成的音频

* stream_infer 增加导出部分。

* stream_infer: 更方便找规律的图

* stream_infer: 在拼接音频时进行相关性搜索,减少拼接带来基频断裂的情况

* stream_infer: 导出 `find_best_audio_offset_fast`

* stream_infer: 优化波形显示,方便对比

* stream_v2pro.py 从命令行读取参数

* stream_v2pro.py 减少用于导出的文本长度

* stream_v2pro: 修复由于 spectrogram_torch 输入是 half 导致 spec 溢出最终没有声音的问题

* stream_v2pro: 新增 --lang 参数提示参考文字的语言类型
2025-11-28 21:36:57 +08:00
KamioRinn
e00ca92140
Fix ASMD (#2636) 2025-11-28 21:22:43 +08:00
ChasonJiang
92ab59c553
更细粒度的流式推理模式 (#2671)
* 更好的流式推理模式

* 清理无用代码

* modified:   GPT_SoVITS/AR/models/t2s_model.py
	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py

* modified:   GPT_SoVITS/TTS_infer_pack/TTS.py

* modified:   .gitignore
	modified:   GPT_SoVITS/AR/models/t2s_model.py
	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py

* modified:   GPT_SoVITS/AR/models/t2s_model.py
	modified:   GPT_SoVITS/TTS_infer_pack/TTS.py
	modified:   GPT_SoVITS/module/models.py
	modified:   api_v2.py

* modified:   GPT_SoVITS/TTS_infer_pack/TTS.py

* 更正拼写错误

* 支持固定chunk长度的流式推理,优化sola算法

* 修复api_v2的ogg格式传输问题
2025-11-28 21:12:41 +08:00
KakaruHayate
2b64032cda fix MCCL error on qy1 2025-10-21 18:01:24 +08:00
Kakaru
3a92c046f9
revert "rfft fallback to cpu" 2025-10-21 17:11:54 +08:00
Kakaru
917f73c38c
may fix RuntimeError: MUSA error: an illegal memory access was encountered 2025-10-20 20:44:34 +08:00
Kakaru
70a9243285
Merge pull request #3 from plae-tljg/musa_MUSA1009
solved subtle problem on import and syntax
2025-10-20 20:43:46 +08:00
plae-tljg
42abb66a32 solved subtle problem on import and syntax 2025-10-20 20:29:59 +08:00
KakaruHayate
06994f2a13 may fix RuntimeError: MUSA error: an illegal memory access was encountered 2025-10-19 16:49:13 +08:00
KakaruHayate
90141d2029 clean 2025-10-13 22:21:50 +08:00
KakaruHayate
fd8c860f49 clean 2025-10-13 22:14:23 +08:00
Kakaru
aada52050e
clean 2025-10-13 22:04:09 +08:00
KakaruHayate
47bb5a2cba clean 2025-10-11 22:47:05 +08:00
KakaruHayate
1f91faf51e del musa_utils from musa_accelerator 2025-10-11 22:36:58 +08:00
KakaruHayate
50db2e9199 support S1 train on MUSA 2025-10-11 22:33:32 +08:00
KakaruHayate
72be145051 Support on MUSA device.
fix

Update musa_utils.py

Update musa_utils.py

Update config.py

fix

rollback S1 train

DDP only support S4000

DDP only support S4000

fix
2025-10-11 15:00:54 +08:00
RVC-Boss
11aa78bd9b
修复环境变量可能不为str的问题
修复环境变量可能不为str的问题
2025-09-10 15:01:04 +08:00
XXXXRT666
fdf794e31d
Update WSL Rocm (#2561) 2025-08-02 17:47:15 +08:00
多玩幻灵qwq
0be59c8043
fix: 更正链接 (#2539) 2025-07-19 00:29:48 +08:00
ChasonJiang
b5a67e6247
修复gpt的loss计算问题 (#2537)
* 修复gpt的loss计算问题

* fallback tts config
2025-07-18 14:59:59 +08:00
ChasonJiang
b9211657d8
优化TTS_Config的代码逻辑 (#2536)
* 优化TTS_Config的代码逻辑

* 在载入vits权重之后保存tts_config
2025-07-18 11:54:40 +08:00
XXXXRT666
cefafee32c
Add Distil (#2531) 2025-07-17 20:28:25 +08:00
RVC-Boss
2d09bbe63a
Update tts_infer.yaml 2025-07-16 15:44:04 +08:00
RVC-Boss
4d8ebf8523
Update TTS.py 2025-07-16 15:43:26 +08:00
jiangsier-xyz
e476b01f30
解决 TTS.py 无法识别真正支持版本 v2Pro、v2ProPlus 的问题 (#2490)
同时更新一版默认配置。

Co-authored-by: jiangsier-xyz <jiangsier131@gmail.com>
2025-07-16 15:42:36 +08:00
RVC-Boss
42586e20f7
add RTF performence
add RTF performence
2025-07-14 19:01:26 +08:00
RVC-Boss
85035f7ac0
add RTF performence
add RTF performence
2025-07-14 18:56:22 +08:00
RVC-Boss
706bec74f8
Update assets.py 2025-07-11 16:11:08 +08:00
XXXXRT666
ec1218893e
Update Badge (#2518)
* Update README.md

* Update README.md

* Update Badges

* specify ranges
2025-07-11 16:10:07 +08:00
RVC-Boss
fec515dcce
Update Changelog_CN.md 2025-07-10 18:33:18 +08:00
RVC-Boss
426e1a2bb4
提升推理进程优先级 2025-07-10 18:16:45 +08:00
RVC-Boss
4e3c69043c
Update inference_webui.py 2025-07-10 18:16:24 +08:00
RVC-Boss
e63e0901fd
Update assets.py 2025-07-10 18:12:24 +08:00
RVC-Boss
97e37c74d8
Update README.md 2025-07-10 18:06:04 +08:00
RVC-Boss
3a75f5023f
Update README.md 2025-07-10 18:05:03 +08:00
RVC-Boss
0899b7e432
Update README.md 2025-07-10 17:59:49 +08:00
Yixiao Chen
8c579d46dd
Update export_torch_script.py (#2494)
Avoid dtype inconsistency when exporting
2025-07-02 22:48:28 +08:00
KamioRinn
6df61f58e4
语言分割及格式化优化 (#2488)
* better LangSegmenter

* add version num2str

* better version num2str

* sync fast infer

* sync api

* remove duplicate spaces

* remove unnecessary code

---------

Co-authored-by: RVC-Boss <129054828+RVC-Boss@users.noreply.github.com>
2025-06-27 11:58:41 +08:00
KamioRinn
90ebefa78f
make sure ort providers available (#2489) 2025-06-27 10:41:52 +08:00
XXXXRT666
4839e82148
Add Windows Install Powershell Scripts (#2487) 2025-06-27 01:04:18 +08:00
XXXXRT666
37f5abfcb4
Fix Issues with libstdcxx and conda sysroot (#2482) 2025-06-25 14:52:27 +08:00