Kakaru
|
4be73d35d5
|
Merge 2b64032cdad29befe8ebcff078a0029c36d56b24 into 9ec3a60f30d228719e5ec6cd6796c5b2d888dd1a
|
2025-12-03 00:49:16 -08:00 |
|
RVC-Boss
|
9ec3a60f30
|
Update config.py
|
2025-12-01 20:23:49 +08:00 |
|
RVC-Boss
|
fc533b6fb7
|
Update fasterwhisper_asr.py
|
2025-12-01 11:38:37 +08:00 |
|
XXXXRT666
|
857799276c
|
Fix Modelscope (#2679)
|
2025-12-01 11:13:15 +08:00 |
|
Spr_Aachen
|
92d2d337fd
|
Fix training error caused by float type of default_batch_size parameter (#2662)
|
2025-11-28 22:53:43 +08:00 |
|
ChasonJiang
|
6fb441f65e
|
More user-friendly streaming mode options (#2678)
|
2025-11-28 22:13:48 +08:00 |
|
XXXXRT666
|
c85c54eca9
|
Add ModelScope Snapshot Download For ASR (#2627)
* Add ModelScope Snapshot Download For ASR
* Typo Fix
* Remove YUE in whisper
* Remove HF ENDPOINT
* Add FunASR Download
|
2025-11-28 22:10:49 +08:00 |
|
RVC-Boss
|
cb00840c4e
|
Add files via upload
|
2025-11-28 22:02:03 +08:00 |
|
wzy3650
|
60a4a214af
|
vq distributed training support (#2577)
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
|
2025-11-28 21:57:13 +08:00 |
|
zzz
|
6375bbe316
|
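Experiment with stream infer (#2469)
* Experiment with stream infer
* Plot the generated audio in the stream_infer script
* stream_infer: add an export section
* stream_infer: plots that make patterns easier to spot
* stream_infer: run a correlation search when concatenating audio to reduce fundamental-frequency breaks at the joins
* stream_infer: export `find_best_audio_offset_fast`
* stream_infer: improve the waveform display for easier comparison
* stream_v2pro.py: read arguments from the command line
* stream_v2pro.py: shorten the text used for export
* stream_v2pro: fix silent output caused by spec overflow when the spectrogram_torch input is half precision
* stream_v2pro: add a --lang argument to indicate the language of the reference text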
|
2025-11-28 21:36:57 +08:00 |
|
KamioRinn
|
e00ca92140
|
Fix ASMD (#2636)
|
2025-11-28 21:22:43 +08:00 |
|
ChasonJiang
|
92ab59c553
|
Finer-grained streaming inference mode (#2671)
* Better streaming inference mode
* Clean up unused code
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* modified: .gitignore
modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
modified: api_v2.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* Fix typos
* Support streaming inference with a fixed chunk length; optimize the SOLA algorithm
* Fix OGG format transmission issue in api_v2
|
2025-11-28 21:12:41 +08:00 |
|
KakaruHayate
|
2b64032cda
|
fix MCCL error on qy1
|
2025-10-21 18:01:24 +08:00 |
|
Kakaru
|
3a92c046f9
|
revert "rfft fallback to cpu"
|
2025-10-21 17:11:54 +08:00 |
|
Kakaru
|
917f73c38c
|
may fix RuntimeError: MUSA error: an illegal memory access was encountered
|
2025-10-20 20:44:34 +08:00 |
|
Kakaru
|
70a9243285
|
Merge pull request #3 from plae-tljg/musa_MUSA1009
solved subtle problem on import and syntax
|
2025-10-20 20:43:46 +08:00 |
|
plae-tljg
|
42abb66a32
|
solved subtle problem on import and syntax
|
2025-10-20 20:29:59 +08:00 |
|
KakaruHayate
|
06994f2a13
|
may fix RuntimeError: MUSA error: an illegal memory access was encountered
|
2025-10-19 16:49:13 +08:00 |
|
KakaruHayate
|
90141d2029
|
clean
|
2025-10-13 22:21:50 +08:00 |
|
KakaruHayate
|
fd8c860f49
|
clean
|
2025-10-13 22:14:23 +08:00 |
|
Kakaru
|
aada52050e
|
clean
|
2025-10-13 22:04:09 +08:00 |
|
KakaruHayate
|
47bb5a2cba
|
clean
|
2025-10-11 22:47:05 +08:00 |
|
KakaruHayate
|
1f91faf51e
|
del musa_utils from musa_accelerator
|
2025-10-11 22:36:58 +08:00 |
|
KakaruHayate
|
50db2e9199
|
support S1 train on MUSA
|
2025-10-11 22:33:32 +08:00 |
|
KakaruHayate
|
72be145051
|
Support on MUSA device.
fix
Update musa_utils.py
Update musa_utils.py
Update config.py
fix
rollback S1 train
DDP only support S4000
DDP only support S4000
fix
|
2025-10-11 15:00:54 +08:00 |
|
RVC-Boss
|
11aa78bd9b
|
Fix environment variables possibly not being str
|
2025-09-10 15:01:04 +08:00 |
|
XXXXRT666
|
fdf794e31d
|
Update WSL Rocm (#2561)
|
2025-08-02 17:47:15 +08:00 |
|
多玩幻灵qwq
|
0be59c8043
|
fix: correct links (#2539)
|
2025-07-19 00:29:48 +08:00 |
|
ChasonJiang
|
b5a67e6247
|
Fix GPT loss calculation (#2537)
* Fix GPT loss calculation
* fallback tts config
|
2025-07-18 14:59:59 +08:00 |
|
ChasonJiang
|
b9211657d8
|
Refine the TTS_Config code logic (#2536)
* Refine the TTS_Config code logic
* Save tts_config after loading the VITS weights
|
2025-07-18 11:54:40 +08:00 |
|
XXXXRT666
|
cefafee32c
|
Add Distil (#2531)
|
2025-07-17 20:28:25 +08:00 |
|
RVC-Boss
|
2d09bbe63a
|
Update tts_infer.yaml
|
2025-07-16 15:44:04 +08:00 |
|
RVC-Boss
|
4d8ebf8523
|
Update TTS.py
|
2025-07-16 15:43:26 +08:00 |
|
jiangsier-xyz
|
e476b01f30
|
Fix TTS.py failing to recognize the actually supported versions v2Pro and v2ProPlus (#2490)
Also update the default configuration.
Co-authored-by: jiangsier-xyz <jiangsier131@gmail.com>
|
2025-07-16 15:42:36 +08:00 |
|
RVC-Boss
|
42586e20f7
|
add RTF performance
|
2025-07-14 19:01:26 +08:00 |
|
RVC-Boss
|
85035f7ac0
|
add RTF performance
|
2025-07-14 18:56:22 +08:00 |
|
RVC-Boss
|
706bec74f8
|
Update assets.py
|
2025-07-11 16:11:08 +08:00 |
|
XXXXRT666
|
ec1218893e
|
Update Badge (#2518)
* Update README.md
* Update README.md
* Update Badges
* specify ranges
|
2025-07-11 16:10:07 +08:00 |
|
RVC-Boss
|
fec515dcce
|
Update Changelog_CN.md
|
2025-07-10 18:33:18 +08:00 |
|
RVC-Boss
|
426e1a2bb4
|
Raise the inference process priority
|
2025-07-10 18:16:45 +08:00 |
|
RVC-Boss
|
4e3c69043c
|
Update inference_webui.py
|
2025-07-10 18:16:24 +08:00 |
|
RVC-Boss
|
e63e0901fd
|
Update assets.py
|
2025-07-10 18:12:24 +08:00 |
|
RVC-Boss
|
97e37c74d8
|
Update README.md
|
2025-07-10 18:06:04 +08:00 |
|
RVC-Boss
|
3a75f5023f
|
Update README.md
|
2025-07-10 18:05:03 +08:00 |
|
RVC-Boss
|
0899b7e432
|
Update README.md
|
2025-07-10 17:59:49 +08:00 |
|
Yixiao Chen
|
8c579d46dd
|
Update export_torch_script.py (#2494)
Avoid dtype inconsistency when exporting
|
2025-07-02 22:48:28 +08:00 |
|
KamioRinn
|
6df61f58e4
|
Language segmentation and formatting improvements (#2488)
* better LangSegmenter
* add version num2str
* better version num2str
* sync fast infer
* sync api
* remove duplicate spaces
* remove unnecessary code
---------
Co-authored-by: RVC-Boss <129054828+RVC-Boss@users.noreply.github.com>
|
2025-06-27 11:58:41 +08:00 |
|
KamioRinn
|
90ebefa78f
|
make sure ort providers available (#2489)
|
2025-06-27 10:41:52 +08:00 |
|
XXXXRT666
|
4839e82148
|
Add Windows Install Powershell Scripts (#2487)
|
2025-06-27 01:04:18 +08:00 |
|
XXXXRT666
|
37f5abfcb4
|
Fix Issues with libstdcxx and conda sysroot (#2482)
|
2025-06-25 14:52:27 +08:00 |
|