RVC-Boss
02425ea256
Fixed issues such as missing imports for types like Optional.
...
Fixed issues such as missing imports for types like `Optional`.
2026-04-18 17:33:53 +08:00
Harikrishna KP
938f05fce8
fix: correct torch.randint upper bound to include both values ( #2733 )
2026-04-18 17:19:55 +08:00
huang yutong
445d18ccce
fix: 修复 TTS 音频后处理中的多个缺陷 ( #2753 )
...
1. 修复音频超采样时 int16 双重转换导致整数溢出(CRITICAL)
- audio_postprocess 中 `audio = (audio * 32768).astype(np.int16)` 位于
if/else 块之外无条件执行,当 super_sampling=True 时音频已在分支内
转为 int16,再次乘以 32768 导致溢出和音频完全失真
- 同时修复 super_sampling=True 但超分模型不存在时 torch.Tensor 调用
.astype() 的 AttributeError
2. 修复 batched vocoder 推理中 padding_len=0 导致音频丢失(HIGH)
- 当 padding_len 恰好为 0 时,`-0 * upsample_rate == 0`,切片
`audio[x:0]` 返回空张量,导致整段音频丢失
3. 修复文件不存在时错误地抛出 FileExistsError(LOW)
- 应为 FileNotFoundError
Made-with: Cursor
2026-04-18 17:16:24 +08:00
Mushroomcowisheggs
00ce973412
feat: 添加数据集的错误处理提示 ( #2758 )
...
Co-authored-by: moomushroom <107208254+moomushroom@users.noreply.github.com>
2026-04-18 17:13:30 +08:00
huang yutong
14191901cd
fix: 修复多个模块中的独立 bug ( #2755 )
...
1. 修复 sync_buffer 中除以函数对象而非调用结果(distrib.py)
- `buffer.data /= world_size` 中 world_size 是函数,缺少 (),
导致 TypeError 使分布式训练 buffer 同步失败
2. 修复 istft 函数缺少 return 语句(spec_utils.py)
- 函数计算了结果但未返回,调用者始终得到 None
3. 修复 cut0 返回字面量 "/n" 而非换行符 "\n"(text_segmentation_method.py)
- 导致后续 text.split("\n") 无法正确切分,字面 /n 被当作文本内容
4. 修复粤语 ASR 的 vad/punc model_revision 被无条件覆盖(funasr_asr.py)
- 粤语分支将 vad_model_revision 设为空(因不使用 VAD/标点模型),
但 if/else 外的赋值将其覆盖为 "v2.0.4",传入错误的 revision 参数
Made-with: Cursor
2026-04-18 17:10:56 +08:00
东云
780383d5bd
[codex] Improve Windows single-GPU v3 LoRA training / 改进 Windows 单卡 v3 LoRA 训练流程 ( #2767 )
...
* Improve Windows single-GPU v3 LoRA training
* Drop unrelated checkpoint helper change from PR
* Tighten PR scope to single-GPU training path fixes
2026-04-18 16:54:26 +08:00
白菜工厂1145号员工
ba8de9b760
优化 G2PW 的推理输入构造与多音字处理流程,减少重复计算,降低长句场景下的推理开销 ( #2763 )
...
* Enhance G2P processing by implementing batch input handling in _g2p function, improving efficiency. Update prepare_onnx_input to utilize caching for tokenization and add optional parameters for character ID mapping and phoneme masks. Refactor G2PWOnnxConverter to streamline model loading and configuration management.
* Enhance G2PW model input handling by introducing polyphonic context character support and updating the data preparation method to return additional query IDs. This improves the processing of polyphonic characters in sentences.
2026-04-18 16:52:32 +08:00
ChasonJiang
c767f0b83b
修复bug ( #2704 )
...
* 修复bug
* fallbak and bug fix
2025-12-30 16:00:21 +08:00
ChasonJiang
9080a967d5
修复采样错误 ( #2703 )
2025-12-30 15:21:03 +08:00
ChasonJiang
bfca0f6b2d
对齐naive_infer的解码策略,防止吞句 ( #2697 )
2025-12-19 17:37:19 +08:00
ChasonJiang
abe984395c
对齐gpt topk默认采样参数 ( #2696 )
2025-12-19 16:05:36 +08:00
ChasonJiang
36b3231c6f
bug fix ( #2689 )
2025-12-15 14:23:06 +08:00
RVC-Boss
cb00840c4e
Add files via upload
2025-11-28 22:02:03 +08:00
wzy3650
60a4a214af
vq distributed training support ( #2577 )
...
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
2025-11-28 21:57:13 +08:00
zzz
6375bbe316
尝试 stream infer ( #2469 )
...
* 尝试 stream infer
* 在 stream_infer 脚本中绘制生成的音频
* stream_infer 增加导出部分。
* stream_infer: 更方便找规律的图
* stream_infer: 在拼接音频时进行相关性搜索,减少拼接带来基频断裂的情况
* stream_infer: 导出 `find_best_audio_offset_fast`
* stream_infer: 优化波形显示,方便对比
* stream_v2pro.py 从命令行读取参数
* stream_v2pro.py 减少用于导出的文本长度
* stream_v2pro: 修复由于 spectrogram_torch 输入是 half 导致 spec 溢出最终没有声音的问题
* stream_v2pro: 新增 --lang 参数提示参考文字的语言类型
2025-11-28 21:36:57 +08:00
KamioRinn
e00ca92140
Fix ASMD ( #2636 )
2025-11-28 21:22:43 +08:00
ChasonJiang
92ab59c553
更细粒度的流式推理模式 ( #2671 )
...
* 更好的流式推理模式
* 清理无用代码
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* modified: .gitignore
modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
modified: api_v2.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* 更正拼写错误
* 支持固定chunk长度的流式推理,优化sola算法
* 修复api_v2的ogg格式传输问题
2025-11-28 21:12:41 +08:00
ChasonJiang
b5a67e6247
修复gpt的loss计算问题 ( #2537 )
...
* 修复gpt的loss计算问题
* fallback tts config
2025-07-18 14:59:59 +08:00
ChasonJiang
b9211657d8
优化TTS_Config的代码逻辑 ( #2536 )
...
* 优化TTS_Config的代码逻辑
* 在载入vits权重之后保存tts_config
2025-07-18 11:54:40 +08:00
RVC-Boss
2d09bbe63a
Update tts_infer.yaml
2025-07-16 15:44:04 +08:00
RVC-Boss
4d8ebf8523
Update TTS.py
2025-07-16 15:43:26 +08:00
jiangsier-xyz
e476b01f30
解决 TTS.py 无法识别真正支持版本 v2Pro、v2ProPlus 的问题 ( #2490 )
...
同时更新一版默认配置。
Co-authored-by: jiangsier-xyz <jiangsier131@gmail.com>
2025-07-16 15:42:36 +08:00
RVC-Boss
426e1a2bb4
提升推理进程优先级
2025-07-10 18:16:45 +08:00
RVC-Boss
4e3c69043c
Update inference_webui.py
2025-07-10 18:16:24 +08:00
Yixiao Chen
8c579d46dd
Update export_torch_script.py ( #2494 )
...
Avoid dtype inconsistency when exporting
2025-07-02 22:48:28 +08:00
KamioRinn
6df61f58e4
语言分割及格式化优化 ( #2488 )
...
* better LangSegmenter
* add version num2str
* better version num2str
* sync fast infer
* sync api
* remove duplicate spaces
* remove unnecessary code
---------
Co-authored-by: RVC-Boss <129054828+RVC-Boss@users.noreply.github.com>
2025-06-27 11:58:41 +08:00
KamioRinn
90ebefa78f
make sure ort providers available ( #2489 )
2025-06-27 10:41:52 +08:00
XXXXRT666
6fdc67ca83
Fix bugs in install.sh, reduce log noise, and improve error reporting ( #2464 )
...
* Update Install.sh
* Format Code
* Delete dev null
* Update README, Support Dark Mode in CSS/JS
2025-06-17 15:21:36 +08:00
zzz
7dec5f5bb0
Merge pull request #2460 from L-jasmine/export_v2pro
...
优化 torch_script 导出模型
2025-06-13 22:10:11 +08:00
csh
5c91e66d2e
export_torch_script.py support v2Pro & v2ProPlus
2025-06-12 21:53:14 +08:00
RVC-Boss
ed89a02337
修复“修复ge.sum数值可能爆炸的”可能导致的训练爆炸的问题
...
修复“修复ge.sum数值可能爆炸的”可能导致的训练爆炸的问题
2025-06-11 23:14:52 +08:00
RVC-Boss
cd6de7398e
Merge pull request #2449 from KamioRinn/maga
...
support v4 v2Pro v2ProPlus for api & optimize LangSegmenter
2025-06-11 10:29:39 +08:00
YYuX-1145
dd2b9253aa
Update TTS.py ( #2450 )
2025-06-11 10:28:42 +08:00
KamioRinn
746cb536c6
Fix LangSegmenter
2025-06-10 19:18:05 +08:00
Emmanuel Ferdman
0d2f273402
Resolve Python Logger warnings ( #2379 )
...
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-06-10 18:03:23 +08:00
RVC-Boss
8056efe4ab
修复ge.sum数值可能爆炸问题
...
修复ge.sum数值可能爆炸问题
2025-06-09 23:53:16 +08:00
wzy3650
d6b78c927a
fix configs error ( #2439 )
...
* fix configs error
* fix configs error
---------
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
Co-authored-by: wangzeyuan <wangzeyuan@shengwang.cn>
2025-06-09 11:25:55 +08:00
SapphireLab
d7c2210da8
Update Documentation ( #2436 )
...
* docs(Changelog_CN): Reformat the Changlog_CN before 2024.08
* docs(README): Update Multi-Language README
* docs(Changelog_CN): Separate links and content
* docs(Changelog_CN): fix missing issue.
* docs(Changelog_EN): Update Changelog_EN to date
* docs(Changelog_EN): fix typo
* docs(Changelog_JA): Update Changelog_JA to date
* docs(Changelog_KO): Update Changelog_KO to date
* docs(Changelog_TR): Update Changelog_TR to date
* docs(i18n): Update Multi-Language i18n JSON
2025-06-06 10:30:17 +08:00
wzy3650
ab53062bdd
fix _merge_yi crash ( #2432 )
...
* fix _merge_yi crash
* fix _merge_yi crash
---------
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
2025-06-06 10:25:41 +08:00
XXXXRT666
132f6e7b8b
Fix Bugs, Modified Layout ( #2434 )
...
Co-authored-by: RVC-Boss <129054828+RVC-Boss@users.noreply.github.com>
2025-06-05 18:37:19 +08:00
Jialiang Zhu
035dcbad03
Fix AttributeError when prompt_cache['refer_spec'][0] is a tuple ( #2428 )
...
Co-authored-by: tzrain <tz_rain@foxmail.com>
2025-06-05 10:55:21 +08:00
RVC-Boss
a080e19f91
去除不需要的告警AttributeError: module 'onnxruntime' has no attribute 'preload_dlls'
...
去除不需要的告警AttributeError: module 'onnxruntime' has no attribute 'preload_dlls'
2025-06-05 10:48:50 +08:00
RVC-Boss
3fcffb2e95
fix v3v4 resample function
...
fix v3v4 resample function
2025-06-05 10:47:32 +08:00
RVC-Boss
298ebb03c5
fix sv path
2025-06-04 18:05:57 +08:00
zzz
6d12a6a6cb
添加导出 v4 的部分 ( #2417 )
...
* feat: 添加导出v4的script
* 改名 export_torch_script_v3.py 为 export_torch_script_v3v4.py
* export_torch_script_v3v4 中优化函数名称和参数
2025-06-04 15:50:16 +08:00
RVC-Boss
e909c93c63
support sovits v2Pro v2ProPlus
...
support sovits v2Pro v2ProPlus
2025-06-04 15:47:40 +08:00
RVC-Boss
584fcae9a5
support sovits v2Pro v2ProPlus
...
support sovits v2Pro v2ProPlus
2025-06-04 15:25:52 +08:00
RVC-Boss
ad158b0f50
support sovits v2Pro v2ProPlus
...
support sovits v2Pro v2ProPlus
2025-06-04 15:20:04 +08:00
RVC-Boss
c920261d6a
support sovits v2Pro v2ProPlus
...
support sovits v2Pro v2ProPlus
2025-06-04 15:19:47 +08:00
RVC-Boss
92819d0b31
support sovits v2Pro v2ProPlus
...
support sovits v2Pro v2ProPlus
2025-06-04 15:19:20 +08:00