baicai-1145
|
30a4557d8d
|
Implement last inference statistics tracking in Text2SemanticDecoder and enhance TTS class with prompt semantic extraction. This includes methods for setting and retrieving inference stats, as well as improvements to audio processing and feature extraction in TTS.
|
2026-03-08 23:08:27 +08:00 |
|
baicai-1145
|
b250e62402
|
Enhance G2PW model input handling by introducing polyphonic context character support and updating the data preparation method to return additional query IDs. This improves the processing of polyphonic characters in sentences.
|
2026-03-08 03:01:20 +08:00 |
|
baicai-1145
|
800acd45ff
|
Enhance G2P processing by implementing batch input handling in _g2p function, improving efficiency. Update prepare_onnx_input to utilize caching for tokenization and add optional parameters for character ID mapping and phoneme masks. Refactor G2PWOnnxConverter to streamline model loading and configuration management.
|
2026-03-07 05:47:22 +08:00 |
|
XXXXRT666
|
2d9193b0d3
|
Migrate to miniforge, add missing dependencies, update docker file, remove deprecated files (#2732)
* Migrate to miniforge, add missing dependencies, update docker file, remove deprecated files
* Add Env Vars and Secrets
|
2026-02-09 15:05:25 +08:00 |
|
Oarora
|
9986880b3f
|
fix Conda 条款未同意导致的构建失败 (#2727)
|
2026-02-08 23:52:04 +08:00 |
|
ChasonJiang
|
c767f0b83b
|
修复bug (#2704)
* 修复bug
* fallbak and bug fix
|
2025-12-30 16:00:21 +08:00 |
|
ChasonJiang
|
9080a967d5
|
修复采样错误 (#2703)
|
2025-12-30 15:21:03 +08:00 |
|
sushistack
|
51df9f7384
|
Fix model file name in README instructions (#2700)
|
2025-12-25 16:44:21 +08:00 |
|
ChasonJiang
|
bfca0f6b2d
|
对齐naive_infer的解码策略,防止吞句 (#2697)
|
2025-12-19 17:37:19 +08:00 |
|
ChasonJiang
|
abe984395c
|
对齐gpt topk默认采样参数 (#2696)
|
2025-12-19 16:05:36 +08:00 |
|
RVC-Boss
|
cc89c3660e
|
Update requirements.txt
|
2025-12-19 15:54:54 +08:00 |
|
ChasonJiang
|
36b3231c6f
|
bug fix (#2689)
|
2025-12-15 14:23:06 +08:00 |
|
RVC-Boss
|
9ec3a60f30
|
Update config.py
|
2025-12-01 20:23:49 +08:00 |
|
RVC-Boss
|
fc533b6fb7
|
Update fasterwhisper_asr.py
|
2025-12-01 11:38:37 +08:00 |
|
XXXXRT666
|
857799276c
|
Fix Modelscope (#2679)
|
2025-12-01 11:13:15 +08:00 |
|
Spr_Aachen
|
92d2d337fd
|
Fix training error caused by float type of default_batch_size parameter (#2662)
|
2025-11-28 22:53:43 +08:00 |
|
ChasonJiang
|
6fb441f65e
|
更友好的流模式选项 (#2678)
|
2025-11-28 22:13:48 +08:00 |
|
XXXXRT666
|
c85c54eca9
|
Add ModelScope Snapshot Download For ASR (#2627)
* Add ModelScope Snapshot Download For ASR
* Typo Fix
* Remove YUE in whisper
* Remove HF ENDPOINT
* Add FunASR Download
|
2025-11-28 22:10:49 +08:00 |
|
RVC-Boss
|
cb00840c4e
|
Add files via upload
|
2025-11-28 22:02:03 +08:00 |
|
wzy3650
|
60a4a214af
|
vq distributed training support (#2577)
Co-authored-by: wangzeyuan <wangzeyuan@agora.io>
|
2025-11-28 21:57:13 +08:00 |
|
zzz
|
6375bbe316
|
尝试 stream infer (#2469)
* 尝试 stream infer
* 在 stream_infer 脚本中绘制生成的音频
* stream_infer 增加导出部分。
* stream_infer: 更方便找规律的图
* stream_infer: 在拼接音频时进行相关性搜索,减少拼接带来基频断裂的情况
* stream_infer: 导出 `find_best_audio_offset_fast`
* stream_infer: 优化波形显示,方便对比
* stream_v2pro.py 从命令行读取参数
* stream_v2pro.py 减少用于导出的文本长度
* stream_v2pro: 修复由于 spectrogram_torch 输入是 half 导致 spec 溢出最终没有声音的问题
* stream_v2pro: 新增 --lang 参数提示参考文字的语言类型
|
2025-11-28 21:36:57 +08:00 |
|
KamioRinn
|
e00ca92140
|
Fix ASMD (#2636)
|
2025-11-28 21:22:43 +08:00 |
|
ChasonJiang
|
92ab59c553
|
更细粒度的流式推理模式 (#2671)
* 更好的流式推理模式
* 清理无用代码
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* modified: .gitignore
modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
* modified: GPT_SoVITS/AR/models/t2s_model.py
modified: GPT_SoVITS/TTS_infer_pack/TTS.py
modified: GPT_SoVITS/module/models.py
modified: api_v2.py
* modified: GPT_SoVITS/TTS_infer_pack/TTS.py
* 更正拼写错误
* 支持固定chunk长度的流式推理,优化sola算法
* 修复api_v2的ogg格式传输问题
|
2025-11-28 21:12:41 +08:00 |
|
RVC-Boss
|
11aa78bd9b
|
修复环境变量可能不为str的问题
修复环境变量可能不为str的问题
|
2025-09-10 15:01:04 +08:00 |
|
XXXXRT666
|
fdf794e31d
|
Update WSL Rocm (#2561)
|
2025-08-02 17:47:15 +08:00 |
|
多玩幻灵qwq
|
0be59c8043
|
fix: 更正链接 (#2539)
|
2025-07-19 00:29:48 +08:00 |
|
ChasonJiang
|
b5a67e6247
|
修复gpt的loss计算问题 (#2537)
* 修复gpt的loss计算问题
* fallback tts config
|
2025-07-18 14:59:59 +08:00 |
|
ChasonJiang
|
b9211657d8
|
优化TTS_Config的代码逻辑 (#2536)
* 优化TTS_Config的代码逻辑
* 在载入vits权重之后保存tts_config
|
2025-07-18 11:54:40 +08:00 |
|
XXXXRT666
|
cefafee32c
|
Add Distil (#2531)
|
2025-07-17 20:28:25 +08:00 |
|
RVC-Boss
|
2d09bbe63a
|
Update tts_infer.yaml
|
2025-07-16 15:44:04 +08:00 |
|
RVC-Boss
|
4d8ebf8523
|
Update TTS.py
|
2025-07-16 15:43:26 +08:00 |
|
jiangsier-xyz
|
e476b01f30
|
解决 TTS.py 无法识别真正支持版本 v2Pro、v2ProPlus 的问题 (#2490)
同时更新一版默认配置。
Co-authored-by: jiangsier-xyz <jiangsier131@gmail.com>
|
2025-07-16 15:42:36 +08:00 |
|
RVC-Boss
|
42586e20f7
|
add RTF performence
add RTF performence
|
2025-07-14 19:01:26 +08:00 |
|
RVC-Boss
|
85035f7ac0
|
add RTF performence
add RTF performence
|
2025-07-14 18:56:22 +08:00 |
|
RVC-Boss
|
706bec74f8
|
Update assets.py
|
2025-07-11 16:11:08 +08:00 |
|
XXXXRT666
|
ec1218893e
|
Update Badge (#2518)
* Update README.md
* Update README.md
* Update Badges
* specify ranges
|
2025-07-11 16:10:07 +08:00 |
|
RVC-Boss
|
fec515dcce
|
Update Changelog_CN.md
|
2025-07-10 18:33:18 +08:00 |
|
RVC-Boss
|
426e1a2bb4
|
提升推理进程优先级
|
2025-07-10 18:16:45 +08:00 |
|
RVC-Boss
|
4e3c69043c
|
Update inference_webui.py
|
2025-07-10 18:16:24 +08:00 |
|
RVC-Boss
|
e63e0901fd
|
Update assets.py
|
2025-07-10 18:12:24 +08:00 |
|
RVC-Boss
|
97e37c74d8
|
Update README.md
|
2025-07-10 18:06:04 +08:00 |
|
RVC-Boss
|
3a75f5023f
|
Update README.md
|
2025-07-10 18:05:03 +08:00 |
|
RVC-Boss
|
0899b7e432
|
Update README.md
|
2025-07-10 17:59:49 +08:00 |
|
Yixiao Chen
|
8c579d46dd
|
Update export_torch_script.py (#2494)
Avoid dtype inconsistency when exporting
|
2025-07-02 22:48:28 +08:00 |
|
KamioRinn
|
6df61f58e4
|
语言分割及格式化优化 (#2488)
* better LangSegmenter
* add version num2str
* better version num2str
* sync fast infer
* sync api
* remove duplicate spaces
* remove unnecessary code
---------
Co-authored-by: RVC-Boss <129054828+RVC-Boss@users.noreply.github.com>
|
2025-06-27 11:58:41 +08:00 |
|
KamioRinn
|
90ebefa78f
|
make sure ort providers available (#2489)
|
2025-06-27 10:41:52 +08:00 |
|
XXXXRT666
|
4839e82148
|
Add Windows Install Powershell Scripts (#2487)
|
2025-06-27 01:04:18 +08:00 |
|
XXXXRT666
|
37f5abfcb4
|
Fix Issues with libstdcxx and conda sysroot (#2482)
|
2025-06-25 14:52:27 +08:00 |
|
Ella Zhang
|
4987df5a71
|
fixed syntax errors in api_v2.py (#2473)
|
2025-06-19 15:34:11 +08:00 |
|
XXXXRT666
|
d46c069e52
|
Remove Debug Code (#2471)
|
2025-06-18 10:38:54 +08:00 |
|