GPT-SoVITS_V4解压即用N卡整合版_250424

ainiderousi · 发表于 2025-5-15 23:58:01

大哥厉害

w19860613 · 发表于 2025-5-16 00:41:20

4解压即用N卡整合版

jerry · 发表于 2025-5-16 09:35:29

下载软件111111111111111

422611847 · 发表于 2025-5-16 13:36:27

感谢博主啦

lxxxx1 · 发表于 2025-5-16 17:41:54

666666666666

laughbest · 发表于 2025-5-17 11:32:36

非常感谢，好用！

laughbest · 发表于 2025-5-17 12:26:33

请问多音字应该怎么处理啊？谢谢！

我按如下方法操作，有些多音字发音正确，有些仍然错误：

1、添加多音字，修改 \GPT_SoVITS\text\g2pw 目录下的 polyphonic-fix.rep 文件并保存
2、删去缓存文件 polyphonic.pickle，把pycache也删了
3、关闭GPT-SoVITS，重新打开。

在 polyphonic-fix.rep 文件第一行添加了三行，
数数: ['shu3', 'shu4']
祇树给: ['qi2', 'shu4', 'ji3']
长公主: ['zhang3', 'gong1', 'zhu3']

合成语音后，
  [文本中出现两次“数数”，第一次正确；第二次错成 'shu4', 'shu4']
  [错成 gei3]
  [错成 chang3]

18075522628 · 发表于 2025-5-18 13:35:11

666666666666666666666

121281712 · 发表于 2025-5-18 21:30:52

这是什么，我素材3分钟多，分割音频后，也只得到了一个完整的3分多的音频，3连的时候出现以下代码：
Traceback (most recent call last):
  File "D:\GPT-SoVITS_V4_250424\GPT_SoVITS\prepare_datasets\1-get-text.py", line 95, in process
bert_feature = get_bert_feature(norm_text, word2ph)
  File "D:\GPT-SoVITS_V4_250424\GPT_SoVITS\prepare_datasets\1-get-text.py", line 73, in get_bert_feature
res = bert_model(**inputs, output_hidden_states=True)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1736, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1747, in _call_impl
return forward_call(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\transformers\models\bert\modeling_bert.py", line 1461, in forward
outputs = self.bert(
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1736, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1747, in _call_impl
return forward_call(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\transformers\models\bert\modeling_bert.py", line 1078, in forward
embedding_output = self.embeddings(
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1736, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\torch\nn\modules\module.py", line 1747, in _call_impl
return forward_call(*args, **kwargs)
  File "D:\GPT-SoVITS_V4_250424\env\lib\site-packages\transformers\models\bert\modeling_bert.py", line 217, in forward
embeddings += position_embeddings
RuntimeError: The size of tensor a (728) must match the size of tensor b (512) at non-singleton dimension 1

Traceback (most recent call last):
  File "D:\GPT-SoVITS_V4_250424\webui.py", line 1166, in open1abc
assert len("".join(opt)) > 0, process_info(process_name_1a, "failed")
AssertionError: 文本分词与特征提取失败
训练集格式化一键三连进程已终止
训练集格式化一键三连进程已终止

alyty · 发表于 2025-5-20 11:46:05

版主的作品很不错，支持下！期待能发表更多好作品!

		自动登录	找回密码
密码			立即注册