diff --git a/README.md b/README.md index efa67dc..d1b97d9 100644 --- a/README.md +++ b/README.md @@ -38,13 +38,21 @@ https://github.com/RVC-Boss/GPT-SoVITS/assets/129054828/05bee1fa-bdd8-4d85-9350- Tested with Python 3.9, PyTorch 2.0.1, and CUDA 11. -### Pip Packages +### Quick Install with Conda ```bash -pip install torch numpy scipy tensorboard librosa==0.9.2 numba==0.56.4 pytorch-lightning gradio==3.14.0 ffmpeg-python onnxruntime tqdm==4.59.0 cn2an pypinyin pyopenjtalk g2p_en +conda create -n GPTSoVits python=3.9 +conda activate GPTSoVits +bash install.sh +``` +### Install Manually +#### Pip Packages + +```bash +pip install torch numpy scipy tensorboard librosa==0.9.2 numba==0.56.4 pytorch-lightning gradio==3.14.0 ffmpeg-python onnxruntime tqdm cn2an pypinyin pyopenjtalk g2p_en chardet ``` -### Additional Requirements +#### Additional Requirements If you need Chinese ASR (supported by FunASR), install: @@ -52,9 +60,14 @@ If you need Chinese ASR (supported by FunASR), install: pip install modelscope torchaudio sentencepiece funasr ``` -### FFmpeg +#### FFmpeg -#### Ubuntu/Debian Users +##### Conda Users +```bash +conda install ffmpeg +``` + +##### Ubuntu/Debian Users ```bash sudo apt install ffmpeg @@ -62,13 +75,13 @@ sudo apt install libsox-dev conda install -c conda-forge 'ffmpeg<7' ``` -#### MacOS Users +##### MacOS Users ```bash brew install ffmpeg ``` -#### Windows Users +##### Windows Users Download and place [ffmpeg.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffmpeg.exe) and [ffprobe.exe](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffprobe.exe) in the GPT-SoVITS root. diff --git a/install.sh b/install.sh new file mode 100644 index 0000000..8dadb26 --- /dev/null +++ b/install.sh @@ -0,0 +1,6 @@ +#!/bin/bash +conda install -c conda-forge gcc +conda install -c conda-forge gxx +conda install ffmpeg cmake +conda install pytorch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 pytorch-cuda=11.8 -c pytorch -c nvidia +pip install -r requirements.txt diff --git a/requirements.txt b/requirements.txt index 665602e..7c613b3 100644 --- a/requirements.txt +++ b/requirements.txt @@ -7,7 +7,7 @@ pytorch-lightning gradio==3.14.0 ffmpeg-python onnxruntime -tqdm==4.59.0 +tqdm funasr cn2an pypinyin @@ -17,4 +17,5 @@ torchaudio modelscope sentencepiece transformers -PyYAML \ No newline at end of file +chardet +PyYAML