A plugin of ComfyUI for CosyVoice2, one component for text to Sonic Video
A plugin of ComfyUI for CosyVoice2,
one component for text to long Sonic video in ComfyUI.
In the ./ComfyUI/custom_node directory, run the following:
git clone https://github.com/benda1989/CosyVoice2_ComfyUI.git
cd CosyVoice2_ComfyUI
conda install -y -c conda-forge pynini==2.1.5
pip install -r requirements.txt
# If you encounter sox compatibility issues
# ubuntu
sudo apt-get install sox libsox-dev
# centos
sudo yum install sox sox-devel
By default project will download CosyVoice2-0.5B and CosyVoice-ttsfrd into pretrained_models,
You can copy it there if you downloaded before.
file tree looks like this at least:
-- ComfyUI/custom_node/CosyVoice2_ComfyUI/pretrained_models
|-- CosyVoice2-0.5B
|-- CosyVoice-ttsfrd
This project is design for v2, if you want to change between v1 and v2, you should reboot ComfyUI after once used.