OuteTTS - Unified Text-To-Speech. A node for ComfyUI
Text-to-speech, voice cloning (source audio up to 20 seconds), auto-saving speakers.
Voice cloning works best with clean, loud, and clear source audio.
[2025-04-14] ⚒️: Released v1.0.0.
cd ComfyUI/custom_nodes
git clone https://github.com/billwuhao/ComfyUI_OuteTTS.git
cd ComfyUI_OuteTTS
pip install -r requirements.txt
# For ComfyUI portable builds that use the embedded Python (python_embeded):
./python_embeded/python.exe -m pip install -r requirements.txt
Place the model downloads in these directories:
ComfyUI/models/TTS
ComfyUI/models/TTS/DAC.speech.v1.0
ComfyUI/models/TTS/whisper-large-v3-turbo
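
The model directories named above can be created ahead of time. This is a minimal sketch, assuming it is run from the folder that contains ComfyUI; it only creates empty folders and does not download any model files.

```shell
# Assumption: the current directory is the one that contains ComfyUI.
# -p creates any missing parent directories (including ComfyUI/models/TTS)
# and does nothing if a directory already exists.
mkdir -p ComfyUI/models/TTS/DAC.speech.v1.0
mkdir -p ComfyUI/models/TTS/whisper-large-v3-turbo
```

Because `-p` is safe to repeat, the commands can be rerun without affecting files already placed in those folders.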