ComfyUI Extension: ComfyUI-Orpheus
TTS with emotional speech capabilities in 8 Languages 24 speakers.
Custom Nodes (0)
README
ComfyUI-Orpheus Node
2 custom nodes for ComfyUI that enables text-to-speech generation using the GGUF Orpheus model with emotional speech capabilities.
<img src="doc/demo.png" width="100%">Features
- High-quality text-to-speech synthesis
- Multiple voice options (24 different voices, depend language used) in English, French, Spanish, Italian, Chinese, Korean, German, Hindi
- Emotional speech capabilities
- Seamless integration with ComfyUI workflow
Available Voices
-
English Voices:
supported tags : chuckle, cough, gasp, groan, laugh, sigh, sniffle, yawn
tara
- Female voiceleah
- Female voicejess
- Female voiceleo
- Male voicedan
- Male voicemia
- Female voicezac
- Male voicezoe
- Female voice
-
French Voices:
supported tags : chuckle, cough, gasp, groan, laugh, sigh, sniffle, whimper, yawn
pierre
- Male voiceamelie
- Female voicemarie
- Female voice (doesn't works well)
-
German Voices:
supported tags : chuckle, cough, gasp, groan, laugh, sigh, sniffle, yawn
jana
- Female voicethomas
- Male voicemax
- Male voice
-
Korean Voices:
supported tags : 한숨, 헐, 헛기침, 훌쩍, 하품, 낄낄, 신음, 작은 웃음, 기침, 으르렁
유나
- ?^^준서
- ?^^
-
Chinese Voices:
supported tags : 嬉笑, 轻笑, 呻吟, 大笑, 咳嗽, 抽鼻子, 咳
长乐
- ?^^白芷
- ?^^
-
Hindi:
supported tags : unknow
ऋतिका
- ? ^^
-
Spanish Voices:
supported tags : groan, chuckle, gasp, resoplido, laugh, yawn, cough
javi
- Male voicesergio
- Male voicemaria
- Female voice
-
Italian Voices:
supported tags : sigh, laugh, cough, sniffle, groan, yawn, gemito, gasp
pietro
- Male voicegiulia
- Female voicecarlo
- Male voice
Requirements
- Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
Installation
- Clone this repository into your ComfyUI custom nodes directory:
cd ComfyUI/custom_nodes
git clone https://github.com/your-repo/ComfyUI-Orpheus.git
- Install the required dependencies:
load venv and :
pip install -r ComfyUI-Orpheus/requirements.txt
Or use python_embeded :
python_embeded\python.exe -m pip install -r ComfyUI-Orpheus/requirements.txt
-
Download the required GGUF model from FreddyAboulton huggingface page, and place it in your ComfyUI models directory under
<img src="doc/models.png" width="100%">models/unet/
. -
GPU Support
On windows, default installation of llama-cpp-python doesn't take NVIDIA GPU support. If you want NVIDIA GPU Support you need to locate nvcc.exe
folder and:
set CMAKE_ARGS="-DGGML_CUDA=on"
set CUDA_CXX="YOUR_CUDA_DIR\v12.x.x\bin\nvcc.exe"
python_embeded\python.exe -m pip install llama-cpp-python[server] --upgrade --force-reinstall --no-cache-dir
Be patient, can takes time...
Usage
-
In ComfyUI, locate the "Orpheus ⛓️" node in the node menu.
-
Configure the node parameters:
model_name
: Select your GGUF modelvoice
: Choose from available voicesprompt
: Enter the text you want to convert to speech, you can add emotive tags :<laugh>, <chuckle>, <sigh>, <cough>, <sniffle>, <groan>, <yawn>, <gasp>
-
Connect the node outputs:
audio
: Contains the generated audio waveform and sample rate
Limitations
- Maximum text length determined by MAX_TOKENS
- Processing speed depends on GPU capabilities
- Requires CUDA support for optimal performance
Credits
- Original Orpheus implementation
- Freddy Aboulton