This node provides advanced text-to-speech functionality powered by KokoroTTS. Follow the instructions below to install, configure, and use the node within your portable ComfyUI installation.
This node provides advanced text-to-speech functionality powered by KokoroTTS. Follow the instructions below to install, configure, and use the node within your portable ComfyUI installation.
Download/Clone the Node:
Copy or download the entire DJZ-KokoroTTS
folder into the custom_nodes
folder inside your ComfyUI installation.
Running the Installer for Portable ComfyUI:
For Windows portable users:
install-portable.bat
file.Tip: Make sure your ComfyUI portable installation is not running while you execute the batch file. ⚠️
This node requires two specific models to work correctly.
/comfyui/models/kokoro/
models.json
configuration file. Please ensure that the models (e.g., the voice synthesis model and the text processing model) are exactly in this folder for the node to detect and load them properly.Configuration:
Open your preferred workflow editor in ComfyUI and add the KokoroTTS node to your workflow.
Configure the available options directly on the node. Options may include parameters such as voice type, speed, pitch, or any additional experimental features defined in the node's interface.
Running the Workflow:
Once configured, run your workflow. The node processes text input and produces high-quality speech output. The audio will be played directly or saved to a user-defined file directory, based on your settings.
Troubleshooting:
install-portable.bat
successfully./comfyui/models/kokoro/
.requirements.txt
have been installed.Inside the node, you can adjust various parameters to suit your needs:
These settings are fully customizable through the node interface within ComfyUI.
Recent Updates:
The KokoroTTS Node is a robust tool to add state-of-the-art text-to-speech capabilities to your image generation workflows. Follow the above steps to set up and enjoy seamless speech synthesis!
Happy synthesizing! 😊