Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural-sounding speech in the style of a given speaker (gender, pitch, speaking style, etc.). This is a simple ComfyUI custom node based on Parler-TTS.
2024/12/06
Previous updates
In the ./ComfyUI/custom_nodes directory, run the following:
git clone https://github.com/smthemex/ComfyUI_ParlerTTS.git
pip install -r requirements.txt
3.1 Using huggingface_hub
Use the default repo_id, or fill in a Hugging Face repository ID such as "parler-tts/parler-tts-mini-jenny-30H", "parler-tts/parler-tts-mini-v1", "parler-tts/parler-tts-large-v1", ... Enter the ID without the quotation marks.
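The repo_id field accepts either a Hugging Face repository ID or, for offline use, an absolute local path. The following is a minimal sketch of how such a value could be cleaned and classified. `classify_repo_id` is a hypothetical helper written for illustration and is not taken from this node's source; it simply strips stray quotes and spaces and treats absolute paths (including Windows drive-letter paths) as local.

```python
import os
import re
from typing import Tuple


def classify_repo_id(value: str) -> Tuple[str, str]:
    """Return (kind, cleaned_value), where kind is "local" or "hub".

    Hypothetical helper: the actual node may handle repo_id differently.
    """
    # Remove surrounding whitespace and any quotation marks pasted by mistake.
    cleaned = value.strip().strip("\"'").strip()

    # A drive-letter prefix such as "X:/" or "X:\" marks a Windows absolute path,
    # even when this code runs on a non-Windows system.
    has_drive = re.match(r"^[A-Za-z]:[\\/]", cleaned) is not None

    if has_drive or os.path.isabs(cleaned):
        return "local", cleaned
    return "hub", cleaned
```

With this sketch, a value like `"parler-tts/parler-tts-mini-v1 "` (quotes and trailing space included) is cleaned and classified as a Hub ID, while a drive-letter path is classified as local.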
3.2 Offline
Fill in the absolute path to the local model directory in repo_id, for example:
X:/XXX/XXX/parler-tts/parler-tts-mini-jenny-30H
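One possible way to prepare such a local directory is to download a model snapshot ahead of time with `huggingface_hub.snapshot_download` and then paste the resulting absolute path into repo_id. The sketch below assumes `huggingface_hub` is installed; `fetch_for_offline`, its `target_root` argument, and the `download` hook are hypothetical names introduced here, not part of this node.

```python
from pathlib import Path
from typing import Callable, Optional


def fetch_for_offline(repo_id: str, target_root: str,
                      download: Optional[Callable[..., object]] = None) -> str:
    """Download repo_id under target_root and return the absolute path to use.

    Hypothetical helper for illustration; `download` can be swapped out,
    e.g. for a dry run without network access.
    """
    if download is None:
        # Imported lazily so the helper can be loaded without the package.
        from huggingface_hub import snapshot_download
        download = snapshot_download

    # Mirror the "<org>/<model>" layout under the chosen root directory.
    local_dir = (Path(target_root).expanduser().resolve() / repo_id).as_posix()

    # snapshot_download fetches every file of the repository into local_dir.
    download(repo_id=repo_id, local_dir=local_dir)
    return local_dir
```

Calling `fetch_for_offline("parler-tts/parler-tts-mini-v1", "models")` once while online should leave a folder whose absolute path can then be entered in repo_id for offline runs.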
If you found this repository useful, please consider citing this work and also the original Stability AI paper:
@misc{lacombe-etal-2024-parler-tts,
author = {Yoach Lacombe and Vaibhav Srivastav and Sanchit Gandhi},
title = {Parler-TTS},
year = {2024},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/huggingface/parler-tts}}
}
@misc{lyth2024natural,
title={Natural language guidance of high-fidelity text-to-speech with synthetic annotations},
author={Dan Lyth and Simon King},
year={2024},
eprint={2402.01912},
archivePrefix={arXiv},
primaryClass={cs.SD}
}