ComfyUI Extension: ComfyUI_HunyuanAvatar_Sm
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters,try it in comfyUI ,if your VRAM >24G.
Custom Nodes (0)
README
ComfyUI_HunyuanAvatar_Sm
- HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters,try it in comfyUI ,if your VRAM >24G
TIPS:
- 因为没用大显存测试,目前fp8 ,超低分辨率(小于256*256)出黑图,猜测是face emb尺寸太小导致的,也有可能是适配fp8量化,修改了许多block的代码导致的。
1.Installation
In the ./ComfyUI /custom_node directory, run the following:
git clone https://github.com/smthemex/ComfyUI_HunyuanAvatar_Sm.git
2.requirements
pip install -r requirements.txt
3 models
- download files from tencent/HunyuanVideo-Avatar
├── ComfyUI/models/HunyuanAvatar/
| ├── det_align/
| ├──detface.pt
| ├── llava_llama_image/
| ├──config.json
| ├── ...所有json文件以及所有safetensors模型
| ├──text_encoder_2/
| ├──config.json
| ├── ... 所有json文件以及model.safetensors模型
| ├──vae/
| ├──config.json
| ├── pytorch_model.pt
| ├──whisper-tiny/
| ├──config.json
| ├── ... 所有json文件以及model.safetensors模型
| ├── mp_rank_00_model_states_fp8_map.pt #104K
| ├── mp_rank_00_model_states_fp8.pt.pt #24.9G
4 example
🔗 BibTeX
If you find HunyuanVideo-Avatar useful for your research and applications, please cite using this BibTeX:
@misc{hu2025HunyuanVideo-Avatar,
title={HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters},
author={Yi Chen and Sen Liang and Zixiang Zhou and Ziyao Huang and Yifeng Ma and Junshu Tang and Qin Lin and Yuan Zhou and Qinglin Lu},
year={2025},
eprint={2505.20156},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/pdf/2505.20156},
}
Acknowledgements
We would like to thank the contributors to the HunyuanVideo, SD3, FLUX, Llama, LLaVA, Xtuner, diffusers and HuggingFace repositories, for their open research and exploration.