ComfyUI Extension: ComfyUI_HunyuanAvatar_Sm

Authored by smthemex

Created 2 months ago

Updated about a month ago

76 stars

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters,try it in comfyUI ,if your VRAM >24G.

Custom Nodes (0)

README

ComfyUI_HunyuanAvatar_Sm

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters,try it in comfyUI ,if your VRAM >12 or 10G

TIPS:

fix disable cpu_offload causes error. 修复关闭cpu卸载引发出错的bug
try dual role ,face size to contol input image face mask #测试双人代码，不一定有效 ,face size参数用是获取垫图的人脸范围，如果脸小，就用小数值，默认是3.0

1.Installation

In the ./ComfyUI /custom_node directory, run the following:

git clone https://github.com/smthemex/ComfyUI_HunyuanAvatar_Sm.git

2.requirements

pip install -r requirements.txt

3 models

download files from tencent/HunyuanVideo-Avatar

├── ComfyUI/models/HunyuanAvatar/
|   ├── det_align/
|         ├──detface.pt
|   ├── llava_llama_image/
|         ├──config.json
|         ├── ...所有json文件以及所有safetensors模型
|   ├──text_encoder_2/
|         ├──config.json
|         ├── ... 所有json文件以及model.safetensors模型
|   ├──vae/
|         ├──config.json
|         ├── pytorch_model.pt
|   ├──whisper-tiny/
|         ├──config.json
|         ├── ... 所有json文件以及model.safetensors模型
|   ├── mp_rank_00_model_states_fp8_map.pt #104K if use fp8  如果用fp8则下载
|   ├── mp_rank_00_model_states_fp8.pt.pt #24.9G  if use fp8  如果用fp8则下载
|   ├──mp_rank_00_model_states.pt

4 example

🔗 BibTeX

If you find HunyuanVideo-Avatar useful for your research and applications, please cite using this BibTeX:

@misc{hu2025HunyuanVideo-Avatar,
      title={HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters}, 
      author={Yi Chen and Sen Liang and Zixiang Zhou and Ziyao Huang and Yifeng Ma and Junshu Tang and Qin Lin and Yuan Zhou and Qinglin Lu},
      year={2025},
      eprint={2505.20156},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/pdf/2505.20156}, 
}

Acknowledgements

We would like to thank the contributors to the HunyuanVideo, SD3, FLUX, Llama, LLaVA, Xtuner, diffusers and HuggingFace repositories, for their open research and exploration.