ComfyUI Extension: ComfyUI_HunyuanAvatar_Sm

Authored by smthemex

64 stars

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters. Try it in ComfyUI if you have more than 24 GB of VRAM.

    ComfyUI_HunyuanAvatar_Sm

    • HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters. Try it in ComfyUI if you have more than 12 GB (or 10 GB) of VRAM.
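
To see how a given GPU compares with the VRAM figures quoted on this page, a minimal sketch along these lines may help. It assumes PyTorch is importable (ComfyUI depends on it). The helper names `detect_vram_bytes` and `has_enough_vram` are hypothetical and not part of this extension, and the threshold is left as a parameter because the page quotes different figures.

```python
from typing import Optional

GIB = 1024 ** 3  # bytes per GiB


def detect_vram_bytes(device_index: int = 0) -> Optional[int]:
    """Return the total memory of a CUDA device in bytes, or None if unavailable."""
    try:
        import torch  # imported lazily so the helper degrades gracefully without PyTorch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_properties(device_index).total_memory


def has_enough_vram(total_bytes: Optional[int], required_gib: float) -> bool:
    """True if the reported memory is strictly greater than required_gib GiB."""
    if total_bytes is None:
        return False
    return total_bytes > required_gib * GIB
```

For example, `has_enough_vram(detect_vram_bytes(), 12)` checks the first CUDA device against a 12 GiB threshold. Cards usually report slightly less than their nominal capacity, so treat the result as a rough guide only.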

    TIPS:

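    • Fixed a bug where disabling cpu_offload caused an error.
    • Experimental dual-character support (test code, may not always work). The face size parameter sets the face region used for the face mask of the input reference image: if the face is small, use a smaller value. The default is 3.0.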

    1. Installation

    In the ./ComfyUI/custom_nodes directory, run the following:

    git clone https://github.com/smthemex/ComfyUI_HunyuanAvatar_Sm.git
    

    2. Requirements

    pip install -r requirements.txt
    

    3. Models

    ├── ComfyUI/models/HunyuanAvatar/
    |   ├── det_align/
    |         ├── detface.pt
    |   ├── llava_llama_image/
    |         ├── config.json
    |         ├── ... all JSON files and all safetensors model files
    |   ├── text_encoder_2/
    |         ├── config.json
    |         ├── ... all JSON files and model.safetensors
    |   ├── vae/
    |         ├── config.json
    |         ├── pytorch_model.pt
    |   ├── whisper-tiny/
    |         ├── config.json
    |         ├── ... all JSON files and model.safetensors
    |   ├── mp_rank_00_model_states_fp8_map.pt # 104K, download only if using fp8
    |   ├── mp_rank_00_model_states_fp8.pt # 24.9G, download only if using fp8
    |   ├── mp_rank_00_model_states.pt
    
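
Before launching a workflow, it can help to confirm that the files named in the tree above are in place. The following is a minimal sketch, not part of this extension: `missing_model_files` is a hypothetical helper. It checks only the files the tree names explicitly, so the "all JSON files" entries are not enumerated. It also follows the tree literally in always requiring mp_rank_00_model_states.pt, since the tree does not say whether an fp8 setup can omit it.

```python
from pathlib import Path
from typing import List

# Files named explicitly in the tree, relative to ComfyUI/models/HunyuanAvatar/.
ALWAYS_REQUIRED = [
    "det_align/detface.pt",
    "llava_llama_image/config.json",
    "text_encoder_2/config.json",
    "vae/config.json",
    "vae/pytorch_model.pt",
    "whisper-tiny/config.json",
    "mp_rank_00_model_states.pt",
]

# Files the tree marks as needed only when fp8 is used.
FP8_REQUIRED = ["mp_rank_00_model_states_fp8_map.pt"]
# Glob for the fp8 weights; the trailing * tolerates a variation in the extension.
FP8_WEIGHTS_PATTERN = "mp_rank_00_model_states_fp8.pt*"


def missing_model_files(root: Path, use_fp8: bool = False) -> List[str]:
    """Return the relative paths (or patterns) that were not found under root."""
    root = Path(root)
    missing = [rel for rel in ALWAYS_REQUIRED if not (root / rel).is_file()]
    if use_fp8:
        missing += [rel for rel in FP8_REQUIRED if not (root / rel).is_file()]
        if not any(root.glob(FP8_WEIGHTS_PATTERN)):
            missing.append(FP8_WEIGHTS_PATTERN)
    return missing
```

Calling `missing_model_files(Path("ComfyUI/models/HunyuanAvatar"), use_fp8=True)` returns an empty list when every listed file is present. Otherwise it returns the entries that still need to be downloaded.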

    4. Example

    🔗 BibTeX

    If you find HunyuanVideo-Avatar useful for your research and applications, please cite using this BibTeX:

    @misc{hu2025HunyuanVideo-Avatar,
          title={HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters}, 
          author={Yi Chen and Sen Liang and Zixiang Zhou and Ziyao Huang and Yifeng Ma and Junshu Tang and Qin Lin and Yuan Zhou and Qinglin Lu},
          year={2025},
          eprint={2505.20156},
          archivePrefix={arXiv},
          primaryClass={cs.CV},
          url={https://arxiv.org/pdf/2505.20156}, 
    }
    

    Acknowledgements

    We would like to thank the contributors to the HunyuanVideo, SD3, FLUX, Llama, LLaVA, Xtuner, diffusers and HuggingFace repositories, for their open research and exploration.