This is a ComfyUI implementation of Qwen2-Audio-7B-Instruct-Int4 that supports text-based queries and audio queries to generate captions or responses.
Install from ComfyUI Manager (search for Qwen2)
Download or git clone this repository into the ComfyUI\custom_nodes\ directory and run:
pip install -r requirements.txt
All the models will be downloaded automatically when running the workflow if they are not found in the ComfyUI\models\prompt_generator\ directory.
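The download-if-missing behavior described above can be pictured with a minimal sketch. This is not the node's actual code: the function name `ensure_model`, its parameters, and the injected `download` callback are hypothetical, and the only detail taken from this README is the `prompt_generator` folder under the models directory.

```python
from pathlib import Path
from typing import Callable


def ensure_model(
    models_root: Path,
    model_name: str,
    download: Callable[[Path], None],
) -> Path:
    """Return the local model folder, downloading only when it is missing or empty.

    All names here are hypothetical; `download` stands in for whatever
    fetch routine the node really uses.
    """
    target = models_root / "prompt_generator" / model_name
    # A non-empty folder is treated as an existing download and reused.
    if target.is_dir() and any(target.iterdir()):
        return target
    target.mkdir(parents=True, exist_ok=True)
    download(target)  # fetch the weights into the target folder
    return target
```

In practice the `download` callback could wrap `huggingface_hub.snapshot_download(repo_id=..., local_dir=str(target))`, though whether this repository fetches its models that way is an assumption.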