ComfyUI Node: Qwen2 VQA
Category
Comfyui_Qwen2-VL-Instruct
Inputs
text STRING
model
- Qwen2-VL-2B-Instruct-GPTQ-Int4
- Qwen2-VL-2B-Instruct-GPTQ-Int8
- Qwen2-VL-2B-Instruct
- Qwen2-VL-7B-Instruct-GPTQ-Int4
- Qwen2-VL-7B-Instruct-GPTQ-Int8
- Qwen2-VL-7B-Instruct
quantization
- none
- 4bit
- 8bit
keep_model_loaded BOOLEAN
temperature FLOAT
max_new_tokens INT
min_pixels INT
max_pixels INT
seed INT
source_path PATH
Outputs
STRING
Extension: ComfyUI_Qwen2-VL-Instruct
This is an implementation of a/Qwen2-VL-Instruct by a/ComfyUI, which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.
Authored by IuvenisSapiens
Run ComfyUI workflows in the Cloud!
No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues
Learn more