ComfyUI Node: Qwen2 VQA

Authored by IuvenisSapiens

Created

Updated

87 stars

Category

Comfyui_Qwen2-VL-Instruct

Inputs

text STRING
model
  • Qwen2-VL-2B-Instruct-GPTQ-Int4
  • Qwen2-VL-2B-Instruct-GPTQ-Int8
  • Qwen2-VL-2B-Instruct
  • Qwen2-VL-7B-Instruct-GPTQ-Int4
  • Qwen2-VL-7B-Instruct-GPTQ-Int8
  • Qwen2-VL-7B-Instruct
quantization
  • none
  • 4bit
  • 8bit
keep_model_loaded BOOLEAN
temperature FLOAT
max_new_tokens INT
min_pixels INT
max_pixels INT
seed INT
source_path PATH

Outputs

STRING

Extension: ComfyUI_Qwen2-VL-Instruct

This is an implementation of a/Qwen2-VL-Instruct by a/ComfyUI, which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.

Authored by IuvenisSapiens

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more