ComfyUI Node: Qwen2 VQA

Authored by IuvenisSapiens

Created 11 months ago

Updated 4 months ago

104 stars

Inputs

text STRING

model

Qwen2-VL-2B-Instruct-GPTQ-Int4
Qwen2-VL-2B-Instruct-GPTQ-Int8
Qwen2-VL-2B-Instruct
Qwen2-VL-7B-Instruct-GPTQ-Int4
Qwen2-VL-7B-Instruct-GPTQ-Int8
Qwen2-VL-7B-Instruct

quantization

none
4bit
8bit

keep_model_loaded BOOLEAN

temperature FLOAT

max_new_tokens INT

min_pixels INT

max_pixels INT

seed INT

source_path PATH

Outputs

STRING

Extension: ComfyUI_Qwen2-VL-Instruct

This is an implementation of a/Qwen2-VL-Instruct by a/ComfyUI, which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.

Authored by IuvenisSapiens

View Nodes

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more

ComfyUI Node: Qwen2 VQA

Category

Inputs

Outputs

Extension: ComfyUI_Qwen2-VL-Instruct

Run ComfyUI workflows in the Cloud!