ComfyUI Node: Qwen3 VQA

Authored by IuvenisSapiens

Created 2 years ago

Updated 9 months ago

566 stars

Run ComfyUI workflows without the setup

No installs, no CUDA version roulette, no GPU sitting idle on your bill. Bring a workflow and run it in the browser.

Inputs

text STRING

model

Qwen3-VL-4B-Instruct-FP8
Qwen3-VL-4B-Thinking-FP8
Qwen3-VL-8B-Instruct-FP8
Qwen3-VL-8B-Thinking-FP8
Qwen3-VL-4B-Instruct
Qwen3-VL-4B-Thinking
Qwen3-VL-8B-Instruct
Qwen3-VL-8B-Thinking

quantization

none
4bit
8bit

keep_model_loaded BOOLEAN

temperature FLOAT

max_new_tokens INT

min_pixels INT

max_pixels INT

seed INT

attention

eager
sdpa
flash_attention_2

source_path PATH

image IMAGE

Outputs

STRING

Extension: Comfyui_Qwen3-VL-Instruct

This is an implementation of Qwen3-VL-Instruct by ComfyUI, which includes, but is not limited to, support for text-based queries, video queries, single-image queries, and multi-image queries to generate captions or responses.

Authored by IuvenisSapiens

View Nodes

Looking for a different node?

Also provided by 1 other extension

Qwen3 VQA — ComfyUI_Qwen3-VL-Instruct
Comfyui_Qwen3-VL-Instruct

More nodes in Comfyui_Qwen3-VL-Instruct

Run ComfyUI workflows without the setup

No installs, no CUDA version roulette, no GPU sitting idle on your bill. Bring a workflow and run it in the browser.

Learn more

ComfyUI Node: Qwen3 VQA

Category

Inputs

Outputs

Extension: Comfyui_Qwen3-VL-Instruct

Also provided by 1 other extension

More nodes in Comfyui_Qwen3-VL-Instruct

Run ComfyUI workflows without the setup