ComfyUI Node: QwenVL (Advanced)

Authored by 1038lab

Created

Updated

563 stars

Category

๐ŸงชAILab/QwenVL

Inputs

model_name
  • Qwen3-VL-2B-Instruct
  • Qwen3-VL-2B-Thinking
  • Qwen3-VL-2B-Instruct-FP8
  • Qwen3-VL-2B-Thinking-FP8
  • Qwen3-VL-4B-Instruct
  • Qwen3-VL-4B-Thinking
  • Qwen3-VL-4B-Instruct-FP8
  • Qwen3-VL-4B-Thinking-FP8
  • Qwen3-VL-8B-Instruct
  • Qwen3-VL-8B-Thinking
  • Qwen3-VL-8B-Instruct-FP8
  • Qwen3-VL-8B-Thinking-FP8
  • Qwen3-VL-32B-Instruct
  • Qwen3-VL-32B-Thinking
  • Qwen3-VL-32B-Instruct-FP8
  • Qwen3-VL-32B-Thinking-FP8
  • Qwen2.5-VL-3B-Instruct
  • Qwen2.5-VL-7B-Instruct
quantization
  • 4-bit (VRAM-friendly)
  • 8-bit (Balanced)
  • None (FP16)
attention_mode
  • auto
  • flash_attention_2
  • sdpa
use_torch_compile BOOLEAN
device
  • auto
  • cpu
  • mps
  • cuda:0
preset_prompt
  • ๐Ÿ–ผ๏ธ Tags
  • ๐Ÿ–ผ๏ธ Simple Description
  • ๐Ÿ–ผ๏ธ Detailed Description
  • ๐Ÿ–ผ๏ธ Ultra Detailed Description
  • ๐ŸŽฌ Cinematic Description
  • ๐Ÿ–ผ๏ธ Detailed Analysis
  • ๐Ÿ“น Video Summary
  • ๐Ÿ“– Short Story
  • ๐Ÿช„ Prompt Refine & Expand
custom_prompt STRING
max_tokens INT
temperature FLOAT
top_p FLOAT
num_beams INT
repetition_penalty FLOAT
frame_count INT
keep_model_loaded BOOLEAN
seed INT
image IMAGE
video IMAGE

Outputs

STRING

Extension: ComfyUI-QwenVL

ComfyUI-QwenVL custom node: Integrates the Qwen-VL series, including Qwen2.5-VL and the latest Qwen3-VL, to enable advanced multimodal AI for text generation, image understanding, and video analysis.

Authored by 1038lab

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more