ComfyUI Node: QwenVL

Authored by 1038lab

Created

Updated

563 stars

Category

🧪AILab/QwenVL

Inputs

model_name
  • Qwen3-VL-2B-Instruct
  • Qwen3-VL-2B-Thinking
  • Qwen3-VL-2B-Instruct-FP8
  • Qwen3-VL-2B-Thinking-FP8
  • Qwen3-VL-4B-Instruct
  • Qwen3-VL-4B-Thinking
  • Qwen3-VL-4B-Instruct-FP8
  • Qwen3-VL-4B-Thinking-FP8
  • Qwen3-VL-8B-Instruct
  • Qwen3-VL-8B-Thinking
  • Qwen3-VL-8B-Instruct-FP8
  • Qwen3-VL-8B-Thinking-FP8
  • Qwen3-VL-32B-Instruct
  • Qwen3-VL-32B-Thinking
  • Qwen3-VL-32B-Instruct-FP8
  • Qwen3-VL-32B-Thinking-FP8
  • Qwen2.5-VL-3B-Instruct
  • Qwen2.5-VL-7B-Instruct
quantization
  • 4-bit (VRAM-friendly)
  • 8-bit (Balanced)
  • None (FP16)
attention_mode
  • auto
  • flash_attention_2
  • sdpa
preset_prompt
  • 🖼️ Tags
  • 🖼️ Simple Description
  • 🖼️ Detailed Description
  • 🖼️ Ultra Detailed Description
  • 🎬 Cinematic Description
  • 🖼️ Detailed Analysis
  • 📹 Video Summary
  • 📖 Short Story
  • 🪄 Prompt Refine & Expand
custom_prompt STRING
max_tokens INT
keep_model_loaded BOOLEAN
seed INT
image IMAGE
video IMAGE

Outputs

STRING

Extension: ComfyUI-QwenVL

ComfyUI-QwenVL custom node: Integrates the Qwen-VL series, including Qwen2.5-VL and the latest Qwen3-VL, to enable advanced multimodal AI for text generation, image understanding, and video analysis.

Authored by 1038lab

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more