ComfyUI Node: QwenVL
Category
🧪AILab/QwenVL
Inputs
model_name
- Qwen3-VL-2B-Instruct
- Qwen3-VL-2B-Thinking
- Qwen3-VL-2B-Instruct-FP8
- Qwen3-VL-2B-Thinking-FP8
- Qwen3-VL-4B-Instruct
- Qwen3-VL-4B-Thinking
- Qwen3-VL-4B-Instruct-FP8
- Qwen3-VL-4B-Thinking-FP8
- Qwen3-VL-8B-Instruct
- Qwen3-VL-8B-Thinking
- Qwen3-VL-8B-Instruct-FP8
- Qwen3-VL-8B-Thinking-FP8
- Qwen3-VL-32B-Instruct
- Qwen3-VL-32B-Thinking
- Qwen3-VL-32B-Instruct-FP8
- Qwen3-VL-32B-Thinking-FP8
- Qwen2.5-VL-3B-Instruct
- Qwen2.5-VL-7B-Instruct
quantization
- 4-bit (VRAM-friendly)
- 8-bit (Balanced)
- None (FP16)
attention_mode
- auto
- flash_attention_2
- sdpa
preset_prompt
- 🖼️ Tags
- 🖼️ Simple Description
- 🖼️ Detailed Description
- 🖼️ Ultra Detailed Description
- 🎬 Cinematic Description
- 🖼️ Detailed Analysis
- 📹 Video Summary
- 📖 Short Story
- 🪄 Prompt Refine & Expand
custom_prompt STRING
max_tokens INT
keep_model_loaded BOOLEAN
seed INT
image IMAGE
video IMAGE
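The option lists above can be checked mechanically before a workflow is queued. The following is a minimal sketch, not part of the extension: the helper `validate_qwenvl_inputs`, the example values, and the assumption that every scalar input must be supplied are all hypothetical. The preset_prompt, image, and video inputs are left out for brevity. The option strings are copied from the listing above.

```python
from itertools import product

# Model names rebuilt from the listing: four Qwen3-VL sizes, two variants,
# with and without the FP8 suffix, plus the two Qwen2.5-VL models.
MODEL_NAMES = [
    f"Qwen3-VL-{size}-{variant}{suffix}"
    for size, variant, suffix in product(
        ("2B", "4B", "8B", "32B"), ("Instruct", "Thinking"), ("", "-FP8")
    )
] + ["Qwen2.5-VL-3B-Instruct", "Qwen2.5-VL-7B-Instruct"]

# Dropdown inputs and their listed options.
CHOICES = {
    "model_name": MODEL_NAMES,
    "quantization": ["4-bit (VRAM-friendly)", "8-bit (Balanced)", "None (FP16)"],
    "attention_mode": ["auto", "flash_attention_2", "sdpa"],
}

# Scalar inputs and the Python type matching each listed ComfyUI type.
TYPES = {
    "custom_prompt": str,       # STRING
    "max_tokens": int,          # INT
    "keep_model_loaded": bool,  # BOOLEAN
    "seed": int,                # INT
}


def validate_qwenvl_inputs(inputs: dict) -> list[str]:
    """Return a list of problems; an empty list means the inputs look valid.

    Hypothetical helper: it assumes every input in CHOICES and TYPES is required.
    """
    problems = []
    for name, allowed in CHOICES.items():
        if inputs.get(name) not in allowed:
            problems.append(f"{name}: {inputs.get(name)!r} is not a listed option")
    for name, expected in TYPES.items():
        value = inputs.get(name)
        # bool is a subclass of int in Python, so reject True/False for INT inputs.
        wrong_type = not isinstance(value, expected)
        bool_as_int = expected is int and isinstance(value, bool)
        if wrong_type or bool_as_int:
            problems.append(
                f"{name}: expected {expected.__name__}, got {type(value).__name__}"
            )
    return problems


# Illustrative values only; the node's real defaults are not stated here.
example = {
    "model_name": "Qwen3-VL-4B-Instruct",
    "quantization": "None (FP16)",
    "attention_mode": "auto",
    "custom_prompt": "",
    "max_tokens": 512,
    "keep_model_loaded": True,
    "seed": 1,
}
```

Calling `validate_qwenvl_inputs(example)` returns an empty list for the values shown, and each invalid or missing input adds one message to the result.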
Outputs
STRING
Extension: ComfyUI-QwenVL
ComfyUI-QwenVL is a custom node extension that integrates the Qwen-VL model series, including Qwen2.5-VL and Qwen3-VL, for text generation, image understanding, and video analysis.
Authored by 1038lab