ComfyUI Node: Qwen2-VL Model

Authored by gokayfem

481 stars

Category

VLM Nodes/Qwen2-VL

Inputs

image IMAGE
text_input STRING
model_name
  • Qwen2-VL-2B
  • Qwen2-VL-7B
  • Qwen2-VL-72B
  • Qwen2-VL-2B-AWQ
  • Qwen2-VL-2B-GPTQ-Int4
  • Qwen2-VL-2B-GPTQ-Int8
  • Qwen2-VL-7B-AWQ
  • Qwen2-VL-7B-GPTQ-Int4
  • Qwen2-VL-7B-GPTQ-Int8
  • Qwen2-VL-72B-AWQ
  • Qwen2-VL-72B-GPTQ-Int4
  • Qwen2-VL-72B-GPTQ-Int8
memory_mode
  • Balanced (8-bit)
  • Maximum Savings (4-bit)
  • CPU Offload
  • Default
max_new_tokens INT
temperature FLOAT
top_p FLOAT
video_frames IMAGE
fps FLOAT
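
The memory_mode options above suggest a trade-off between precision and memory use. This page does not document how the node implements them, so the following is only a minimal sketch of one plausible mapping. The function name `loading_options` and the returned keys are assumptions for illustration: `load_in_8bit` and `load_in_4bit` mirror the parameter names of the Hugging Face transformers `BitsAndBytesConfig`, and `device_map="auto"` is a common way to let layers spill to CPU. The node itself may behave differently.

```python
# Option strings copied verbatim from the node's memory_mode input.
MEMORY_MODES = (
    "Balanced (8-bit)",
    "Maximum Savings (4-bit)",
    "CPU Offload",
    "Default",
)


def loading_options(memory_mode: str) -> dict:
    """Hypothetical mapping from a memory_mode choice to loading options.

    The returned keys are illustrative assumptions, not the node's
    documented behavior.
    """
    if memory_mode not in MEMORY_MODES:
        raise ValueError(f"unknown memory_mode: {memory_mode!r}")
    if memory_mode == "Balanced (8-bit)":
        return {"load_in_8bit": True}
    if memory_mode == "Maximum Savings (4-bit)":
        return {"load_in_4bit": True}
    if memory_mode == "CPU Offload":
        return {"device_map": "auto"}
    # "Default": load the model without extra memory-saving options.
    return {}


for mode in MEMORY_MODES:
    print(mode, "->", loading_options(mode))
```

Keeping the mapping in one small function makes it easy to reject an unsupported option string before any model download starts.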

Outputs

STRING
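
For readers driving ComfyUI through its API-format workflow JSON, the inputs and the single STRING output listed above could be wired up roughly as follows. This is a sketch under stated assumptions: the `class_type` value `"Qwen2VL"`, the helper name `build_qwen2vl_node`, and the default values are hypothetical and not taken from this page. The `video_frames` and `fps` inputs are left out on the assumption that they are optional. Only the general shape (a `class_type` plus an `inputs` dict, with links written as `[node_id, output_index]`) follows ComfyUI's API workflow format.

```python
import json

# The twelve model_name options listed on this page.
MODEL_NAMES = {
    f"Qwen2-VL-{size}{suffix}"
    for size in ("2B", "7B", "72B")
    for suffix in ("", "-AWQ", "-GPTQ-Int4", "-GPTQ-Int8")
}


def build_qwen2vl_node(
    image_link,
    text_input,
    model_name="Qwen2-VL-2B",   # hypothetical default
    memory_mode="Default",      # hypothetical default
    max_new_tokens=128,         # hypothetical default
    temperature=0.7,            # hypothetical default
    top_p=0.9,                  # hypothetical default
):
    """Return one API-format node entry for the Qwen2-VL node."""
    if model_name not in MODEL_NAMES:
        raise ValueError(f"unknown model_name: {model_name!r}")
    return {
        "class_type": "Qwen2VL",  # hypothetical; check the installed node's name
        "inputs": {
            "image": image_link,  # [source_node_id, output_index]
            "text_input": text_input,
            "model_name": model_name,
            "memory_mode": memory_mode,
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
            "top_p": top_p,
        },
    }


# Node "1" is assumed to be an image-loading node whose output 0 is an IMAGE.
workflow = {"2": build_qwen2vl_node(["1", 0], "Describe this image.")}
print(json.dumps(workflow, indent=2))
```

A downstream node that accepts a STRING would then reference this node's output as `["2", 0]`.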

Extension: VLM_nodes

Custom Nodes for Vision Language Models (VLM), Large Language Models (LLM), Image Captioning, Automatic Prompt Generation, Creative and Consistent Prompt Suggestion, and Keyword Extraction

Authored by gokayfem
