ComfyUI Node: Image to Text - Auto Caption

Authored by christian-byrne

85 stars

Category

img2txt

Inputs

input_image IMAGE
use_blip_model BOOLEAN
use_llava_model BOOLEAN
use_mini_pcm_model BOOLEAN
use_all_models BOOLEAN
blip_caption_prefix STRING
prompt_questions STRING
temperature FLOAT
repetition_penalty FLOAT
min_words INT
max_words INT
search_beams INT
exclude_terms STRING

Outputs

STRING
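
The inputs and output listed above map naturally onto ComfyUI's custom-node conventions (an `INPUT_TYPES` classmethod plus `RETURN_TYPES`, `FUNCTION`, and `CATEGORY` attributes). The following is a minimal sketch of how such a declaration might look, not the extension's actual source: the class name `Img2TxtNodeSketch`, the method names, every default value and range, and the rule that `use_all_models` overrides the individual flags are all assumptions made for illustration.

```python
class Img2TxtNodeSketch:
    """Hypothetical stand-in for the node; not the extension's real class."""

    CATEGORY = "img2txt"          # category shown on the listing
    RETURN_TYPES = ("STRING",)    # single STRING output, as listed
    FUNCTION = "caption"          # method ComfyUI would call (name assumed)

    @classmethod
    def INPUT_TYPES(cls):
        # Input names and types come from the listing.
        # All defaults and ranges below are illustrative assumptions.
        return {
            "required": {
                "input_image": ("IMAGE",),
                "use_blip_model": ("BOOLEAN", {"default": True}),
                "use_llava_model": ("BOOLEAN", {"default": False}),
                "use_mini_pcm_model": ("BOOLEAN", {"default": False}),
                "use_all_models": ("BOOLEAN", {"default": False}),
                "blip_caption_prefix": ("STRING", {"default": ""}),
                "prompt_questions": ("STRING", {"default": "", "multiline": True}),
                "temperature": ("FLOAT", {"default": 0.8, "min": 0.1, "max": 2.0}),
                "repetition_penalty": ("FLOAT", {"default": 1.2, "min": 0.1, "max": 2.0}),
                "min_words": ("INT", {"default": 36, "min": 1}),
                "max_words": ("INT", {"default": 128, "min": 1}),
                "search_beams": ("INT", {"default": 5, "min": 1}),
                "exclude_terms": ("STRING", {"default": ""}),
            }
        }

    @staticmethod
    def selected_models(use_blip, use_llava, use_mini_pcm, use_all):
        # Assumption: use_all_models overrides the individual switches.
        flags = {"blip": use_blip, "llava": use_llava, "mini_pcm": use_mini_pcm}
        return [name for name, on in flags.items() if on or use_all]

    def caption(self, input_image, use_blip_model, use_llava_model,
                use_mini_pcm_model, use_all_models, **generation_options):
        # Placeholder body: a real node would run each selected model on
        # input_image. Here we only report which models would be used.
        models = self.selected_models(
            use_blip_model, use_llava_model, use_mini_pcm_model, use_all_models
        )
        # ComfyUI nodes return a tuple matching RETURN_TYPES.
        return (", ".join(models),)
```

In a workflow, the single `STRING` output would typically be wired into a text-encoding or display node. The placeholder `caption` method above returns only the names of the selected models, so it shows the plumbing rather than any captioning behavior.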

Extension: img2txt-comfyui-nodes

Get a general description of an image, or specify questions to ask about it (medium, art style, background, etc.). Supports questions in Chinese 🇨🇳 via the MiniCPM model.
