ComfyUI Node: Image to Text 🐼

Authored by zhongpei

Created

Updated

272 stars

Category

fofo🐼/image2prompt

Inputs

model IMAGE2TEXT_MODEL
image IMAGE
query
  • Describe this photograph.
  • What is this?
  • Please describe this image in detail.
  • As an AI image tagging expert, please provide precise tags for these images to enhance CLIP model's understanding of the content. Employ succinct keywords or phrases, steering clear of elaborate sentences and extraneous conjunctions. Prioritize the tags by relevance. Your tags should capture key elements such as the main subject, setting, artistic style, composition, image quality, color tone, filter, and camera specifications, and any other tags crucial for the image. When tagging photos of people, include specific details like gender, nationality, attire, actions, pose, expressions, accessories, makeup, composition type, age, etc. For other image categories, apply appropriate and common descriptive tags as well. Recognize and tag any celebrities, well-known landmark or IPs if clearly featured in the image. Your tags should be accurate, non-duplicative, and within a 20-75 word count range.
custom_query STRING
print_log BOOLEAN

Outputs

STRING

Extension: Comfyui_image2prompt

Nodes:Image to Text, Loader Image to Text Model.

Authored by zhongpei

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more