ComfyUI Node: Recognize Anything Model (RAM)

Authored by Hangover3832

Created

Updated

18 stars

Category

Hangover

Inputs

image IMAGE
model
  • ram_swin_large_14m.pth
  • ram_plus_swin_large_14m.pth
  • tag2text_swin_14m.pth
device
  • cpu
spec_tag2text STRING

Outputs

STRING

STRING

STRING

Extension: Recognize Anything Model (RAM) for ComfyUI

This is an image recognition node for ComfyUI based on the RAM++ model from a/xinyu1205. This node outputs a string of tags with all the recognized objects and elements in the image in English or Chinese language. For image tagging and captioning.

Authored by Hangover3832

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more