ComfyUI Node: Recognize Anything Model (RAM)
Category
Hangover
Inputs
image IMAGE
model
- ram_swin_large_14m.pth
- ram_plus_swin_large_14m.pth
- tag2text_swin_14m.pth
device
- cpu
- gpu
spec_tag2text STRING
Outputs
STRING
STRING
STRING
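The interface listed above can be sketched as a ComfyUI custom node declaration. This is a minimal illustration, not the extension's actual source: the class name `RamTaggerSketch`, the method name `tag`, and the empty default for `spec_tag2text` are assumptions, and the inference step is a placeholder. Only the input names, the choice lists, the three STRING outputs, and the category come from the listing.

```python
# Hypothetical sketch of a ComfyUI custom node exposing the inputs and
# outputs listed above. Names not in the listing are illustrative only.
class RamTaggerSketch:
    # Choice lists copied from the node listing.
    MODELS = [
        "ram_swin_large_14m.pth",
        "ram_plus_swin_large_14m.pth",
        "tag2text_swin_14m.pth",
    ]
    DEVICES = ["cpu", "gpu"]

    @classmethod
    def INPUT_TYPES(cls):
        # ComfyUI reads this dict to build the node's input sockets and widgets.
        return {
            "required": {
                "image": ("IMAGE",),
                "model": (cls.MODELS,),
                "device": (cls.DEVICES,),
                # Assumed default; the listing only gives the type.
                "spec_tag2text": ("STRING", {"default": ""}),
            }
        }

    RETURN_TYPES = ("STRING", "STRING", "STRING")  # three string outputs
    FUNCTION = "tag"          # name of the method ComfyUI calls (assumed)
    CATEGORY = "Hangover"     # category shown in the listing

    def tag(self, image, model, device, spec_tag2text):
        if model not in self.MODELS:
            raise ValueError(f"unknown model: {model}")
        if device not in self.DEVICES:
            raise ValueError(f"unknown device: {device}")
        # Placeholder: a real node would load the checkpoint and run inference.
        return ("", "", "")
```

The listing does not name the three outputs, so the sketch returns empty strings rather than guessing what each one carries.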
Extension: Recognize Anything Model (RAM) for ComfyUI
This is an image recognition node for ComfyUI based on the RAM++ model from xinyu1205. It outputs a string of tags covering the objects and elements recognized in the image, in English or Chinese, and can be used for image tagging and captioning.
Authored by Hangover3832
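Since the node returns its tags as a single string, a downstream script will usually want them as a list. The helper below is a small sketch of that step. The function name `split_tags` is hypothetical, and the separator is an assumption: the listing does not say how tags are delimited, so it is left as a parameter to adjust to whatever the node actually emits.

```python
def split_tags(tag_string, separator=","):
    """Split a tag string into a de-duplicated list, preserving order.

    The separator is an assumption; pass the delimiter the node really uses.
    """
    seen = set()
    tags = []
    for raw in tag_string.split(separator):
        tag = raw.strip()
        # Skip empty fragments and tags already collected.
        if tag and tag not in seen:
            seen.add(tag)
            tags.append(tag)
    return tags
```

For example, `split_tags("dog | grass", separator="|")` yields the two tags with surrounding whitespace removed.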