ComfyUI Extension: Recognize Anything Model (RAM) for ComfyUI

Authored by Hangover3832

Created about a year ago

Updated 2 months ago

22 stars

This is an image recognition node for ComfyUI based on the RAM++ model from a/xinyu1205. This node outputs a string of tags with all the recognized objects and elements in the image in English or Chinese language. For image tagging and captioning.

Custom Nodes (1)

Recognize Anything Model (RAM)

README

Recognize Anything Model (RAM++) for ComfyUI

This repository will no longer be maintained.

Image Tagging and Captioning

The counterpart to Segment Anything Model (SAM)

This is an image recognition node for ComfyUI based on the RAM++ model from xinyu1205.

https://huggingface.co/xinyu1205/recognize-anything-plus-model
https://github.com/xinyu1205/recognize-anything
This node outputs a string of tags with all the recognized objects and elements in the image
3 different models.
RAM and RAM++ outputs tags in English and Chinese language:
tag2text outputs image tags and an image caption at the same time. It also takes user specified tags to be considered in the output:

Installation

Git clone this repository inside the custom_nodes folder or use ComfyUI-Manager and search for "RAM".

ℹ️ In order to make this node work, the "ram" package need to be installed. To do so, open a terminal inside the python_embedded folder within your ComfyUI installation:

python.exe -m pip install git+https://github.com/Hangover3832/recognize-anything.git

Furthermore you need to download the RAM, RAM++ and tag2text models and place it in the /ComfyUI/models/rams/ folder or use the ComfyUI-Manager model downloader. You can also configure the location in 'extra_model_paths.yaml' in the Confyui folder.

Updates

Added RAM and tag2text models.
Support for the 'extra_model_paths.yaml' configuration.