ComfyUI Extension: ComfyUI-AutoLabel

Authored by fexploit

Created about a year ago

Updated 5 months ago

7 stars

ComfyUI-AutoLabel is a custom node for ComfyUI that uses BLIP (Bootstrapping Language-Image Pre-training) to generate detailed descriptions of the main object in an image. This node leverages the power of BLIP to provide accurate and context-aware captions for images. by Fexploit.

Custom Nodes (1)

Auto Label

README

ComfyUI-AutoLabel

Features

Image to Text Description: Generate detailed descriptions of the main object in an image.
Customizable Prompts: Provide your own prompt to guide the description generation.
Flexible Inference Modes: Supports GPU, GPU with float16, and CPU inference modes.
Offline Mode: Option to download and use models offline.

Installation

Clone the Repository: Clone this repository into your custom_nodes folder in ComfyUI.

git clone https://github.com/fexploit/ComfyUI-AutoLabel custom_nodes/ComfyUI-AutoLabel

Install Dependencies: Navigate to the cloned folder and install the required dependencies.
```
cd custom_nodes/ComfyUI-AutoLabel
pip install -r requirements.txt
```

Usage

Adding the Node

Start ComfyUI.
Add the AutoLabel node from the custom nodes list.
Connect an image input and configure the parameters as needed.

Parameters

image (required): The input image tensor.
prompt (optional): A string to guide the description generation (default: "a photography of").
repo_id (optional): The Hugging Face model repository ID (default: "Salesforce/blip-image-captioning-base").
inference_mode (optional): The inference mode, can be "gpu_float16", "gpu", or "cpu" (default: "gpu").
get_model_online (optional): Boolean flag to download the model online if not already present (default: True).

Contributing

Contributions are welcome! Please open an issue or submit a pull request with your changes.

License

This project is licensed under the MIT License.

Acknowledgements

ComfyUI
BLIP

Contact

For any inquiries, please open an issue on the GitHub repository.