ComfyUI Node: Doubutsu Image Describer
Category
image/text
Inputs
image IMAGE
question STRING
max_new_tokens INT
temperature FLOAT
precision
- float16
- bfloat16
Outputs
STRING
Extension: ComfyUI-Doubutsu-Describer
This custom node for ComfyUI allows you to use the Doubutsu small VLM model to describe images. Credit and further information on Doubutsu: a/https://huggingface.co/qresearch/doubutsu-2b-pt-756
Authored by EnragedAntelope
Run ComfyUI workflows in the Cloud!
No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues
Learn more