ComfyUI Node: Doubutsu Image Describer

Authored by EnragedAntelope

Created

Updated

10 stars

Category

image/text

Inputs

image IMAGE
question STRING
max_new_tokens INT
temperature FLOAT
precision
  • float16
  • bfloat16

Outputs

STRING

Extension: ComfyUI-Doubutsu-Describer

This custom node for ComfyUI allows you to use the Doubutsu small VLM model to describe images. Credit and further information on Doubutsu: a/https://huggingface.co/qresearch/doubutsu-2b-pt-756

Authored by EnragedAntelope

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more