ComfyUI Node: Florence2

Authored by spacepxl

Created about a year ago

Updated about a year ago

82 stars

Category

Florence2

Inputs

FLORENCE2 FLORENCE2

image IMAGE

task

caption
detailed caption
more detailed caption
object detection
dense region caption
region proposal
caption to phrase grounding
referring expression segmentation
region to segmentation
open vocabulary detection
region to category
region to description
OCR
OCR with region
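
Each task option above corresponds to one of the task prompt tokens documented on the Florence-2 model card. The sketch below shows one plausible way to translate a task label into the prompt string passed to the model. The names `TASK_PROMPTS` and `build_prompt` are hypothetical and are not taken from the node's source; only the token strings come from the model card.

```python
# Hypothetical mapping from the node's task labels to Florence-2 task
# prompt tokens. The tokens are those listed on the Florence-2 model card;
# the dictionary and helper names are assumptions for illustration.
TASK_PROMPTS = {
    "caption": "<CAPTION>",
    "detailed caption": "<DETAILED_CAPTION>",
    "more detailed caption": "<MORE_DETAILED_CAPTION>",
    "object detection": "<OD>",
    "dense region caption": "<DENSE_REGION_CAPTION>",
    "region proposal": "<REGION_PROPOSAL>",
    "caption to phrase grounding": "<CAPTION_TO_PHRASE_GROUNDING>",
    "referring expression segmentation": "<REFERRING_EXPRESSION_SEGMENTATION>",
    "region to segmentation": "<REGION_TO_SEGMENTATION>",
    "open vocabulary detection": "<OPEN_VOCABULARY_DETECTION>",
    "region to category": "<REGION_TO_CATEGORY>",
    "region to description": "<REGION_TO_DESCRIPTION>",
    "OCR": "<OCR>",
    "OCR with region": "<OCR_WITH_REGION>",
}


def build_prompt(task: str, text_input: str = "") -> str:
    """Return the task token, followed by text_input when one is given."""
    token = TASK_PROMPTS[task]  # raises KeyError for an unknown task label
    return token + text_input if text_input else token
```

Tasks such as caption to phrase grounding need extra text, so `build_prompt("caption to phrase grounding", "a green car")` appends the phrase to the token, while `build_prompt("object detection")` returns the token alone.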

text_input STRING

max_new_tokens INT

num_beams INT

do_sample BOOLEAN

fill_mask BOOLEAN
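
The `max_new_tokens`, `num_beams`, and `do_sample` inputs share their names with standard arguments of the Hugging Face `generate()` method. As a minimal sketch, assuming the node forwards them unchanged, they could be collected like this. The helper name `generation_kwargs` and the default values are illustrative assumptions, not the node's actual defaults. `fill_mask` is left out because it is not a `generate()` argument.

```python
def generation_kwargs(max_new_tokens: int = 1024,
                      num_beams: int = 3,
                      do_sample: bool = False) -> dict:
    """Collect the node's generation inputs into a keyword dictionary.

    Hypothetical helper: the defaults here are illustrative only.
    """
    if max_new_tokens < 1:
        raise ValueError("max_new_tokens must be at least 1")
    if num_beams < 1:
        raise ValueError("num_beams must be at least 1")
    return {
        "max_new_tokens": max_new_tokens,  # upper bound on generated tokens
        "num_beams": num_beams,            # beam search width (1 disables beam search)
        "do_sample": bool(do_sample),      # sample instead of decoding greedily
    }
```

The resulting dictionary could then be unpacked into a call such as `model.generate(**inputs, **generation_kwargs(num_beams=1))`, where `model` and `inputs` are assumed to come from the loaded Florence-2 model and its processor.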

Outputs

IMAGE

STRING

F_BBOXES

Extension: ComfyUI-Florence-2

Nodes for Microsoft's Florence-2 model (https://huggingface.co/microsoft/Florence-2-large-ft). Supports the large or base model, with captioning and bbox task modes; more coming soon.
