a/https://huggingface.co/microsoft/Florence-2-large-ft Large or base model, support for captioning and bbox task modes, more coming soon.
https://huggingface.co/microsoft/Florence-2-large
All four models, initial support for all output types
<details> <summary>Examples</summary> </details>TODO: