ComfyUI Extension: Unfied Vision Prompt Generator

Authored by GraftingRayman

Created

Updated

1 stars

A Qwen 2/2.5, Moondream and SmolVLM LLM caption/prompt genrator

Custom Nodes (0)

    README

    A prompt generator for ComfyUI using various different LLM models

        'qwen2vl': [
            'Qwen/Qwen2-VL-2B-Instruct',
            'Qwen/Qwen2-VL-7B-Instruct',
            'Qwen/Qwen2.5-VL-3B-Instruct',
            'Qwen/Qwen2.5-VL-7B-Instruct',
            'Qwen/Qwen2.5-VL-72B-Instruct',
        ],
        'moondream': [
            'vikhyatk/moondream2',
            'vikhyatk/moondream1',
        ],
        'smolvlm': [
            'HuggingFaceTB/SmolVLM-Instruct',
            'HuggingFaceTB/SmolVLM-256M-Instruct',
            'HuggingFaceTB/SmolVLM-500M-Instruct',
    

    I am a bit limited with the amount of VRAM (16GB) that I have, so unable to check any of the high parameter Qwen models, os if you have used them, let me know if they are doing what they should be