ComfyUI Node: FL Gemini Video Captioner

Authored by filliptm

Created

Updated

434 stars

Category

๐Ÿต๏ธFill Nodes/AI

Inputs

api_key STRING
model
  • gemini-1.0-pro-vision
  • gemini-1.5-pro
  • gemini-1.5-flash
  • gemini-2.0-flash
frames_per_second FLOAT
max_duration_minutes FLOAT
prompt STRING
process_audio
  • false
  • true
temperature FLOAT
max_output_tokens INT
top_p FLOAT
top_k INT
seed INT
video_path STRING
image IMAGE

Outputs

STRING

IMAGE

Extension: ComfyUI_Fill-Nodes

Fill-Nodes is a versatile collection of custom nodes for ComfyUI that extends functionality across multiple domains. Features include advanced image processing (pixelation, slicing, masking), visual effects generation (glitch, halftone, pixel art), comprehensive file handling (PDF creation/extraction, Google Drive integration), AI model interfaces (GPT, DALL-E, Hugging Face), utility nodes for workflow enhancement, and specialized tools for video processing, captioning, and batch operations. The pack provides both practical workflow solutions and creative tools within a unified node collection.

Authored by filliptm

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more