ComfyUI Extension: IF_Gemini
Enjoy the latest GEMINI V2 API for ComfyUI - generate images, analyze content, and use multimodal capabilities with Google's Gemini models
Custom Nodes (0)
README
ComfyUI-IF_Gemini
Enjoy Google Gemini API for ComfyUI generate images, transcribe audio, sumarize videos. Making a separate implemetation of my old IF_AI tools for easy installation
Features
- Text Generation: Create content, answer questions, and generate creative text formats
- Image Analysis: Describe, analyze, and extract information from images
- Image Generation: Generate images with Gemini's image generation capabilities
- Multi-Modal Input: Combine text and images in your prompts
- Customizable Parameters: Control temperature, output tokens, and other generation settings
- Chat Mode: Maintain conversation history for interactive sessions
- Batch Processing: Generate multiple outputs with a single prompt
Installation
- Clone this repository into your ComfyUI custom nodes folder:
cd ComfyUI/custom_nodes
git clone https://github.com/if-ai/ComfyUI-IF_Gemini
- Install the required Python packages:
cd ComfyUI-IF_Gemini
pip install -r requirements.txt
- Restart ComfyUI to load the new node
Usage
The Gemini node appears in the "ImpactFrames💥🎞️/LLM" category in the ComfyUI node browser.
-
Restart ComfyUI
-
Add your Gemini API key using one of these methods:
-
Shell configuration file (recommended for macOS/Linux):
# In ~/.zshrc, ~/.bashrc, or ~/.bash_profile: export GEMINI_API_KEY=your_api_key_here
Then restart your terminal or run
source ~/.zshrc
(or relevant file) -
System environment variable:
export GEMINI_API_KEY=your_api_key
-
Directly in the node: Enter your API key in the "external_api_key" field
-
In a
.env
file in the custom node directory:GEMINI_API_KEY=your_api_key
-
-
Add the "IF LLM Gemini AI" node to your workflow
-
Verify your API key using the "Verify API Key" button in the node
-
Configure the node:
- For text generation, set "operation_mode" to "analysis" or "generate_text"
- For image generation, set "operation_mode" to "generate_images"
- Connect reference images (optional) for style-based generation
-
Set additional parameters as needed:
- Prompt: Your text instructions
- Model version: Select appropriate Gemini model
- Temperature: Controls randomness (0.0-1.0)
- Seed: For reproducible results
Troubleshooting
- If you encounter API key errors, use the "Verify API Key" button to check its validity
- For image safety errors, try modifying your prompt to avoid content that may trigger safety filters
- Ensure your Gemini API has appropriate quotas for your usage
License
MIT
Support
If you find this tool useful, please consider supporting my work by:
- Starring this repo on GitHub
- Subscribing to my YouTube channel: Impact Frames
- Follow me on X: Impact Frames X
- Supporting me on Ko-fi: Impact Frames Ko-fi
- Becoming a patron on Patreon: Impact Frames Patreon Thank You!