ComfyUI Extension: ComfyUI Qwen-Image Node
A custom node for ComfyUI that integrates Alibaba's Qwen-Image model for high-quality image generation with exceptional text rendering capabilities.
Custom Nodes (0)
README
ComfyUI Qwen-Image Node
A custom node for ComfyUI that integrates Alibaba's Qwen-Image model for high-quality image generation with exceptional text rendering capabilities.
⚠️ Important Notes
- System Requirements: Current operation requires 24GB GPU memory and 64GB+ RAM. Virtual memory may need to be enabled for optimal performance.
- MMGP Optimization: We are aware that our MMGP implementation may have optimization issues that prevent proper VRAM release, potentially causing problems with multiple consecutive runs. We are actively working on further optimizations to resolve this issue.
✨ Features
- 🎨 High-Quality Image Generation: Powered by 20B parameter MMDiT architecture
- 📝 Superior Text Rendering: Especially excels at Chinese text rendering and complex text layouts
- 🎯 Multiple Aspect Ratios: Preset ratios including 1:1, 16:9, 9:16, 4:3, 3:4
- 🌐 Multi-Language Support: Auto-detection of Chinese/English with language-specific prompt enhancement
- ⚙️ Rich Parameter Control: CFG scale, inference steps, seed control and more
- 🚀 Batch Generation: Generate multiple image variants in one run
- 🔧 Prompt Enhancement: Automatic prompt optimization and enhancement
- 📊 Real-Time Progress: Detailed progress bars with time estimation during inference
- 💾 Memory Optimization: VAE tiling, CPU offload, MMGP optimization support
🔧 Node List
Core Nodes
- Qwen-Image Model Loader: Specialized model loading and optimization node
- RH_Qwen-Image Generator: Efficient image generator with real-time progress display
- RH_Qwen-Image Prompt Enhancer: Advanced prompt preprocessing and enhancement
🚀 Quick Installation
Step 1: Install the Node
# Navigate to ComfyUI custom_nodes directory
cd ComfyUI/custom_nodes
# Clone the repository
git clone https://github.com/HM-RunningHub/ComfyUI_RH_Qwen-Image
# Install dependencies
cd ComfyUI_RH_Qwen-Image
pip install -r requirements.txt
Step 2: Download the Model
# Navigate to ComfyUI models directory
cd ../../models
# Download Qwen-Image model locally
git lfs clone https://huggingface.co/Qwen/Qwen-Image
# Restart ComfyUI
Step 3: Verify Installation
# Verify local model setup
cd custom_nodes/ComfyUI_RH_Qwen-Image
python verify_model.py
📖 Usage
Recommended Workflow (Optimized)
-
Add Model Loader:
- Add "Qwen-Image Model Loader" node in ComfyUI
- Keep
model_path
as default"local"
- Configure optimization options based on your GPU
-
Add Generator:
- Add "RH_Qwen-Image Generator" node
- Connect the
pipeline
output from model loader - Configure generation parameters
-
Basic Workflow:
[Model Loader] → [RH Generator] → [Save Image]
Example Prompts
English Example
A coffee shop entrance features a chalkboard sign reading "Qwen Coffee 😊 $2 per cup," with a neon light beside it displaying "通义千问"
Chinese Example
一只可爱的小猫咪坐在窗台上,阳光透过窗户洒在它身上,旁边放着一杯热咖啡,咖啡杯上写着"通义千问"
📐 Aspect Ratio Presets
- 1:1: 1328x1328 (Square)
- 16:9: 1664x928 (Landscape)
- 9:16: 928x1664 (Portrait)
- 4:3: 1472x1140 (Traditional Landscape)
- 3:4: 1140x1472 (Traditional Portrait)
- custom: Use custom width and height
🛠️ Technical Details
- Built on Diffusers library
- Supports CUDA and CPU inference
- Automatic language-specific prompt enhancement
- Support for bf16 precision for improved performance
- Advanced memory optimization with MMGP support
⚠️ Requirements
- Storage: ~20GB disk space for local model
- Performance: CUDA environment recommended, 24GB+ GPU memory, 64GB+ RAM
- Dependencies: Latest version of diffusers library required
- Special Feature: Optimized for Chinese text rendering
📁 Local Model Advantages
- ✅ No internet connection required
- ✅ Faster loading times
- ✅ Complete offline usage
- ✅ Avoid repeated downloads
- ✅ Data privacy protection
🔧 Troubleshooting
Common Issue 1: "module diffusers has no attribute QwenImagePipeline"
This error indicates an outdated diffusers version. Solution:
# Uninstall old version
pip uninstall diffusers -y
# Install latest version
pip install git+https://github.com/huggingface/diffusers
# Restart ComfyUI
Common Issue 2: Model Download Failed
- Check network connection
- Ensure access to HuggingFace Hub
- Verify sufficient disk space (~20GB)
Common Issue 3: GPU Out of Memory
- Close other GPU-intensive programs
- Use smaller image dimensions
- Reduce batch generation count
- Enable CPU offload in model loader
- Enable VAE tiling optimization
📄 License
Licensed under Apache 2.0, consistent with the original Qwen-Image project.
🔗 References
🤝 Contributing
Contributions are welcome! Please feel free to submit issues and pull requests.
⭐ Star History
If you find this project helpful, please consider giving it a star!