A ComfyUI custom node extension that runs the Janus-Pro-7B vision-language model from DeepSeek AI on your local computer, enabling image understanding and multi-turn conversation about images.
cd ComfyUI/custom_nodes
git clone https://github.com/ShmuelRonen/ComfyUI-Janus_pro_vision.git
pip install requests
pip install tqdm
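To confirm the two packages installed into the Python environment that ComfyUI actually uses, a quick check like the one below can help. This is a minimal sketch and not part of the extension; `missing_packages` is a hypothetical helper name.

```python
import importlib.util


def missing_packages(names):
    """Return the package names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # The two dependencies listed in the install steps above.
    missing = missing_packages(["requests", "tqdm"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed dependencies are importable.")
```

Run it with the same Python interpreter that launches ComfyUI, since a portable or virtual-environment install may not share packages with the system Python.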
The model files will be automatically downloaded on first use from DeepSeek's HuggingFace repository.
If the automatic model download fails, you can download the files manually into the models\Janus-Pro folder:
git clone https://huggingface.co/deepseek-ai/Janus-Pro-7B
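After a manual download, it can be useful to check that the files landed where expected before starting ComfyUI. The sketch below is illustrative only: `has_model_files` is a hypothetical helper, the example path is an assumption about where the clone ends up, and checking for `config.json` is only a rough sign that the download completed (it does not verify the weight files).

```python
from pathlib import Path


def has_model_files(model_dir):
    """Return True if model_dir exists and contains a config.json file.

    Assumption: a complete Hugging Face model download includes config.json.
    The weight files themselves are not checked.
    """
    model_dir = Path(model_dir)
    return model_dir.is_dir() and (model_dir / "config.json").is_file()


if __name__ == "__main__":
    # Hypothetical location: running `git clone` inside models/Janus-Pro
    # creates a Janus-Pro-7B subfolder. Adjust to match your install.
    candidate = Path("ComfyUI") / "models" / "Janus-Pro" / "Janus-Pro-7B"
    status = "looks ready" if has_model_files(candidate) else "is missing or incomplete"
    print(candidate, status)
```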
Handles model loading and management.
Main analysis node with chat capabilities.
Inputs:
- janus_model: Model object from the loader node
- image_a: Primary image for analysis
- image_b: (Optional) Secondary image for comparison
- prompt: Text prompt or question about the image(s)
- chat_mode: Enable/disable chat functionality
- seed: Random seed for generation
- temperature: Generation temperature (0.0 - 2.0)
- top_p: Top-p sampling parameter (0.0 - 1.0)
- max_tokens: Maximum generation length
- image_size: Target image size for processing (512-2048)
- frame_size: Border thickness for image display (1-10)
- reset_chat: Clear chat history

Outputs:
- response: Model's response text
- chat_history: Formatted chat history (in chat mode)

Parameter details:
- image_size: Controls the maximum dimension while maintaining aspect ratio (default: 1024)
- frame_size: Border thickness for visual separation (default: 2)
- temperature: Controls response randomness
- top_p: Nucleus sampling parameter (0.95 recommended)
- max_tokens: Maximum length of generated response

This extension uses the Janus-Pro-7B model from DeepSeek AI.
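To make the image_size, temperature, and top_p parameters above more concrete, here is a minimal sketch of the arithmetic they usually stand for. It is not the extension's code: `fit_within` and `top_p_distribution` are hypothetical helpers, and the sketch assumes image_size sets the longer side of the image (scaling in either direction; whether the node upscales small images is not stated) and that temperature and top_p follow the usual temperature-scaled softmax with nucleus filtering.

```python
import math


def fit_within(width, height, image_size=1024):
    """Scale (width, height) so the longer side equals image_size, keeping aspect ratio."""
    scale = image_size / max(width, height)
    return max(1, round(width * scale)), max(1, round(height * scale))


def top_p_distribution(logits, temperature=1.0, top_p=0.95):
    """Return {token_index: probability} after temperature scaling and top-p filtering."""
    if temperature <= 0:
        # Treat a temperature of 0 as greedy decoding: all mass on the best token.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return {best: 1.0}

    # Lower temperature sharpens the distribution; higher temperature flattens it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Keep the smallest set of most likely tokens whose cumulative mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break

    # Renormalize so the surviving probabilities sum to 1.
    return {i: probs[i] / mass for i in kept}


if __name__ == "__main__":
    print(fit_within(2000, 1000, 1024))
    print(top_p_distribution([2.0, 1.0, 0.0], temperature=0.7, top_p=0.95))
```

In practice, a lower temperature or a lower top_p makes answers more repeatable, while higher values allow more varied wording.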
This project is MIT licensed. The Janus-Pro-7B model has its own license from DeepSeek AI.
Contributions are welcome! Please feel free to submit a Pull Request.