Call LLM and VLM in a simple way using the OpenAI API standard from ComfyUI
This repository contains ComfyUI nodes that integrates with the OpenAI API: it allows you to use language models and vision language models within your workflow.
It is KISS by design and intended for those who only wants basic capabilities without having to import massive projects like LLM party: only the chat completions endpoint (with vision support) is implemented as it should be enough for 99% of use cases.
Thanks to its simplicity the project has a low footprint: it only has 1 external dependency (3 in total) !
openai
the official OpenAI bindings for Pythonnumpy
for computation but it is already a dependency of ComfyUIPillow
also already used by ComfyUI for image processingThe default base_url
parameter value targets the official OpenAI API endpoint but by changing it, you can also use this project with any openai compatible servers like vLLM, TGI, etc...
Search for OpenAI API
in the Custom Nodes Manager
and install it.
On the github interface, click the green <> Code
button and then Download ZIP
. Extract the root folder of the zip file into your ComfyUI/custom_nodes
directory.
You only need 2 nodes to get started:
If you want to customize the chat completion, you can chain options to modify the request. Most common options are available as predefined nodes but you can inject any key/value pair using the Extra body
node.
Also, several chat completions can be chained to share the context history of previous turns.