ComfyDeploy: How does ComfyUI-VLM_Captions work in ComfyUI?
What is ComfyUI-VLM_Captions?
A simple ComfyUI node that lets you use Claude or ChatGPT 4o's VLM capabilities to generate captions/tags for images.
How to install it in ComfyDeploy?
- Head over to the machine page
- Click on the "Create a new machine" button
- Select the "Edit" build steps
- Add a new step -> Custom Node
- Search for ComfyUI-VLM_Captions and select it
- Close the build step dialog and then click on the "Save" button to rebuild the machine
ComfyUI-VLM-Captions
A simple ComfyUI node that lets you use Claude or ChatGPT 4o's VLM capabilities to generate captions/tags for images.
Installation
- git clone this repository into ComfyUI/custom_nodes/
Usage
The node takes an image and a prompt as inputs and returns a generated caption. The input image is automatically resized to 512 pixels to improve performance and reduce API costs. To generate a caption, enter a prompt such as "Create a concise description for the given image" in the text field. Replace the placeholder API key with your own; the node needs a valid key to work.
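The resize step can be illustrated with a small sketch. This is not the node's actual code: the function name `fit_within` is hypothetical, and it assumes that "512 pixels" means the longer side is scaled to 512 while the aspect ratio is preserved, which the description above does not confirm.

```python
def fit_within(width: int, height: int, max_side: int = 512) -> tuple[int, int]:
    """Return new (width, height) with the longer side scaled to max_side.

    Assumption: the aspect ratio is preserved and the longer side becomes
    max_side. The node itself may resize differently.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longer = max(width, height)
    # Scale both sides by the same factor; never let a side collapse to 0.
    new_width = max(1, round(width * max_side / longer))
    new_height = max(1, round(height * max_side / longer))
    return new_width, new_height


if __name__ == "__main__":
    print(fit_within(1024, 768))
    print(fit_within(300, 600))
```

A smaller image means fewer image tokens sent to the vision model, which is the likely reason resizing lowers cost.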
Workflow Example
Right-click > Convert widget to input to convert the conditioning text box into a node input.