Nodes Browser

ComfyDeploy: How ComfyUI_pixtral_vision works in ComfyUI?

What is ComfyUI_pixtral_vision?

The ComfyUI_pixtral_vision is a powerful ComfyUI node designed to integrate seamlessly with the Mistral Pixtral API. It facilitates the analysis of images through deep learning models, interpreting and describing the visual content. Users can input an image directly and provide prompts for context, utilizing an API key for authentication.

How to install it in ComfyDeploy?

Head over to the machine page

  1. Click on the "Create a new machine" button
  2. Select the Edit build steps
  3. Add a new step -> Custom Node
  4. Search for ComfyUI_pixtral_vision and select it
  5. Close the build step dialig and then click on the "Save" button to rebuild the machine

Update 20 nov

add preview_text node

Update 28 sep

Add maximum_tokens option

Update 25 sep

Add multiply images input node - 'Multi Images Input'

ComfyUI_pixtral_vision

The ComfyUI_pixtral_vision is a powerful ComfyUI node designed to integrate seamlessly with the Mistral Pixtral API. It facilitates the analysis of images through deep learning models, interpreting and describing the visual content. Users can input an image directly and provide prompts for context, utilizing an API key for authentication.

image

image

Overview

The ComfyUI_pixtral_vision node integrates with the Mistral Pixtral API to provide advanced image analysis capabilities within the ComfyUI framework. This node allows users to upload images and receive descriptive insights generated by deep learning models. It is particularly useful for applications requiring detailed visual understanding and content description.

Features

  • Image Analysis: Analyze images using the state-of-the-art Pixtral 12B model.
  • Dynamic Interactions: Adjust the randomness of responses with a temperature control.
  • Secure API Integration: Utilizes an API key for authenticated access to the Mistral Pixtral API.

Installation

To install the ComfyUI_pixtral_vision node, follow these steps:

  1. Clone the repository:
    git clone https://github.com/yourusername/ComfyUI_pixtral_vision.git
    
  2. Navigate to the cloned directory:
    cd ComfyUI_pixtral_vision
    
  3. Install the required dependencies:
    pip install -r requirements.txt
    

Getting the free API Key

  1. Visit Mistral AI and sign up or log into your account.
  2. Navigate to the API section and follow the instructions to generate a new API key.
  3. Once you have your API key, enter it into the node configuration as described in the setup instructions.

Usage

To use the node, input an image and a prompt describing what you are looking for in the image. Adjust the temperature setting as needed to control the response's randomness.

Credits

This project utilizes the Mistral Pixtral API. For more detailed information about the API, visit the official documentation.

References


For support, feature requests, or contributions, please visit the project's GitHub page.


This README includes a technical description of the node, installation instructions, guidance on obtaining an API key, usage instructions, and links to relevant resources. Adjust the GitHub URLs and any specific instructions according to your actual repository and setup details.