Nodes Browser

ComfyDeploy: How ComfyUI_Qwen2-Audio-7B-Instruct-Int4 works in ComfyUI?

What is ComfyUI_Qwen2-Audio-7B-Instruct-Int4?

This is an implementation of [a/Qwen2-Audio-7B-Instruct-Int4](https://github.com/QwenLM/Qwen2-Audio) by [a/ComfyUI](https://github.com/comfyanonymous/ComfyUI), including support for text-based queries and audio queries to generate captions or responses.

How to install it in ComfyDeploy?

Head over to the machine page

  1. Click on the "Create a new machine" button
  2. Select the Edit build steps
  3. Add a new step -> Custom Node
  4. Search for ComfyUI_Qwen2-Audio-7B-Instruct-Int4 and select it
  5. Close the build step dialig and then click on the "Save" button to rebuild the machine

ComfyUI_Qwen2-Audio-7B-Instruct-Int4

This is an implementation of Qwen2-Audio-7B-Instruct-Int4 by ComfyUI, including support for text-based queries and audio queries to generate captions or responses.


Basic Workflow

  • Text-based Query: Users can submit textual queries to request information or generate descriptions. For instance, a user might input a description like "What is the meaning of life?"

Chat_with_text_workflow preview

  • Audio Query: When a user uploads an audio file, the system can analyze the content and generate a detailed caption or a summary of the entire audio. For example, "Tell me what you hear in this audio clip."

Chat_with_audio_workflow preview

Installation

  • Install from ComfyUI Manager (search for Qwen2)

  • Download or git clone this repository into the ComfyUI\custom_nodes\ directory and run:

pip install -r requirements.txt

Download Models

All the models will be downloaded automatically when running the workflow if they are not found in the ComfyUI\models\prompt_generator\ directory.