Nodes Browser
ComfyDeploy: How ComfyUI-WhisperX works in ComfyUI?
What is ComfyUI-WhisperX?
a comfyui cuatom node for audio subtitling based on [a/whisperX](https://github.com/m-bain/whisperX.git) and [a/translators](https://github.com/UlionTse/translators)
How to install it in ComfyDeploy?
Head over to the machine page
- Click on the "Create a new machine" button
- Select the
Edit
build steps - Add a new step -> Custom Node
- Search for
ComfyUI-WhisperX
and select it - Close the build step dialig and then click on the "Save" button to rebuild the machine
ComfyUI-WhisperX
a comfyui cuatom node for audio subtitling based on whisperX and translators
<div> <figure> <img alt='webpage' src="web.png?raw=true" width="600px"/> <figure> </div>Features
- export
srt
file for subtitle was supported - translate was supported by translators with huge number engine
- mutiple speaker diarization was supported by pyannote-audio
- huge comfyui custom nodes can merge in whisperx
How to use
make sure ffmpeg
is worked in your commandline
for Linux
apt update
apt install ffmpeg
for Windows,you can install ffmpeg
by WingetUI automatically
then!
git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
cd ComfyUI-WhisperX
pip install -r requirements.txt
weights
will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface
or if you still struggle with huggingface, you may try follow hf-mirror to config your env.
to speaker diarization, you need!
- Accept
pyannote/segmentation-3.0
user conditions - Accept
pyannote/speaker-diarization-3.1
user conditions - Create access token at
hf.co/settings/tokens
.