ComfyUI Extension: ComfyUI-WhisperX

Authored by AIFSH

Created about a year ago

Updated 5 months ago

51 stars

a comfyui cuatom node for audio subtitling based on a/whisperX and a/translators

Custom Nodes (0)

README

ComfyUI-WhisperX

a comfyui cuatom node for audio subtitling based on whisperX and translators

Features

export srt file for subtitle was supported
translate was supported by translators with huge number engine
mutiple speaker diarization was supported by pyannote-audio
huge comfyui custom nodes can merge in whisperx

How to use

make sure ffmpeg is worked in your commandline for Linux

apt update
apt install ffmpeg

for Windows,you can install ffmpeg by WingetUI automatically

then!

git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
cd ComfyUI-WhisperX
pip install -r requirements.txt

weights will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface or if you still struggle with huggingface, you may try follow hf-mirror to config your env.

to speaker diarization, you need!

Accept pyannote/segmentation-3.0 user conditions
Accept pyannote/speaker-diarization-3.1 user conditions
Create access token at hf.co/settings/tokens.

Tutorial

Demo

WeChat Group && Donate