a comfyui cuatom node for audio subtitling based on a/whisperX and a/translators
a comfyui cuatom node for audio subtitling based on whisperX and translators
<div> <figure> <img alt='webpage' src="web.png?raw=true" width="600px"/> <figure> </div>srt
file for subtitle was supportedmake sure ffmpeg
is worked in your commandline
for Linux
apt update
apt install ffmpeg
for Windows,you can install ffmpeg
by WingetUI automatically
then!
git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
cd ComfyUI-WhisperX
pip install -r requirements.txt
weights
will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface
or if you still struggle with huggingface, you may try follow hf-mirror to config your env.
to speaker diarization, you need!
pyannote/segmentation-3.0
user conditionspyannote/speaker-diarization-3.1
user conditionshf.co/settings/tokens
.