ComfyUI Extension: ComfyUI-WhisperX

Authored by AIFSH

Created

Updated

34 stars

a comfyui cuatom node for audio subtitling based on a/whisperX and a/translators

Custom Nodes (0)

    README

    ComfyUI-WhisperX

    a comfyui cuatom node for audio subtitling based on whisperX and translators

    <div> <figure> <img alt='webpage' src="web.png?raw=true" width="600px"/> <figure> </div>

    Features

    • export srt file for subtitle was supported
    • translate was supported by translators with huge number engine
    • mutiple speaker diarization was supported by pyannote-audio
    • huge comfyui custom nodes can merge in whisperx

    How to use

    make sure ffmpeg is worked in your commandline for Linux

    apt update
    apt install ffmpeg
    

    for Windows,you can install ffmpeg by WingetUI automatically

    then!

    git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
    cd ComfyUI-WhisperX
    pip install -r requirements.txt
    

    weights will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface or if you still struggle with huggingface, you may try follow hf-mirror to config your env.

    to speaker diarization, you need!

    1. Accept pyannote/segmentation-3.0 user conditions
    2. Accept pyannote/speaker-diarization-3.1 user conditions
    3. Create access token at hf.co/settings/tokens.

    Tutorial

    Demo

    WeChat Group && Donate

    <div> <figure> <img alt='Wechat' src="wechat.jpg?raw=true" width="300px"/> <img alt='donate' src="donate.jpg?raw=true" width="300px"/> <figure> </div>

    Thanks