ComfyUI Node: Apply Silero VAD

Authored by kale4eat

Created

Updated

12 stars

Category

speech-dataset-toolkit/ai/SileroVAD

Inputs

model SILERO_VAD
audio AUDIO
threshold FLOAT
min_speech_duration_ms INT
max_speech_duration_s FLOAT
min_silence_duration_ms INT
window_size_samples INT
speech_pad_ms INT

Outputs

SILERO_VAD_TIMESTAMPS

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.

Authored by kale4eat

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more