ComfyUI Node: Apply Silero VAD
Category
speech-dataset-toolkit/ai/SileroVAD
Inputs
model SILERO_VAD
audio AUDIO
threshold FLOAT
min_speech_duration_ms INT
max_speech_duration_s FLOAT
min_silence_duration_ms INT
window_size_samples INT
speech_pad_ms INT
Outputs
SILERO_VAD_TIMESTAMPS
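
The input names above match the parameters commonly used for Silero VAD speech-timestamp extraction. As a rough illustration of how such parameters typically interact, here is a minimal pure-Python sketch. It is a simplified stand-in, not the node's or Silero's actual implementation: the function name probs_to_timestamps, the probs input (one speech probability per window), the sampling_rate argument, and the start/end dictionary format are all assumptions made for this example. max_speech_duration_s is omitted for brevity.

```python
from typing import Dict, List


def probs_to_timestamps(
    probs: List[float],              # hypothetical: one speech probability per window
    sampling_rate: int = 16000,      # assumed sample rate, not a node input
    threshold: float = 0.5,
    min_speech_duration_ms: int = 250,
    min_silence_duration_ms: int = 100,
    window_size_samples: int = 512,
    speech_pad_ms: int = 30,
) -> List[Dict[str, int]]:
    """Simplified sketch: turn per-window probabilities into sample ranges."""
    min_speech = sampling_rate * min_speech_duration_ms // 1000
    min_silence = sampling_rate * min_silence_duration_ms // 1000
    pad = sampling_rate * speech_pad_ms // 1000
    total = len(probs) * window_size_samples

    # 1. Threshold each window to get raw speech segments (in samples).
    segments: List[List[int]] = []
    start = None
    for i, p in enumerate(probs):
        pos = i * window_size_samples
        if p >= threshold and start is None:
            start = pos
        elif p < threshold and start is not None:
            segments.append([start, pos])
            start = None
    if start is not None:
        segments.append([start, total])

    # 2. Merge segments separated by less than min_silence_duration_ms.
    merged: List[List[int]] = []
    for seg in segments:
        if merged and seg[0] - merged[-1][1] < min_silence:
            merged[-1][1] = seg[1]
        else:
            merged.append(seg)

    # 3. Drop segments shorter than min_speech_duration_ms, then
    # 4. pad the survivors by speech_pad_ms, clamped to the audio bounds.
    #    (Overlap between padded neighbours is not handled in this sketch.)
    result: List[Dict[str, int]] = []
    for s, e in merged:
        if e - s >= min_speech:
            result.append({"start": max(0, s - pad), "end": min(total, e + pad)})
    return result
```

In this sketch, raising threshold makes detection stricter, raising min_silence_duration_ms merges more neighbouring segments, and raising speech_pad_ms widens each kept segment on both sides.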
Extension: ComfyUI-speech-dataset-toolkit
Basic audio tools for ComfyUI built on torchaudio. The extension is intended to assist in creating speech datasets for ASR, TTS, and similar tasks.
Authored by kale4eat