speech-dataset-toolkit/ai/kotoba-whisper
KOTOBA_WHISPER_SEGMENT
Basic audio tools for ComfyUI, built on torchaudio. They are intended to assist in creating speech datasets for ASR, TTS, and similar tasks.
Authored by kale4eat