ComfyUI Node: Transcribe by kotoba-whisper (Short-Form)
Category
speech-dataset-toolkit/ai/kotoba-whisper
Inputs
model KOTOBA_WHISPER_SHORT
audio AUDIO
prompt STRING
Outputs
STRING
KOTOBA_WHISPER_SEGMENTS
Extension: ComfyUI-speech-dataset-toolkit
Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.
Authored by kale4eat
Run ComfyUI workflows in the Cloud!
No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues
Learn more