ComfyUI Node: Transcribe by kotoba-whisper (Short-Form)

Authored by kale4eat

9 stars

Category

speech-dataset-toolkit/ai/kotoba-whisper

Inputs

model KOTOBA_WHISPER_SHORT
audio AUDIO
prompt STRING

Outputs

STRING

KOTOBA_WHISPER_SEGMENTS
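
The inputs and outputs listed above can be read as a ComfyUI custom-node declaration. The following is only a minimal sketch of how such a declaration might look under ComfyUI's usual node conventions (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`, `CATEGORY`). It is not the extension's actual source. The class name, the `transcribe` method name, the widget options on `prompt`, and the assumption that `model` is a callable returning a dict with `"text"` and `"chunks"` keys are all hypothetical and chosen for illustration.

```python
class TranscribeKotobaWhisperShortSketch:
    """Hypothetical skeleton mirroring the listed inputs and outputs."""

    # Category path as shown in the listing.
    CATEGORY = "speech-dataset-toolkit/ai/kotoba-whisper"
    # Two outputs: the transcribed text and the segment data.
    RETURN_TYPES = ("STRING", "KOTOBA_WHISPER_SEGMENTS")
    # Name of the method ComfyUI would call (hypothetical name).
    FUNCTION = "transcribe"

    @classmethod
    def INPUT_TYPES(cls):
        # Assumption: all three inputs are required; the real node may differ.
        return {
            "required": {
                "model": ("KOTOBA_WHISPER_SHORT",),
                "audio": ("AUDIO",),
                "prompt": ("STRING", {"default": "", "multiline": True}),
            }
        }

    def transcribe(self, model, audio, prompt):
        # Assumption: `model` is a callable that accepts the audio and a
        # prompt and returns {"text": str, "chunks": list}. The real
        # KOTOBA_WHISPER_SHORT object may expose a different interface.
        result = model(audio, prompt=prompt)
        return (result["text"], result.get("chunks", []))
```

In a workflow, the `STRING` output would typically feed a text display or save node, while `KOTOBA_WHISPER_SEGMENTS` would go to nodes that consume per-segment data.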

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools for ComfyUI, built on torchaudio. The extension is intended to assist in creating speech datasets for ASR, TTS, and similar tasks.

Authored by kale4eat
