ComfyUI Node: Transcribe by kotoba-whisper

Authored by kale4eat

12 stars

Category

speech-dataset-toolkit/ai/kotoba-whisper

Inputs

model KOTOBA_WHISPER
audio AUDIO
prompt STRING

Outputs

STRING
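
The inputs and output listed above map onto the usual ComfyUI custom-node class layout (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`, `CATEGORY`). The following is a minimal sketch of that layout, not the extension's actual source: the class name, the `transcribe` method name, the `prompt` widget options, and the assumption that `model` is a callable taking `(audio, prompt)` are all hypothetical and chosen only for illustration.

```python
class TranscribeByKotobaWhisperSketch:
    """Hypothetical stand-in that mirrors the listed socket types only."""

    # Category string as shown on the node page.
    CATEGORY = "speech-dataset-toolkit/ai/kotoba-whisper"
    # One output socket of type STRING.
    RETURN_TYPES = ("STRING",)
    # Name of the method ComfyUI would call (assumed name).
    FUNCTION = "transcribe"

    @classmethod
    def INPUT_TYPES(cls):
        # Assumption: all three inputs are required; the real node may
        # declare `prompt` as optional or use different widget options.
        return {
            "required": {
                "model": ("KOTOBA_WHISPER",),
                "audio": ("AUDIO",),
                "prompt": ("STRING", {"default": "", "multiline": True}),
            }
        }

    def transcribe(self, model, audio, prompt):
        # Assumption: `model` is a callable returning the transcript text.
        # The real KOTOBA_WHISPER object may expose a different interface.
        text = model(audio, prompt)
        # ComfyUI nodes return a tuple matching RETURN_TYPES.
        return (text,)


# Registration mapping in the form ComfyUI looks for in a custom-node module.
NODE_CLASS_MAPPINGS = {
    "TranscribeByKotobaWhisperSketch": TranscribeByKotobaWhisperSketch,
}
```

Outside ComfyUI, the sketch can be exercised with a stub in place of the model, for example `TranscribeByKotobaWhisperSketch().transcribe(lambda a, p: "text", audio, "")`, which returns a one-element tuple holding the transcript.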

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools for ComfyUI, built on torchaudio. The extension is intended to assist in creating speech datasets for ASR, TTS, etc.

Authored by kale4eat
