ComfyUI Node: Transcribe by kotoba-whisper (Short-Form)

Authored by kale4eat

Created 2 years ago

Updated about a year ago

23 stars

Run ComfyUI workflows without the setup

No installs, no CUDA version roulette, no GPU sitting idle on your bill. Bring a workflow and run it in the browser.

Inputs

model KOTOBA_WHISPER_SHORT

audio AUDIO

prompt STRING

STRING

KOTOBA_WHISPER_SEGMENTS

Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.

Authored by kale4eat

Looking for a different node?

No installs, no CUDA version roulette, no GPU sitting idle on your bill. Bring a workflow and run it in the browser.