ComfyUI Node: Transcribe by nemo-asr

Authored by kale4eat

Created

Updated

15 stars

Category

speech-dataset-toolkit/ai/nemo-asr

Inputs

model NEMO_ASR
audio AUDIO

Outputs

STRING

NEMO_ASR_SUBWORDS

NEMO_ASR_SEGMENTS

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.

Authored by kale4eat

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more