ComfyUI Node: Audio To Text (mtb)

Authored by melMass

Created

Updated

534 stars

Category

mtb/audio

Inputs

pipeline WHISPER_PIPELINE
audio AUDIO
language
  • auto
  • de
  • en
  • es
  • fr
  • it
  • ja
  • ko
  • nl
  • pt
  • ru
  • zh
return_timestamps BOOLEAN

Outputs

STRING

WHISPER_OUTPUT

Extension: MTB Nodes

NODES: Face Swap, Film Interpolation, Latent Lerp, Int To Number, Bounding Box, Crop, Uncrop, ImageBlur, Denoise, ImageCompare, RGV to HSV, HSV to RGB, Color Correct, Modulo, Deglaze Image, Smart Step, ...

Authored by melMass

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more