ComfyUI Node: Whisper STT 👂

Authored by 1038lab

Created

Updated

29 stars

Category

🧪AILab/🔊Audio

Inputs

audio AUDIO
model_size
  • tiny
  • base
  • small
  • medium
  • large
language
  • auto
  • af
  • ar
  • hy
  • az
  • be
  • bs
  • bg
  • ca
  • zh
  • hr
  • cs
  • da
  • nl
  • en
  • et
  • fi
  • fr
  • gl
  • de
  • el
  • he
  • hi
  • hu
  • is
  • id
  • it
  • ja
  • kn
  • kk
  • ko
  • lv
  • lt
  • mk
  • ms
  • mr
  • mi
  • ne
  • no
  • fa
  • pl
  • pt
  • ro
  • ru
  • sr
  • sk
  • sl
  • es
  • sw
  • sv
  • tl
  • ta
  • th
  • tr
  • uk
  • ur
  • vi
  • cy

Outputs

STRING

Extension: ComfyUI-EdgeTTS

ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS capabilities. It enables seamless conversion of text into natural-sounding speech, supporting multiple languages and voices. Ideal for enhancing user interactions, this node is easy to integrate and customize, making it perfect for various applications.

Authored by 1038lab

Run ComfyUI workflows in the Cloud!

No downloads or installs are required. Pay only for active GPU usage, not idle time. No complex setups and dependency issues

Learn more