ComfyUI Extension: ComfyUI_parakeet-tdt

Authored by billwuhao

2 stars

parakeet-tdt-0.6b-v2: Automatic speech recognition (ASR) model designed for high-quality English transcription, featuring support for punctuation, capitalization, and accurate timestamp prediction.

    README

    ComfyUI Node for parakeet-tdt-0.6b-v2

    parakeet-tdt-0.6b-v2 is a fast, accurate automatic speech recognition (ASR) model. It is designed for high-quality English transcription and supports punctuation, capitalization, and accurate timestamp prediction.

    Usage

    • Quickly add captions.
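
As an illustration of how timestamped output can become captions, here is a minimal sketch. The `transcribe_segments` function follows the usage published on the public nvidia/parakeet-tdt-0.6b-v2 model card (`ASRModel.from_pretrained`, `transcribe(..., timestamps=True)`); whether this node calls NeMo in exactly that way is an assumption. `format_srt_time` and `segments_to_srt` are hypothetical helpers written for this example and are not part of the extension.

```python
from typing import Dict, List


def format_srt_time(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: List[Dict]) -> str:
    """Build SRT text from dicts with 'start', 'end' (seconds) and 'segment' (text)."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_time(seg["start"])
        end = format_srt_time(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['segment'].strip()}\n")
    return "\n".join(blocks)


def transcribe_segments(audio_path: str) -> List[Dict]:
    """Transcribe one file with NeMo and return segment-level timestamps.

    NeMo is imported lazily so the helpers above work without it installed.
    Assumption: the call pattern mirrors the public model card, not
    necessarily this node's internals.
    """
    import nemo.collections.asr as nemo_asr

    asr_model = nemo_asr.models.ASRModel.from_pretrained(
        model_name="nvidia/parakeet-tdt-0.6b-v2"
    )
    output = asr_model.transcribe([audio_path], timestamps=True)
    return output[0].timestamp["segment"]


if __name__ == "__main__":
    # Hand-written sample segments, used here instead of a real transcription.
    sample = [
        {"start": 0.0, "end": 1.5, "segment": "Hello."},
        {"start": 1.5, "end": 3.0, "segment": "World."},
    ]
    print(segments_to_srt(sample))
```

With NeMo installed, `segments_to_srt(transcribe_segments("speech.wav"))` would produce the caption text for a file; `speech.wav` is a placeholder name.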

    📣 Updates

    [2025-05-20]⚒️: Released v1.0.0.

    Installation

    cd ComfyUI/custom_nodes
    git clone https://github.com/billwuhao/ComfyUI_parakeet-tdt.git
    cd ComfyUI_parakeet-tdt
    pip install -r requirements.txt
    
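    # Linux (quote the extra so the shell does not treat the brackets as a glob)
    pip install "nemo_toolkit[asr]"
    
    # Windows (install NeMo from source; double quotes work in both cmd and PowerShell)
    git clone https://github.com/NVIDIA/NeMo
    cd NeMo
    pip install ".[asr]"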
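
After installing, it can help to confirm that the Python environment ComfyUI uses can actually see NeMo. The following is a minimal sketch, not part of the extension: `missing_packages` is a hypothetical helper, and the assumption is that `nemo_toolkit` installs under the top-level import name `nemo`. Run it with the same interpreter that launches ComfyUI.

```python
import importlib.util
from typing import Iterable, List


def missing_packages(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be located.

    find_spec only looks the module up; it does not import it, so this
    check is cheap even for heavy packages.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # "nemo" is the import name assumed for the nemo_toolkit distribution.
    missing = missing_packages(["nemo"])
    if missing:
        print("Not installed in this environment:", ", ".join(missing))
    else:
        print("NeMo is importable from this environment.")
```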
    

    If you encounter the error "RuntimeError: CUDA error: operation not supported", add --disable-cuda-malloc to the ComfyUI launch parameters, for example:

    .\python_embeded\python.exe -s ComfyUI\main.py --windows-standalone-build --disable-cuda-malloc
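
If ComfyUI is started from a script rather than by hand, the flag can be appended programmatically. This is a small sketch under that assumption; `with_flag` is a hypothetical helper, and the argument list simply mirrors the example command above.

```python
from typing import List, Sequence


def with_flag(args: Sequence[str], flag: str = "--disable-cuda-malloc") -> List[str]:
    """Return a copy of args that contains flag exactly once."""
    result = list(args)
    if flag not in result:
        result.append(flag)
    return result


if __name__ == "__main__":
    # Launch arguments taken from the example command in this README.
    base = [
        r".\python_embeded\python.exe",
        "-s",
        r"ComfyUI\main.py",
        "--windows-standalone-build",
    ]
    # The resulting list could be passed to subprocess.run to start ComfyUI.
    print(" ".join(with_flag(base)))
```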
    

    Model Download

    Acknowledgments

    NeMo