ComfyUI Extension: ComfyUI-IF_AI_WishperSpeechNode

Authored by if-ai

Created

Updated

37 stars

This repository hosts a Text-to-Speech (TTS) application that leverages Whisper Speech for voice synthesis, allowing users to train a voice model on-the-fly. It is built on ComfyUI and supports rapid training and inference processes.

Custom Nodes (0)

    README

    ComfyUI-IF_AI_WishperSpeechNode

    A Convenient and Fast Text-to-Speech Application with Whisper Speech

    This repository hosts a Text-to-Speech (TTS) application that leverages Whisper Speech for voice synthesis, allowing users to train a voice model on-the-fly. It is built on ComfyUI and supports rapid training and inference processes.

    Features

    • On-the-fly Voice Training: Train a custom voice model using a short audio recording.
    • Fast Inference: Optional support for torch_Compile to enhance performance during inference and training.

    Installation

    • Git clone the repository to your custom_nodes folder
    • pip install -r requirements.txt

    IF dlib troubles try this workarounds

    DEDICATED ENV

    1-.VIA PIP activate env

    pip install cmake
    pip install dlib
    
    

    2-. VIA CLONING DLIB REPO

      git clone https://github.com/davisking/dlib.git
      cd dlib
    

    activate env

        python.exe setup.py install
    

    if nothing works try this with the terminal as admin 3-.VIA CONDA PKG

    1-.Activate the env on conda env conda install -c conda-forge dlib on micromamba env micromamba install -c conda-forge dlib

    Portable ENV

    1-.VIA PIP

    Open terminal as admin H:\ComfyUI_windows_portable\python_embeded\python.exe -m pip install cmake H:\ComfyUI_windows_portable\python_embeded\python.exe -m pip install dlib

    2-. VIA CLONING DLIB REPO git clone https://github.com/davisking/dlib.git cd dlib H:\ComfyUI_windows_portable\python_embeded\python.exe setup.py install