ComfyUI Extension: ComfyUI_AudioTools
A ComfyUI node containing multiple audio processing tools.
Custom Nodes (0)
README
ComfyUI Nodes for Audio Processing and Related Tasks
π£ Updates
[2025-06-03]βοΈ: v1.2.0. Added Music/Vocal Separation, Vocal Extraction, and Audio Merging nodes. Download models TIGER-speech, TIGER-DnR, and place the entire folders into the models\TTS
directory.
[2025-05-27]βοΈ: Added Audio Denoising and Enhancement node. Download model last_best_checkpoint.pt and place it into the models\TTS\MossFormer2_SE_48K
directory.
[2025-05-23]βοΈ: Fixed logic issues with the Pause node. Now, the Pause node will pause on the first execution when connected in series or parallel. On subsequent executions, it will automatically pass if the preceding nodes have not changed.
[2025-04-28]βοΈ: Audio Loading, with custom loading paths including subdirectories.
[2025-04-26]βοΈ: Pause workflow anywhere.
[2025-04-25]βοΈ: String Editing.
[2025-03-28]βοΈ: Added Watermark Embedding node.
[2025-03-26]βοΈ: Released version v1.0.0.
π Introduction
Audio acts as a bridge connecting text, video, and images. A video without audio or text is tasteless. This project currently includes the following main nodes:
- Music/Vocal Separation, Vocal Extraction, Audio Merging, Audio Concatenation
- Audio Denoising and Enhancement
- Pause workflow anywhere
- Audio Loading, with custom loading paths including subdirectories
- Please rename the
extra_help_file.yaml.example
file toextra_help_file.yaml
, uncomment#
, and add custom loading directories likeaudios_dir: D:\AIGC\ComfyUI-Data\audios_input
. For Linux, use/
.
- Please rename the
- String Editing.
- Automatic Video Subtitling
- Audio Trimming at Arbitrary Time Markers
- Audio Volume, Speed, Pitch, Echo Processing, etc.
- Remove Silent Parts from Audio
- Audio Recording
- Audio Watermark Embedding
Examples:
- Music/Vocal Separation:
- Vocal Separation and Extraction:
- Merge Audio:
- Denoising and Enhancement:
- Audio Loading:
- String Editing.
- Add Subtitles to Video:
- Trim Audio at Arbitrary Time Markers:
- Audio Volume, Speed, Pitch, Echo Processing, etc.:
- Audio Recording and Remove Silent Parts:
- Audio Watermark Embedding (Embedding disabled, if watermark exists, it will be automatically detected):
- To use this node, download all models from SilentCipher and place them into the
ComfyUI\models\TTS\SilentCipher\44_1_khz\73999_iteration
directory.
Installation
Install sox and add it to your system's PATH.
cd ComfyUI/custom_nodes
git clone https://github.com/billwuhao/ComfyUI_AudioTools.git
cd ComfyUI_AudioTools
pip install -r requirements.txt
# python_embeded
./python_embeded/python.exe -m pip install -r requirements.txt