ComfyUI Extension: JK AceStep Nodes
Advanced nodes optimized for ACE-Step audio generation in ComfyUI.
Installation
cd ComfyUI/custom_nodes/
git clone https://github.com/jeankassio/JK-AceStep-Nodes.git
Restart ComfyUI. Nodes will appear under JK AceStep Nodes/ categories.
Nodes
Ace-Step KSampler (Basic)
Full-featured sampler with quality check, advanced guidance (APG, CFG++, Dynamic CFG), anti-autotune smoothing, and automatic step optimization.
Category: JK AceStep Nodes/Sampling
Ace-Step KSampler (Advanced)
Extended sampler with start/end step control for multi-pass workflows and refinement.
Category: JK AceStep Nodes/Sampling
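A multi-pass workflow divides one step schedule across several sampler calls, each covering its own start/end range. The sketch below only illustrates that bookkeeping: `split_steps` is a hypothetical helper, and `start_step`/`end_step` are assumed names, not confirmed inputs of this node.

```python
def split_steps(total_steps: int, boundaries: list[int]) -> list[tuple[int, int]]:
    """Split [0, total_steps) into consecutive (start, end) ranges at the given boundaries."""
    if any(b <= 0 or b >= total_steps for b in boundaries):
        raise ValueError("each boundary must fall strictly between 0 and total_steps")
    points = [0, *sorted(set(boundaries)), total_steps]
    return [(points[i], points[i + 1]) for i in range(len(points) - 1)]


# Example: a 100-step schedule with a base pass followed by a refinement pass.
passes = split_steps(100, [70])
for start_step, end_step in passes:
    print(f"pass covers steps {start_step}..{end_step}")
```

Each pass would then be wired to its own sampler node, with the latent from one pass feeding the next.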
Ace-Step Prompt Gen
Prompt generator with 150+ professional music styles (Electronic, Rock, Jazz, Classical, Brazilian, World Music, and more).
Category: JK AceStep Nodes/Prompt
Lyrics Generators
Ten AI-powered lyrics generation nodes supporting various LLM providers:
Ace-Step OpenAI Lyrics
Lyrics generation using OpenAI GPT models.
Supported Models (December 2025):
- gpt-5.1 - Reasoning model (latest)
- gpt-5.1-codex - Coding optimized
- gpt-5 - High performance
- gpt-5-pro - Professional variant
- gpt-4o - Multimodal (recommended)
- gpt-4o-mini - Fast variant
- gpt-4-turbo - High performance
- gpt-4 - Stable base
- o3 - Reasoning model
- o3-mini - Compact reasoning
- o1 - Advanced reasoning
- o1-mini - Compact advanced reasoning
Category: JK AceStep Nodes/Lyrics
Ace-Step Claude Lyrics
Lyrics generation using Anthropic Claude models.
Supported Models (December 2025):
- claude-opus-4.5 - Latest flagship (recommended)
- claude-opus-4.1 - Previous flagship
- claude-sonnet-4.5 - Latest balanced
- claude-sonnet-4 - Previous balanced
- claude-haiku-4.5 - Latest fast
- claude-haiku-3.5 - Previous fast
- claude-3-5-sonnet-20241022 - Snapshot variant
- claude-3-5-haiku-20241022 - Snapshot variant
- claude-3-opus-20250219 - Dated variant
Category: JK AceStep Nodes/Lyrics
Ace-Step Gemini Lyrics
Lyrics generation using Google Gemini API.
Supported Models (December 2025):
- gemini-3-pro - Latest pro model (recommended)
- gemini-2.5-flash - Fast with latest capabilities
- gemini-2.5-flash-lite - Ultra-fast variant
- gemini-2.5-pro - High quality
- gemini-2.0-flash - Previous generation
- gemini-2.0-flash-lite - Previous generation lite
Category: JK AceStep Nodes/Lyrics
Ace-Step Groq Lyrics
High-speed lyrics generation using Groq API.
Supported Models (December 2025):
- llama-3.3-70b-versatile - Meta Llama 3.3 70B (best quality)
- llama-3.1-8b-instant - Meta Llama 3.1 8B (fast)
- llama-guard-4-12b - Meta Guard model
- deepseek-v3 - DeepSeek V3
- mistral-small-3 - Mistral Small v3
- gpt-oss-120b - OpenAI OSS 120B
- gpt-oss-20b - OpenAI OSS 20B
- Plus additional production models
Category: JK AceStep Nodes/Lyrics
Ace-Step Perplexity Lyrics
Lyrics generation using Perplexity Sonar models.
Supported Models (December 2025):
- sonar - Standard model
- sonar-pro - Professional variant
- sonar-reasoning - Reasoning-focused
- sonar-reasoning-pro - Advanced reasoning
- sonar-deep-research - Deep research variant
Category: JK AceStep Nodes/Lyrics
Ace-Step Cohere Lyrics
Lyrics generation using Cohere Command models.
Supported Models (December 2025):
- command-a-03-2025 - Latest Command A
- command-r7b-12-2024 - December 2024 variant
- command-r-plus-08-2024 - R+ August 2024
- command-r-08-2024 - R August 2024
- command-a-translate - Translation specialist
- command-a-reasoning - Reasoning-focused
- command-a-vision - Vision capabilities
- aya-expanse-32b - Aya Expanse 32B
- aya-expanse-8b - Aya Expanse 8B
- aya-vision - Aya with vision
- aya-translate - Aya translation specialist
Category: JK AceStep Nodes/Lyrics
Ace-Step Replicate Lyrics
Lyrics generation using Replicate API models.
Supported Models (December 2025):
- meta/llama-3.1-405b-instruct - 405B instruction-tuned
- meta/llama-3.1-70b-instruct - 70B instruction-tuned
- meta/llama-3.1-8b-instruct - 8B instruction-tuned
- meta/llama-3-70b-instruct - Llama 3 70B
- meta/llama-2-70b-chat - Llama 2 chat 70B
- mistralai/mistral-7b-instruct-v0.3 - Mistral 7B v0.3
- mistralai/mistral-small-24b-instruct-2501 - Mistral Small 24B
- mistralai/mixtral-8x7b-instruct-v0.1 - Mixtral MoE
Category: JK AceStep Nodes/Lyrics
Ace-Step HuggingFace Lyrics
Lyrics generation using HuggingFace Inference API.
Supported Models (December 2025):
- meta-llama/Llama-3.1-405B-Instruct - Large instruction-tuned
- meta-llama/Llama-3.3-70B-Instruct-Turbo - Llama 3.3 70B turbo
- meta-llama/Llama-3.1-70B-Instruct - 70B instruction-tuned
- mistralai/Mistral-Large - Large Mistral variant
- microsoft/Phi-4 - Phi-4 model
- deepseek-ai/deepseek-v3 - DeepSeek V3
- Qwen/Qwen2.5-72B-Instruct - Qwen 2.5 72B
- google/gemma-2-27b - Gemma 2 27B
- tiiuae/falcon-180b - Falcon 180B
Category: JK AceStep Nodes/Lyrics
Ace-Step Together AI Lyrics
Lyrics generation using Together AI serverless models.
Supported Models (December 2025):
- meta-llama/Llama-3.3-70B-Instruct-Turbo - Llama 3.3 70B turbo
- meta-llama/Llama-3.1-405B-Instruct-Turbo - Llama 3.1 405B turbo
- mistralai/Mistral-Small-24B-Instruct-2501 - Mistral Small 24B
- Qwen/Qwen2.5-72B-Instruct - Qwen 2.5 72B
- deepseek-ai/DeepSeek-V3 - DeepSeek V3
- moonshotai/Kimi-K2-Instruct - Kimi K2
- GLM-4-Plus - GLM 4 Plus
- Nous-Hermes-3-70B - Nous Hermes 3 70B
- Plus 100+ additional models available
Category: JK AceStep Nodes/Lyrics
Ace-Step Fireworks Lyrics
Lyrics generation using Fireworks AI models (100+ available).
Supported Models (December 2025):
- deepseek-ai/deepseek-v3 - DeepSeek V3 (latest)
- deepseek-ai/deepseek-r1 - DeepSeek R1 reasoning
- Qwen/Qwen3-235B-A22B-Instruct - Qwen 3 235B
- Qwen/Qwen2.5-72B-Instruct-Turbo - Qwen 2.5 72B turbo
- meta-llama/Llama-4-Maverick-17B - Llama 4 Maverick
- meta-llama/Llama-4-Scout-17B - Llama 4 Scout
- meta-llama/Llama-3.3-70B-Instruct - Llama 3.3 70B
- meta-llama/Llama-3.1-405B-Instruct - Llama 3.1 405B
- mistralai/Mistral-Large-3-675B-Instruct - Mistral Large 675B
- mistralai/Mistral-Small-24B-Instruct-2501 - Mistral Small 24B
- google/GLM-4.6 - GLM 4.6
- moonshotai/Kimi-K2 - Kimi K2
- google/Gemma-3-27b - Gemma 3 27B
- Plus 90+ additional models available
Category: JK AceStep Nodes/Lyrics
Ace-Step Save Text
Text saver with auto-incremented filenames and folder support. Works with any lyrics generator.
Category: JK AceStep Nodes/IO
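Auto-incremented filenames can be pictured as "scan the folder, find the highest counter, add one". The sketch below is one way to do that and is not taken from the node's source: the function names, the `lyrics` prefix, and the five-digit counter format are all assumptions made for illustration.

```python
import re
import tempfile
from pathlib import Path


def next_text_path(folder: Path, prefix: str = "lyrics", ext: str = ".txt") -> Path:
    """Return folder/prefix_NNNNN.ext using the next unused counter."""
    folder.mkdir(parents=True, exist_ok=True)  # folder support: create it on demand
    pattern = re.compile(rf"^{re.escape(prefix)}_(\d+){re.escape(ext)}$")
    used = [int(m.group(1)) for p in folder.iterdir() if (m := pattern.match(p.name))]
    return folder / f"{prefix}_{max(used, default=0) + 1:05d}{ext}"


def save_text(text: str, folder: Path, prefix: str = "lyrics") -> Path:
    """Write text to the next free filename and return the path that was used."""
    path = next_text_path(folder, prefix)
    path.write_text(text, encoding="utf-8")
    return path


# Usage: two saves into the same (temporary) folder get consecutive names.
with tempfile.TemporaryDirectory() as tmp:
    first = save_text("verse one", Path(tmp) / "songs")
    second = save_text("verse two", Path(tmp) / "songs")
    print(first.name, second.name)  # lyrics_00001.txt lyrics_00002.txt
```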
JKASS Custom Sampler
JKASS (Just Keep Audio Sampling Simple) is a custom sampler optimized for audio generation.
Available Variants
- jkass_quality - Second-order Heun method for maximum audio quality
  - Superior accuracy and detail preservation
  - Recommended for final renders
  - ~2x slower than fast variant
- jkass_fast - First-order Euler method for faster generation
  - Optimized for speed with vectorized operations
  - Good quality with reduced computation time
  - Best for iterations and previews
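The trade-off between the two variants follows from the underlying integration methods: Euler evaluates the derivative once per step, while Heun evaluates it twice and averages the slopes, which roughly doubles the cost but reduces the error. The sketch below shows this on a toy scalar ODE. It is not the JKASS implementation; the functions and the test equation are chosen purely for illustration.

```python
import math


def euler_step(f, t, y, h):
    """First-order Euler: one derivative evaluation per step."""
    return y + h * f(t, y)


def heun_step(f, t, y, h):
    """Second-order Heun: predict with Euler, then average the two slopes."""
    k1 = f(t, y)
    k2 = f(t + h, y + h * k1)
    return y + 0.5 * h * (k1 + k2)


def integrate(step, f, y0, t_end, n_steps):
    """Advance y from t=0 to t_end in n_steps equal steps using the given stepper."""
    h = t_end / n_steps
    t, y = 0.0, y0
    for _ in range(n_steps):
        y = step(f, t, y, h)
        t += h
    return y


def decay(t, y):
    """Toy ODE dy/dt = -y, whose exact solution is exp(-t)."""
    return -y


exact = math.exp(-1.0)
euler_err = abs(integrate(euler_step, decay, 1.0, 1.0, 20) - exact)
heun_err = abs(integrate(heun_step, decay, 1.0, 1.0, 20) - exact)
print(f"Euler error: {euler_err:.2e}, Heun error: {heun_err:.2e}")
```

With the same number of steps, the Heun result lands much closer to the exact value, at the price of two derivative evaluations per step instead of one.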
Key Features
- No noise normalization (preserves audio dynamics)
- Clean sampling path (prevents word cutting/stuttering)
- Optimized for long-form audio
Select your preferred variant from any sampler dropdown (default: jkass_quality).
Recommended Settings
- Sampler: jkass_quality (for best quality) or jkass_fast (for speed)
- Scheduler: sgm_uniform
- Steps: 80-100
- CFG: 4.0-4.5
- Anti-Autotune: 0.25-0.35 (vocals), 0.0-0.15 (instruments)
Extra settings for reducing 'AI' female vocals
- Sampler: jkass_quality (for best quality) or jkass_fast (for speed)
- Frequency Damping: 0.15-0.5 for female vocals to reduce metallic sizzle (0=disabled)
- Temporal Smoothing: 0.02-0.12 to reduce pitch quantization and temporal discontinuities
- Beat Stability: 0.05-0.2 to preserve stable rhythmic strikes and avoid per-frame jitter
- Anti-Autotune: 0.25-0.35 (vocals), 0.0-0.15 (instruments)
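The ranges listed above for vocals can be kept in one place and checked before a long render. This is a minimal sketch, assuming hypothetical parameter keys: only `anti_autotune_strength` appears elsewhere in this README, and the other names are placeholders that may not match the node's actual input names.

```python
# Recommended vocal ranges from this README; the keys are illustrative placeholders.
VOCAL_RANGES = {
    "steps": (80, 100),
    "cfg": (4.0, 4.5),
    "anti_autotune_strength": (0.25, 0.35),
    "frequency_damping": (0.15, 0.5),
    "temporal_smoothing": (0.02, 0.12),
    "beat_stability": (0.05, 0.2),
}


def out_of_range(settings: dict[str, float]) -> dict[str, tuple[float, float]]:
    """Return the recommended range for every known setting that falls outside it."""
    return {
        name: VOCAL_RANGES[name]
        for name, value in settings.items()
        if name in VOCAL_RANGES
        and not VOCAL_RANGES[name][0] <= value <= VOCAL_RANGES[name][1]
    }


for name, (low, high) in out_of_range({"steps": 50, "cfg": 4.2}).items():
    print(f"{name} is outside the recommended range {low}-{high}")
```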
Quality Check Feature
Automatically tests multiple step counts to find optimal settings for your prompt.
Important: Quality scores are comparative only. Compare them within the same prompt and style. Electronic music naturally scores lower than acoustic music, even when both are of excellent quality.
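Conceptually, a step-count sweep renders the same prompt at several step counts, scores each result, and keeps the best one. The sketch below shows only that loop. The README does not describe the node's actual scoring metric, so `fake_render` and `fake_score` are toy stand-ins and `pick_best_steps` is a hypothetical helper.

```python
from typing import Callable, Iterable


def pick_best_steps(
    candidates: Iterable[int],
    render: Callable[[int], list[float]],
    score: Callable[[list[float]], float],
) -> tuple[int, dict[int, float]]:
    """Score one render per candidate step count and return the best count plus all scores."""
    scores = {steps: score(render(steps)) for steps in candidates}
    best = max(scores, key=scores.get)
    return best, scores


def fake_render(steps: int) -> list[float]:
    """Toy stand-in for a sampler call: residual 'noise' shrinks as steps grow."""
    return [((-1) ** i) / steps for i in range(8)]


def fake_score(samples: list[float]) -> float:
    """Toy stand-in metric: less residual energy means a higher score."""
    return -sum(abs(s) for s in samples)


best, scores = pick_best_steps([40, 60, 80], fake_render, fake_score)
print(f"best step count: {best}")
```

Because every candidate is rendered from the same prompt, the scores are only meaningful relative to each other, which is why comparisons across prompts or styles are not reliable.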
Troubleshooting
- Word cutting/stuttering: Use the jkass_quality sampler, disable advanced optimizations
- Metallic voice: Increase anti_autotune_strength to 0.3-0.4
- AI-sounding female voice: Try the following sequence:
  - Use jkass_quality and 80-120 steps, CFG 4.0-4.5, APG enabled
  - Set Anti-Autotune (0.25-0.35), Frequency Damping (0.15-0.4), Temporal Smoothing (0.02-0.06)
  - Use the Prompt Gen with voice_style -> natural_female and add 'breathy, micro pitch variation' in extra prompt
  - Decode using a high-quality VAE/vocoder (HiFi-GAN, or validated VAE) for improved timbre
  - If still metallic: de-esser and mild EQ cut at 7-12 kHz; add subtle formant correction and breath overlay
  - Use the Ace-Step Post Process node to apply a quick de-essing (reduce 6-10 kHz energy), spectral smoothing, and subtle breath overlay to humanize the vocal further.
- Poor quality: Increase steps (80-120), use CFG 4.0-4.5, enable APG, try the jkass_quality sampler
Project Structure
JK-AceStep-Nodes/
├── __init__.py              # Central node aggregator
├── ace_step_ksampler.py     # Samplers (Basic + Advanced)
├── ace_step_prompt_gen.py   # Prompt generator (150+ styles)
├── lyrics_nodes.py          # 10 lyrics generators consolidated
├── ace_step_save_text.py    # Text saver node
├── requirements.txt
└── py/
    └── jkass_sampler.py     # Custom audio sampler
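ComfyUI discovers custom nodes through the `NODE_CLASS_MAPPINGS` and `NODE_DISPLAY_NAME_MAPPINGS` dictionaries exported by a package's `__init__.py`, so a "central node aggregator" typically merges the mappings of each node module. The sketch below shows that pattern with stand-in modules. The class names, node ids, and the `aggregate` helper are assumptions for illustration, not the contents of this repository's `__init__.py`.

```python
from types import SimpleNamespace


# Hypothetical placeholder classes; the real node classes may be named differently.
class AceStepSaveText: ...
class AceStepPromptGen: ...


# Stand-ins for modules such as ace_step_save_text.py and ace_step_prompt_gen.py.
save_text_module = SimpleNamespace(
    NODE_CLASS_MAPPINGS={"AceStepSaveText": AceStepSaveText},
    NODE_DISPLAY_NAME_MAPPINGS={"AceStepSaveText": "Ace-Step Save Text"},
)
prompt_gen_module = SimpleNamespace(
    NODE_CLASS_MAPPINGS={"AceStepPromptGen": AceStepPromptGen},
    NODE_DISPLAY_NAME_MAPPINGS={"AceStepPromptGen": "Ace-Step Prompt Gen"},
)


def aggregate(modules):
    """Merge per-module node mappings, refusing duplicate node ids."""
    class_map, name_map = {}, {}
    for module in modules:
        for node_id, cls in getattr(module, "NODE_CLASS_MAPPINGS", {}).items():
            if node_id in class_map:
                raise KeyError(f"duplicate node id: {node_id}")
            class_map[node_id] = cls
        name_map.update(getattr(module, "NODE_DISPLAY_NAME_MAPPINGS", {}))
    return class_map, name_map


# These two names are what ComfyUI reads from a custom node package.
NODE_CLASS_MAPPINGS, NODE_DISPLAY_NAME_MAPPINGS = aggregate(
    [save_text_module, prompt_gen_module]
)
print(sorted(NODE_CLASS_MAPPINGS))
```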
Available Lyrics Generators
- OpenAI - gpt-4o, gpt-4-turbo, gpt-4, gpt-3.5-turbo, and more
- Anthropic Claude - Claude 3.5 Sonnet, Claude 3 Opus/Sonnet/Haiku
- Google Gemini - gemini-2.5-flash, gemini-2.5-pro, gemini-2.0-flash, gemini-1.5-pro/flash
- Groq - Llama 3.3 70B, Llama 3.1 8B, Llama Guard 3, GPT-OSS (120B/20B), and Llama 4 preview models
- Perplexity - Sonar, Sonar Pro, Sonar Reasoning (with 128k context)
- Cohere - Command A/R+ (with reasoning & vision), Aya (multilingual)
- Replicate - Llama 3.1 (405B/70B/8B), Mistral Small/Nemo, Mixtral
- HuggingFace - Llama 3.1, Mistral, DeepSeek, Qwen, Falcon, and 100+ more
- Together AI - Llama 3.3/3.1, DeepSeek, Qwen 3, Mistral variants, and 50+ more
- Fireworks AI - DeepSeek V3/R1, Qwen 3, Llama 3.3/3.1, Mistral Large/Small, GLM, and 90+ more
How to Use the Vocoder (ADaMoSHiFiGAN)
To enable audio conversion with the vocoder (for improved final audio quality):
https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B/tree/main/music_vocoder
- Obtain the vocoder files:
  - diffusion_pytorch_model.safetensors (vocoder model)
  - config.json (vocoder configuration)
- Place both files in the folder JK-AceStep-Nodes/vocoder/
  The final paths should be:
  JK-AceStep-Nodes/vocoder/diffusion_pytorch_model.safetensors
  JK-AceStep-Nodes/vocoder/config.json
- Done!
  - The system will automatically detect these files when using nodes with vocoder enabled.
  - If the files are not present, audio will be generated without the vocoder.
Tip: Always use the correct file pair (model + config) to avoid artifacts or loading errors.
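Before starting a long render, it can help to confirm that both vocoder files are where the steps above put them. The following is a small standalone check, not part of the extension. The `vocoder_dir` path assumes the default `ComfyUI/custom_nodes/` install location and should be adjusted to your setup.

```python
from pathlib import Path

# File names taken from the steps above.
REQUIRED_VOCODER_FILES = ("diffusion_pytorch_model.safetensors", "config.json")


def missing_vocoder_files(vocoder_dir: Path) -> list[str]:
    """Return the required vocoder files that are not present in vocoder_dir."""
    return [name for name in REQUIRED_VOCODER_FILES if not (vocoder_dir / name).is_file()]


# Assumed install location; change this if ComfyUI lives elsewhere.
vocoder_dir = Path("ComfyUI/custom_nodes/JK-AceStep-Nodes/vocoder")
missing = missing_vocoder_files(vocoder_dir)
if missing:
    print("Vocoder files missing:", ", ".join(missing))
else:
    print("Both vocoder files found.")
```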
License
MIT License
Version: 2.3
Last Updated: December 2025
Happy music generation!