Quality of Life ComfyUI nodes starting with Flux Resolution Calculator and Flux Sampler.
This repository contains custom nodes for the ComfyUI framework, focused on quality-of-life improvements that make tasks easier and more efficient. It began with two Flux nodes that enhance functionality and streamline workflows within ComfyUI.
The Flux Resolution Calculator determines the optimal image resolution for outputs generated with the Flux model, which sizes images by megapixel count rather than by the standard SDXL resolutions that traditional methods rely on. Users select a desired megapixel count, from 0.1 to 2.0 megapixels, and an aspect ratio, and the calculator returns the exact image dimensions needed for optimal performance with the Flux model. This ensures the generated images meet the quality and size requirements tailored to the user's needs. Additionally, while the official limit is 2.0 megapixels, during testing I have successfully generated images at higher resolutions, indicating the model can accommodate larger dimensions without compromising quality.
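For intuition, here is a minimal sketch of the math such a calculator performs given a megapixel target and an aspect ratio. The snap-to-multiples-of-64 step is an assumption based on common latent-size constraints, not the node's actual source:

```python
# Minimal sketch: derive width/height from a megapixel target and an
# aspect ratio. Rounding to multiples of 64 is an assumption (a common
# latent-size constraint), not necessarily what the node does internally.
def flux_dimensions(megapixels: float, ratio_w: int, ratio_h: int,
                    multiple: int = 64) -> tuple[int, int]:
    total_pixels = megapixels * 1_000_000
    # Solve w * h = total_pixels subject to w / h = ratio_w / ratio_h.
    width = (total_pixels * ratio_w / ratio_h) ** 0.5
    height = width * ratio_h / ratio_w
    snap = lambda v: max(multiple, round(v / multiple) * multiple)
    return snap(width), snap(height)

print(flux_dimensions(1.0, 16, 9))  # -> (1344, 768)
```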
The Flux Sampler node combines the functionality of the SamplerCustomAdvanced node and its input nodes into a single, streamlined node.
The Flux Union ControlNet Apply node is an all-in-one node compatible with the InstantX Union Pro ControlNet. It has been tested extensively with the union ControlNet type and works as intended. You can combine two ControlNet Union units and get good results; combining more than two is not recommended. The ControlNet has been tested only on the FLUX.1 [dev] model.
Recommended Settings:<br> strength: 0.15-0.65.<br> end percentage: 0.200-0.900.
Recommended PreProcessors:<br> Canny: Canny Edge (ControlNet Aux).<br> Tile: Tile (ControlNet Aux).<br> Depth: Depth Anything V2 Relative (ControlNet Aux).<br> Blur: Direct Input (Blurry Image) or Tile (ControlNet Aux).<br> Pose: DWPose Estimator (ControlNet Aux).<br> Gray: Image Desaturate (Comfy Essentials Custom Node).<br> Low Quality: Direct Input.
Results: (Canny and Depth Examples not included. They are straightforward.)<br><br> Pixel Low Resolution to High Resolution<br><br>
Photo Restoration<br><br>
Game Asset Low Resolution Upscale<br><br>
Blur to UnBlur<br><br>
Re-Color<br><br>
YouTube tutorial Union ControlNet Usage: <a href="https://www.youtube.com/watch?v=4_1A5pQkJkg">Video Tutorial</a>
Shakker Labs & InstantX Flux ControlNet Union Pro Model Download: <a href="https://huggingface.co/Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro">Hugging Face Link</a>
This node outputs an image's resolution as width, height, and aspect ratio. It can be connected to the Flux Resolution Calculator; to do so, follow these steps:
This node outputs a raw integer value of 1 or 2: Enable = 2, Disable = 1.
Use case: This can be set up before a two-way switch, allowing logical workflow control to flow in one direction or the other. For now, it only controls two logical flows; in the future, we will upgrade the node to support three or more logical switch flows.
A simple node that can be driven by boolean logic: a true input selects upscale model 1, and a false input selects upscale model 2.
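As a rough sketch of how these routing nodes behave (the function names and model names below are placeholders illustrating the logic, not node internals):

```python
# Illustrative sketch of the routing nodes described above.

def boolean_to_index(enabled: bool) -> int:
    # The boolean node's raw output: Enable = 2, Disable = 1.
    return 2 if enabled else 1

def two_way_switch(index: int, input_1, input_2):
    # A two-way switch routes the workflow based on the 1-or-2 integer.
    return input_1 if index == 1 else input_2

def pick_upscale_model(flag: bool, model_1, model_2):
    # The upscale selector: True -> model 1, False -> model 2.
    return model_1 if flag else model_2

print(two_way_switch(boolean_to_index(False), "path_a", "path_b"))           # -> path_a
print(pick_upscale_model(True, "upscale_model_1", "upscale_model_2"))        # -> upscale_model_1
```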
This node generates Gaussian noise based on the dimensions of the input image and blends the noise into either the entire image or only the masked region.
Issue: Generated faces/landscapes are realistic, but during upscaling, the AI model smooths the skin or texture, making it look plastic or adding smooth fine lines.
Solution: For upscaling, auto-segment or manually mask the face or other target regions and add noise. Then pass the blended image output to the KSampler and denoise at 0.20-0.50.
You can see the noise has been applied only to the face as per the mask. This will maintain the smooth bokeh and preserve the facial details during upscale.
Denoise the image using a Flux or SDXL sampler. Recommended sampler denoise: 0.10-0.50.
Settings:<br> noise_scale: 0.30-0.50.<br> blend_opacity: 10-25.
If you find too many artifacts on the skin or other textures, reduce both values. Increase them if the upscaled output still has plastic, velvet-like smooth lines.
Best Setting for AI-generated Faces:<br> noise_scale: 0.40-0.50.<br> blend_opacity: 15-25.
Best Setting for AI-generated texture (landscapes):<br> noise_scale: 0.30.<br> blend_opacity: 12-15.
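For intuition, here is a minimal sketch of a masked noise blend, assuming float images in [0, 1]; the parameter names mirror the settings above, but the exact blend math is an assumption, not the node's source:

```python
import numpy as np

def blend_gaussian_noise(image: np.ndarray, mask: np.ndarray,
                         noise_scale: float = 0.40,
                         blend_opacity: float = 15.0) -> np.ndarray:
    # Generate Gaussian noise at the image's own dimensions.
    noise = np.random.normal(0.0, noise_scale, image.shape)
    opacity = blend_opacity / 100.0  # the settings above are given as 10-25
    noisy = np.clip(image + noise * opacity, 0.0, 1.0)
    # Blend the noisy version in only where the mask is set.
    mask3 = mask[..., None] if mask.ndim == image.ndim - 1 else mask
    return image * (1.0 - mask3) + noisy * mask3

image = np.random.rand(512, 512, 3)        # stand-in for the input image
face_mask = np.zeros((512, 512))
face_mask[128:384, 128:384] = 1.0          # stand-in for a face mask
out = blend_gaussian_noise(image, face_mask, noise_scale=0.45, blend_opacity=20)
```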
Results: Example 1<br> Without Noise Blend:
With Noise Blend:
Example 2<br> Without Noise Blend:
With Noise Blend:
Example 3<br> Without Noise Blend:
With Noise Blend:
Example 4<br> Without Noise Blend:
With Noise Blend:
The node pipeline is as follows: Region Mask Generator --> Region Mask Processor --> Region Mask Validator --> Flux Region Conditioning --> Flux Attention Control --> Flux Overlay Visualizer (optional) --> Flux Attention Cleanup. <br> Note: Watching the video tutorial is a must; the learning curve for Flux Region Spatial Control is a bit steep.
Region Mask Generator: This node generates the regions in mask and bbox format. This information is then passed on to the Mask Processor.<br>
<br>
Region Mask Processor: This node processes the generated mask, applying Gaussian blur and feathering, and sends the preprocessed mask on through the pipeline.<br>
<br>
Region Mask Validator: This node checks the validity of the regions. The "is valid" flag will be true if there are no overlaps, and the validation message shows detailed information on the overlapping regions and the overlap percentage. Although the methodology used requires zero overlap, feathering in the Flux Attention Control resolves minor overlaps; overlap only becomes an issue when it is excessive, beyond 40-50%.<br>
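A minimal sketch of the kind of overlap check the validator performs; the percentage formula here (intersection over the smaller region) is an assumption for illustration:

```python
import numpy as np

def overlap_percentage(mask_a: np.ndarray, mask_b: np.ndarray) -> float:
    # Binarize the masks, then measure how much of the smaller region
    # is covered by the intersection.
    a, b = mask_a > 0.5, mask_b > 0.5
    intersection = np.logical_and(a, b).sum()
    smaller = min(a.sum(), b.sum())
    return 100.0 * intersection / smaller if smaller else 0.0

region_1 = np.zeros((64, 64)); region_1[:, :36] = 1.0   # left region
region_2 = np.zeros((64, 64)); region_2[:, 28:] = 1.0   # right region
pct = overlap_percentage(region_1, region_2)
print(f"overlap: {pct:.1f}%", "valid" if pct < 40 else "excessive")
```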
<br>
Flux Region Conditioning: Up to three separate conditionings can be connected. The node processes based on the number of regions defined rather than the number of actual conditioning connections. The strength values are independent for each region: Strength 1 for Region 1, Strength 2 for Region 2, and Strength 3 for Region 3. The strength range is 0 to 10 with an increment/decrement step of 0.1. At a value of 1, the region strength matches the base conditioning strength, which is always set at 1 as a global value. Strength values are not only relative to the base conditioning but also relative to each other, and they are further affected by the region's percentage area of the canvas and the feathering value in the attention control. Please note: only use the dual CLIP and Flux conditioning in ComfyUI. The base + region Flux guidance should be set to 1.<br>
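To illustrate how strengths end up relative to each other and to region area, here is an illustrative formula only; the node's real math also involves feathering and the base conditioning:

```python
def relative_region_weights(strengths, area_fractions):
    # Illustrative only: each region's pull scales with its strength and
    # its share of the canvas, and regions compete for relative influence.
    raw = [s * a for s, a in zip(strengths, area_fractions)]
    total = sum(raw) or 1.0
    return [r / total for r in raw]

# Two regions with equal strength 1.0 but unequal canvas area:
print(relative_region_weights([1.0, 1.0], [0.7, 0.3]))  # -> [0.7, 0.3]
# Raising Region 2's strength compensates for its smaller area:
print(relative_region_weights([1.0, 2.4], [0.7, 0.3]))  # -> ~[0.49, 0.51]
```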
<br>
Flux Attention Control: This node takes the region conditioning, the base conditioning, the feathering strengths, and all the previous information in the pipeline, and overrides the Flux attention. When disabled, it passes only the base conditioning through to the sampler.<br>
<br>
Region Overlay Visualizer: This node overlays the regions on the final output for visual purposes only.<br>
Flux Attention Cleanup: Since the attention is overridden in the model, a tensor mismatch error will occur when you switch workflows. At the same time, we do not want the attention to be cleaned up within the existing workflow. This node automatically preserves the attention override during re-runs of the current workflow but performs a fresh cleanup, restoring Flux's original attention, when you switch workflows. This is achieved without a model unload or manual cache cleanup, as those will not work.<br>
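Conceptually, the preserve/restore behavior looks something like this (a hypothetical sketch, not the node's actual implementation):

```python
# Hypothetical sketch of preserve-and-restore attention patching.
_original_attention: dict = {}

def override_attention(layer, patched_forward):
    # Keep the original forward exactly once, then patch it.
    if id(layer) not in _original_attention:
        _original_attention[id(layer)] = (layer, layer.forward)
    layer.forward = patched_forward

def cleanup_attention(workflow_changed: bool):
    if not workflow_changed:
        return  # re-runs of the same workflow keep the override in place
    for layer, original_forward in _original_attention.values():
        layer.forward = original_forward  # restore Flux's original attention
    _original_attention.clear()
```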
<br>
Xformers & Token/Attention Limits: The pipeline uses an advanced attention mechanism that combines text tokens from your prompts with spatial information from defined regions. As you increase prompt length or add multiple, complex regions, you create larger attention matrices. While xFormers helps optimize memory usage, there is still a practical limit on how many tokens and spatial positions the model can handle without causing dimension or shape alignment errors.
Example Error: 'Invalid shape for attention bias: torch.Size([1, 24, 5264, 5264]) (expected (1, 24, 5118, 5118))'
This limit isn’t about a fixed “5,000 x 5,000” size or a strict VRAM cap. Instead, it’s determined by the model’s architecture and how tokens are combined with spatial positions. Extremely long prompts or too many intricate regions can produce attention shapes that the model’s code cannot process, resulting in shape mismatch errors rather than running out of memory. If you encounter these errors, try shortening your prompt or reducing the complexity of your regional conditioning. There isn’t a simple formula linking VRAM size directly to token count. Instead, it’s about balancing your prompt length and region definitions to keep the attention mechanism within workable limits. Testing with the Flux model and T5-XXL in FP16 on a 4090 shows that keeping prompts relatively short (each clip under 80 tokens) and regions manageable helps avoid such issues.
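For a back-of-the-envelope sense of where these shapes come from, assuming one spatial token per 16x16 image pixels (Flux packs 2x2 latent patches; the exact packing is an assumption here):

```python
def attention_sequence_length(width: int, height: int, text_tokens: int) -> int:
    # One spatial token per 16x16 pixel patch is assumed here.
    spatial_tokens = (width // 16) * (height // 16)
    return text_tokens + spatial_tokens

# A 1024x1024 image with a 512-token text context:
n = attention_sequence_length(1024, 1024, 512)
print(f"attention bias shape: [1, heads, {n}, {n}]")  # 4096 + 512 = 4608
```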
GGUF & CivitAI fine-tuned models: The Flux Region Pipeline was tested with GGUF models without issues. Third-party CivitAI Copax Timeless XPlus 3 Flux models also worked without problems.
LoRA Support: LoRA is supported and will apply to all attention. At this stage, using different LoRAs for different regions is not possible. Research work is still ongoing.
ControlNet Support: Currently not tested. Research work is still ongoing.
Results: Example 1<br> 3 Region Split Blend using Advanced LLM: Base Conditioning (ignored) + 3 Regions
Example 2<br> Style manipulation: Base Conditioning + 1 Region
Example 3<br> Simple Splitting Contrast: Base Conditioning (ignored) + 2 Regions
Example 4<br> Simple Splitting Blend: Base Conditioning + 1 Region
Example 5<br> 3 Region Split Blend: Base Conditioning (ignored) + 3 Regions
Example 6<br> 3 Region Split Blend using Advanced LLM: Base Conditioning (ignored) + 3 Regions
Example 7<br> Color Manipulation: Base Conditioning (ignored) + 2 Regions
YouTube tutorial Flux Region Usage: <a href="https://youtu.be/kNwz6kJRDc0">Flux Region Spatial Control Tutorial</a>
We are a team of two and create extensive tutorials for ComfyUI. Check out our YouTube channel:</br> <a href="https://youtube.com/@controlaltai">ControlAltAI YouTube Tutorials</a>
Black Forest Labs, a pioneering AI research organization, has developed the Flux model series, which includes the FLUX.1 [dev] and FLUX.1 [schnell] models. These models are designed to push the boundaries of image generation through advanced deep-learning techniques.
For more details on these models, their capabilities, and licensing information, visit the <a href="https://blackforestlabs.ai/">Black Forest Labs website</a>.
Inspired by: <a href="https://github.com/attashe/ComfyUI-FluxRegionAttention">Flux Region Attention by Attashe</a>
This project is licensed under the MIT License.