ComfyUI Extension: Q Find Mask Size

Authored by angree

Created 2 months ago

Updated 2 months ago

0 stars

ComfyUI custom node for finding and analyzing mask sizes in images

Custom Nodes (0)

README

ComfyUI-Q_Image_Crop_Calculator

A ComfyUI custom node for intelligent cropping optimization, specifically designed for maximizing texture quality in Hunyuan 3D 2.0 workflows. This node automatically calculates the optimal crop area based on mask data to eliminate wasted space and ensure maximum detail utilization within the 518x518 input constraint.

Features

Automatically calculates optimal crop dimensions based on mask data from background removal
Maximizes space utilization for Hunyuan 3D 2.0's 518x518 input requirement
Eliminates wasted transparent/empty space around objects
Ensures better model accuracy and texture detail by using every available pixel
Configurable padding, step size, and minimum dimensions
Returns precise crop coordinates for seamless integration with image processing workflows
Square crop output optimized for 3D model generation

Installation

Clone this repository into your ComfyUI's custom_nodes directory:

cd ComfyUI/custom_nodes
git clone https://github.com/YOUR_USERNAME/ComfyUI-Q_Image_Crop_Calculator.git

Install required dependencies:

pip install -r requirements.txt

or install manually:

pip install numpy>=1.24.0 pillow>=10.0.0

Note: This node requires PyTorch, but since Hunyuan 3D 2.0 workflows require CUDA-enabled PyTorch, you should install PyTorch with CUDA support separately according to your system's CUDA version. Visit PyTorch's official installation guide for CUDA-specific installation instructions.

Restart ComfyUI

Dependencies

This module requires the following Python libraries:

torch: For tensor operations and ComfyUI compatibility (install CUDA version separately - see installation notes)
numpy: For efficient array processing and mask analysis (>=1.24.0)
pillow: For image processing operations (>=10.0.0)

A requirements.txt file is included for easy installation of numpy and pillow dependencies.

Usage with Hunyuan 3D 2.0

This node is specifically designed to work in Hunyuan 3D 2.0 workflows where input image quality directly impacts model accuracy:

Q Image Crop Calculator

The node provides intelligent crop calculation with the following parameters:

mask: Input mask from background removal (typically from InstantlyRemoveBackground or similar nodes)
step_size: Alignment step for crop dimensions (default: 32, range: 8-128)
padding: Additional space around detected object (default: 64, range: 0-256)
min_size: Minimum crop dimensions (default: 256, range: 128-1024)

Returns:

width: Calculated optimal crop width
height: Calculated optimal crop height
x_offset: Horizontal position for crop start
y_offset: Vertical position for crop start

Hunyuan 3D 2.0 Workflow Integration

Maximizing 518x518 Input Quality

Hunyuan 3D 2.0 (non-mini, non-turbo version) accepts 518x518 pixel images. Every pixel matters for model accuracy:

Background Removal: Use InstantlyRemoveBackground or similar node to isolate your subject
Mask Analysis: Connect the mask output to Q Image Crop Calculator
Optimal Cropping: The node calculates the tightest possible crop that contains your subject
Space Maximization: Eliminates wasted transparent areas, ensuring maximum detail density
3D Generation: Feed the optimally cropped image to Hunyuan 3D 2.0 for superior results

Workflow Connection Pattern

[Original Image] → [Background Remover] → [Mask Output] → [Q Image Crop Calculator]
                                      ↓
[Cropped Image] ← [Image Crop] ← [Crop Coordinates]
                        ↓
[Resize to 518x518] → [Hunyuan 3D 2.0]

Benefits for 3D Model Generation

Higher Detail Density: More pixels dedicated to the actual subject
Better Texture Resolution: Fine details are preserved at higher resolution
Improved Model Accuracy: Hunyuan 3D 2.0 has more detail to work with
Reduced Artifacts: Less empty space means fewer generation artifacts
Consistent Results: Automated cropping ensures reproducible workflows

Technical Details

Intelligent Crop Calculation

The node analyzes the input mask to:

Detect Object Boundaries: Finds the minimal bounding box containing all non-transparent pixels
Apply Smart Padding: Adds configurable padding while respecting image boundaries
Ensure Square Output: Creates square crops ideal for 3D model generation
Optimize Dimensions: Rounds to specified step sizes for consistent processing
Handle Edge Cases: Gracefully handles empty masks or oversized objects

Performance Optimizations

Efficient numpy-based mask analysis
Minimal memory footprint
Fast coordinate calculation
Compatible with ComfyUI's tensor format

Example Use Cases

Portrait to 3D Model

Remove background from portrait photo
Calculate optimal crop to focus on subject
Maximize facial detail in 518x518 constraint
Generate high-quality 3D head model

Product Photography

Isolate product from background
Eliminate wasted space around product
Ensure maximum product detail in final 3D model
Ideal for e-commerce 3D model generation

Character Design

Crop character artwork optimally
Preserve important design elements
Maximize character detail for 3D conversion
Perfect for game asset generation

License

MIT License