    Pixel3DMM ComfyUI Nodes

    License: MIT · Python 3.9+ · PyTorch · ComfyUI

    Professional 3D face reconstruction for ComfyUI using the Pixel3DMM method

    Transform 2D face images into detailed 3D models with state-of-the-art neural networks, FLAME parametric models, and advanced optimization techniques.

    Pixel3DMM Pipeline

    🌟 Features

    • 🎭 Complete 3D Face Reconstruction: From single images to detailed 3D meshes
    • 🔥 FLAME Model Integration: Industry-standard parametric face model
    • 🗺️ UV Coordinate Prediction: High-quality texture mapping
    • 📏 Surface Normal Estimation: Detailed geometric surface information
    • ⚡ Real-time Optimization: Interactive parameter refinement
    • 📦 Multiple Export Formats: OBJ, PLY, STL mesh export
    • 🎛️ User-Friendly Interface: Intuitive ComfyUI integration

    📋 Table of Contents

    • 🚀 Installation
    • ⚡ Quick Start
    • 🎛️ Node Reference
    • 🔄 Workflow Examples
    • 🛠️ Troubleshooting
    • 🔧 Advanced Usage
    • 📚 Additional Resources
    • 🤝 Contributing
    • 📄 License
    • 🙏 Acknowledgments
    • 📊 Citation

    🚀 Installation

    Prerequisites

    • ComfyUI: Latest version installed and working
    • Python: 3.9 or higher
    • PyTorch: 2.0 or higher (CPU or CUDA); a quick environment check is shown after this list
    • System Memory: 8GB+ RAM recommended
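
    A quick way to confirm these prerequisites from the Python environment that ComfyUI runs in (standard PyTorch calls only):

    # Check the Python and PyTorch versions ComfyUI will use
    import sys
    import torch

    print("Python :", sys.version.split()[0])   # 3.9 or higher required
    print("PyTorch:", torch.__version__)        # 2.0 or higher required
    if torch.cuda.is_available():
        print("CUDA   :", torch.cuda.get_device_name(0))
    else:
        print("CUDA   : not available; nodes will run on CPU")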

    Method 1: ComfyUI Manager (Recommended)

    1. Open ComfyUI Manager
    2. Search for "Pixel3DMM"
    3. Click "Install"
    4. Restart ComfyUI

    Method 2: Manual Installation

    1. Clone the repository:

      cd ComfyUI/custom_nodes/
      git clone https://github.com/your-repo/comfyui-pixel3dmm.git
      
    2. Install dependencies:

      cd comfyui-pixel3dmm
      pip install -r requirements.txt
      
    3. Restart ComfyUI

    Method 3: Download and Extract

    1. Download the latest release from Releases
    2. Extract to ComfyUI/custom_nodes/comfyui-pixel3dmm/
    3. Install dependencies: pip install -r requirements.txt
    4. Restart ComfyUI

    ⚡ Quick Start

    Basic 3D Face Reconstruction

    1. Load an image using ComfyUI's Load Image node
    2. Add Pixel3DMM Loader node and configure model path
    3. Connect Face Reconstructor node to process the image
    4. Add Mesh Exporter to save your 3D model
    [Load Image] → [Pixel3DMM Loader] → [Face Reconstructor 3D] → [Mesh Exporter]
    

    Example Workflow

    {
      "workflow": "basic_reconstruction",
      "nodes": [
        {"type": "LoadImage", "inputs": {"image": "face_photo.jpg"}},
        {"type": "Pixel3DMMLoader", "inputs": {"model_path": "models/pixel3dmm.pth"}},
        {"type": "FaceReconstructor3D", "inputs": {"quality": "balanced"}},
        {"type": "MeshExporter", "inputs": {"format": "obj", "filename": "face_3d"}}
      ]
    }
    

    🎛️ Node Reference

    🔧 Pixel3DMM Loader

    Purpose: Load and initialize Pixel3DMM models

    Inputs:

    • model_path (STRING): Path to model file
    • device (CHOICE): auto/cpu/cuda
    • precision (CHOICE): fp32/fp16
    • config_override (STRING): JSON config overrides

    Outputs:

    • model (PIXEL3DMM_MODEL): Loaded model container
    • status (STRING): Loading status message

    Usage:

    # Basic usage
    model_path = "models/pixel3dmm_model.pth"
    device = "auto"  # Automatically detect GPU/CPU
    precision = "fp32"  # Use fp16 for faster inference on GPU
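
    The same settings map onto the programmatic API shown under Advanced Usage below. A minimal sketch, assuming load_model() also accepts the device and precision options as keyword arguments (only the positional model path is confirmed there):

    from comfyui_pixel3dmm import Pixel3DMMLoader

    loader = Pixel3DMMLoader()
    # Passing device/precision as keyword arguments is an assumption
    model, status = loader.load_model(
        "models/pixel3dmm_model.pth",
        device="auto",
        precision="fp32",
    )
    print(status)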
    

    🎭 Face Reconstructor 3D

    Purpose: Complete 3D face reconstruction from images

    Inputs:

    • model (PIXEL3DMM_MODEL): Loaded model from Pixel3DMM Loader
    • image (IMAGE): Input face image
    • reconstruction_quality (CHOICE): fast/balanced/high
    • optimize_flame (BOOLEAN): Enable parameter optimization
    • optimization_steps (INT): Number of optimization iterations
    • learning_rate (FLOAT): Optimization learning rate

    Outputs:

    • rendered_face (IMAGE): Rendered 3D face view
    • flame_parameters (FLAME_PARAMS): FLAME model parameters
    • mesh_data (MESH_DATA): 3D mesh data
    • status (STRING): Reconstruction status

    Quality Settings:

    • Fast: Quick reconstruction, lower quality
    • Balanced: Good quality-speed tradeoff (recommended)
    • High: Best quality, slower processing
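
    A minimal programmatic sketch of this node, building on the reconstruct_face() call shown under Advanced Usage; the keyword arguments and the output unpacking are assumptions based on the input/output lists above:

    from comfyui_pixel3dmm import Pixel3DMMLoader, FaceReconstructor3D

    loader = Pixel3DMMLoader()
    model, _ = loader.load_model("models/pixel3dmm_model.pth")
    image = ...  # a ComfyUI IMAGE tensor, e.g. from a Load Image node

    reconstructor = FaceReconstructor3D()
    # Keyword names mirror the node inputs above and are assumptions
    result = reconstructor.reconstruct_face(
        model,
        image,
        reconstruction_quality="balanced",
        optimize_flame=True,
        optimization_steps=100,
        learning_rate=0.01,
    )
    rendered_face, flame_parameters, mesh_data, status = result  # assumed output order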

    🗺️ UV Predictor

    Purpose: Predict UV coordinates for texture mapping

    Inputs:

    • model (PIXEL3DMM_MODEL): Loaded model
    • image (IMAGE): Input face image
    • output_resolution (CHOICE): 256/512/1024
    • uv_smoothing (FLOAT): Smoothing amount (0.0-1.0)
    • confidence_threshold (FLOAT): Confidence threshold (0.0-1.0)

    Outputs:

    • uv_map (IMAGE): UV coordinate visualization
    • uv_coordinates (UV_COORDS): Raw UV coordinate data
    • status (STRING): Prediction status
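
    A hypothetical sketch of driving this node from Python, continuing from the loader and reconstructor sketches above; the UVPredictor class and predict_uv() method names are assumptions for illustration:

    from comfyui_pixel3dmm import UVPredictor  # class name is an assumption

    uv_predictor = UVPredictor()
    # Method and keyword names mirror the node inputs above (assumed)
    uv_map, uv_coordinates, status = uv_predictor.predict_uv(
        model,
        image,
        output_resolution=512,
        uv_smoothing=0.1,
        confidence_threshold=0.5,
    )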

    📏 Normal Predictor

    Purpose: Predict surface normals for geometric detail

    Inputs:

    • model (PIXEL3DMM_MODEL): Loaded model
    • image (IMAGE): Input face image
    • output_resolution (CHOICE): 256/512/1024
    • normal_space (CHOICE): camera/world
    • normal_smoothing (FLOAT): Smoothing amount (0.0-1.0)
    • enhance_details (BOOLEAN): Enable detail enhancement

    Outputs:

    • normal_map (IMAGE): Normal map visualization
    • normal_vectors (NORMALS): Raw normal vector data
    • status (STRING): Prediction status
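
    A matching hypothetical sketch for this node; as above, the NormalPredictor class and predict_normals() method names are assumptions:

    from comfyui_pixel3dmm import NormalPredictor  # class name is an assumption

    normal_predictor = NormalPredictor()
    # Method and keyword names mirror the node inputs above (assumed)
    normal_map, normal_vectors, status = normal_predictor.predict_normals(
        model,
        image,
        output_resolution=512,
        normal_space="camera",
        normal_smoothing=0.1,
        enhance_details=True,
    )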

    🔥 FLAME Optimizer

    Purpose: Optimize FLAME parameters using geometric constraints

    Inputs:

    • model (PIXEL3DMM_MODEL): Loaded model
    • image (IMAGE): Input face image
    • initial_params (FLAME_PARAMS): Initial FLAME parameters
    • optimization_steps (INT): Number of optimization steps
    • uv_coordinates (UV_COORDS): Optional UV constraints
    • surface_normals (NORMALS): Optional normal constraints
    • learning_rate (FLOAT): Optimization learning rate
    • uv_weight (FLOAT): UV loss weight
    • normal_weight (FLOAT): Normal loss weight
    • regularization_weight (FLOAT): Regularization weight

    Outputs:

    • optimized_params (FLAME_PARAMS): Optimized parameters
    • mesh_data (MESH_DATA): Optimized mesh data
    • status (STRING): Optimization status
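
    A hypothetical sketch that combines the predictor outputs above as optimization constraints; the FLAMEOptimizer class, the optimize() method, and the example weight values are assumptions:

    from comfyui_pixel3dmm import FLAMEOptimizer  # class name is an assumption

    optimizer = FLAMEOptimizer()
    optimized_params, mesh_data, status = optimizer.optimize(  # method name assumed
        model,
        image,
        initial_params=flame_parameters,    # from Face Reconstructor 3D
        optimization_steps=200,
        uv_coordinates=uv_coordinates,      # optional constraint from UV Predictor
        surface_normals=normal_vectors,     # optional constraint from Normal Predictor
        learning_rate=0.01,
        uv_weight=1.0,
        normal_weight=1.0,
        regularization_weight=0.1,
    )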

    📦 Mesh Exporter

    Purpose: Export 3D meshes to various formats

    Inputs:

    • mesh_data (MESH_DATA): 3D mesh data to export
    • output_format (CHOICE): obj/ply/stl
    • filename (STRING): Output filename
    • output_directory (STRING): Output directory path
    • include_textures (BOOLEAN): Include texture coordinates
    • scale_factor (FLOAT): Mesh scaling factor
    • center_mesh (BOOLEAN): Center mesh at origin

    Outputs:

    • file_path (STRING): Path to exported file
    • status (STRING): Export status
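
    A hypothetical sketch of exporting the resulting mesh; the MeshExporter class and export_mesh() method names are assumptions:

    from comfyui_pixel3dmm import MeshExporter  # class name is an assumption

    exporter = MeshExporter()
    file_path, status = exporter.export_mesh(  # method name assumed
        mesh_data,                  # from Face Reconstructor 3D or FLAME Optimizer
        output_format="obj",
        filename="face_3d",
        output_directory="output/",
        include_textures=True,
        scale_factor=1.0,
        center_mesh=True,
    )
    print("Mesh written to", file_path)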

    🔄 Workflow Examples

    Example 1: Basic Reconstruction

    [Load Image] → [Pixel3DMM Loader] → [Face Reconstructor 3D] → [Mesh Exporter]
    

    Use Case: Quick 3D face model from a photo
    Quality: Balanced
    Time: ~30 seconds

    Example 2: High-Quality with Optimization

    [Load Image] → [Pixel3DMM Loader] → [Face Reconstructor 3D]
                                               ↓
    [UV Predictor] → [FLAME Optimizer] → [Mesh Exporter]
         ↑                ↑
    [Normal Predictor] ----
    

    Use Case: Professional-quality 3D reconstruction
    Quality: High
    Time: ~2-5 minutes

    Example 3: Batch Processing

    [Load Image Batch] → [Pixel3DMM Loader] → [Face Reconstructor 3D] → [Mesh Exporter Batch]
    

    Use Case: Process multiple faces
    Quality: Configurable
    Time: Varies by batch size

    🛠️ Troubleshooting

    Common Issues

    ❌ "Model file not found"

    Solution:

    1. Check model path is correct
    2. Download required model files
    3. Ensure models are in the correct directory

    ❌ "CUDA out of memory"

    Solutions:

    1. Switch to CPU: Set device to "cpu"
    2. Use FP16: Set precision to "fp16"
    3. Reduce image resolution
    4. Close other GPU applications

    ❌ "Import error: module not found"

    Solutions:

    1. Install dependencies: pip install -r requirements.txt
    2. Restart ComfyUI completely
    3. Check Python environment

    ❌ "Poor reconstruction quality"

    Solutions:

    1. Use higher quality settings
    2. Enable FLAME optimization
    3. Ensure good input image quality
    4. Check lighting and face visibility

    ❌ "Slow processing"

    Solutions:

    1. Use GPU if available
    2. Enable FP16 precision
    3. Use "fast" quality setting
    4. Reduce optimization steps

    Performance Tips

    • GPU Usage: Always use a GPU when available for a 10x+ speedup
    • Image Size: 512×512 is optimal; larger images do not significantly improve quality
    • Batch Size: Process multiple images together for efficiency
    • Memory: Close other applications to free up GPU memory

    Getting Help

    1. Check the logs: ComfyUI console shows detailed error messages
    2. GitHub Issues: Report bugs and request features
    3. Community: Join our Discord for support and discussions
    4. Documentation: Check our wiki for advanced tutorials

    🔧 Advanced Usage

    Custom Model Training

    # Train a custom UV predictor
    from pixel3dmm.training import UVTrainer

    # `config` is the training configuration expected by UVTrainer
    # (see the project's training documentation); shown here as a placeholder
    config = ...
    trainer = UVTrainer(config)
    trainer.train(dataset_path="path/to/uv_data")
    

    API Usage

    # Use the nodes programmatically
    from comfyui_pixel3dmm import Pixel3DMMLoader, FaceReconstructor3D

    loader = Pixel3DMMLoader()
    model, status = loader.load_model("models/pixel3dmm.pth")

    # `image` is a ComfyUI IMAGE tensor, e.g. produced by a Load Image node
    image = ...

    reconstructor = FaceReconstructor3D()
    result = reconstructor.reconstruct_face(model, image)
    

    Configuration

    Create config.json for custom settings:

    {
      "model": {
        "encoder_backbone": "vit_base_patch14_dinov2.lvd142m",
        "embedding_dim": 128,
        "flame_dim": 101
      },
      "optimization": {
        "max_steps": 200,
        "learning_rate": 0.01,
        "convergence_threshold": 1e-6
      }
    }
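
    One way to feed these settings into the Pixel3DMM Loader's config_override input from Python; passing the raw JSON string as a keyword argument to load_model() is an assumption:

    from comfyui_pixel3dmm import Pixel3DMMLoader

    with open("config.json") as f:
        overrides = f.read()  # config_override expects a JSON string (per the node reference)

    loader = Pixel3DMMLoader()
    # The config_override keyword argument is an assumption
    model, status = loader.load_model("models/pixel3dmm.pth", config_override=overrides)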
    

    📚 Additional Resources

    🀝 Contributing

    We welcome contributions! Please see CONTRIBUTING.md for guidelines.

    Development Setup

    git clone https://github.com/your-repo/comfyui-pixel3dmm.git
    cd comfyui-pixel3dmm
    pip install -e .
    pip install -r requirements-dev.txt
    

    Running Tests

    python -m pytest tests/
    python tests/test_pixel3dmm_nodes.py
    

    📄 License

    This project is licensed under the MIT License - see the LICENSE file for details.

    🙏 Acknowledgments

    • Pixel3DMM Research Team: Original research and methodology
    • FLAME Team: Parametric face model
    • ComfyUI Community: Framework and inspiration
    • PyTorch Team: Deep learning framework

    📊 Citation

    If you use this work in your research, please cite:

    @article{pixel3dmm2023,
      title={Pixel3DMM: Generating 3D Representations from Multi-view Images},
      author={Research Team},
      journal={arXiv preprint arXiv:2023.xxxxx},
      year={2023}
    }
    

    Made with ❤️ by the MCP Multi-Agent System

    For support, please open an issue on GitHub or join our Discord community.