ComfyUI Extension: Head-Orientation-Node - by PabloGFX

Authored by lazniak

Created 12 months ago

Updated 11 months ago

10 stars

A custom node for ComfyUI that analyzes and sorts images based on head orientation using MediaPipe. It detects facial landmarks, calculates head pose, and intelligently sorts images for enhanced AI image processing workflows.

Custom Nodes (1)

Head Orientation Node - by PabloGFX

README

Head Orientation Node Update version = "1.0.11"

New Data Output Format

The Head Orientation Node now provides a string output named "data" alongside the sorted images. This output contains information about the head orientation for each processed image.

Format Details:

Each line in the output represents one image's head orientation.
The format for each line is [x,y,z], where:
- x: Rotation around the X-axis (nodding up and down)
- y: Rotation around the Y-axis (turning left and right)
- z: Rotation around the Z-axis (tilting head side to side)
All values are in degrees and rounded to two decimal places.
Each orientation is on a new line.

Example Output:

Head Orientation Node for ComfyUI

Description

The Head Orientation Node is a custom node for ComfyUI that analyzes and sorts images based on head orientation. It uses the MediaPipe library to detect facial landmarks and calculate head pose, allowing for intelligent image sorting and matching.

Developed by PabloGFX, this node enhances ComfyUI's capabilities in facial analysis and image processing tasks.

Features

Detects facial landmarks using MediaPipe
Calculates head orientation (pitch, yaw, roll) for input images
Sorts input images based on similarity to reference images' head orientations
Supports batch processing of multiple images

Installation

Ensure you have ComfyUI installed and set up.

Clone this repository into your ComfyUI custom nodes directory:

git clone https://github.com/YourGitHubUsername/head-orientation-node.git

Install the required dependencies:
```
pip install -r requirements.txt
```

Requirements

numpy>=1.19.3
opencv-python>=4.5.5.64
mediapipe>=0.8.9.1
Pillow>=8.3.1
torch>=1.9.0

Usage

In ComfyUI, you'll find the node listed as "Head Orientation Node - by PabloGFX" in the node browser.
Connect an image or batch of images to the "image" input.
Connect a set of reference images to the "reference_images" input.
The node will output a sorted batch of images based on head orientation similarity to the reference images.

How it Works

The node analyzes both input and reference images using MediaPipe's face mesh detection.
It calculates the head orientation (pitch, yaw, roll) for each detected face.
Input images are then sorted to best match the orientations of the reference images.
If there are fewer input images than reference images, the last input image is repeated to match the count.

License

This project is licensed under the Apache License 2.0. This license is compatible with the licenses of the major dependencies used in this project:

MediaPipe: Apache License 2.0
OpenCV: Apache License 2.0
NumPy: BSD 3-Clause License
PyTorch: BSD 3-Clause License
Pillow: HPND License

The Apache License 2.0 allows you to use, modify, distribute, and sublicense the code, while also providing an express grant of patent rights from contributors to users. It requires preservation of copyright and license notices.

For the full license text, please see the LICENSE file in the project repository or visit: https://www.apache.org/licenses/LICENSE-2.0

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. By contributing to this project, you agree to license your contributions under the Apache License 2.0.

Acknowledgements

Special thanks to the developers of MediaPipe, OpenCV, and other open-source libraries that made this project possible.