Unsupervised Anomaly Appraisal of Cleft Faces Using a StyleGAN2-based
Model Adaptation Technique
- URL: http://arxiv.org/abs/2211.06659v1
- Date: Sat, 12 Nov 2022 13:30:20 GMT
- Title: Unsupervised Anomaly Appraisal of Cleft Faces Using a StyleGAN2-based
Model Adaptation Technique
- Authors: Abdullah Hayajneh, Mohammad Shaqfeh, Erchin Serpedin, Mitchell A.
Stotland
- Abstract summary: This paper presents a novel machine learning framework to consistently detect, localize and rate congenital cleft lip anomalies in human faces.
The proposed method employs the StyleGAN2 generative adversarial network with model adaptation to produce normalized transformations of cleft-affected faces.
The anomaly scores yielded by the proposed computer model correlate closely with the human ratings of facial differences, leading to 0.942 Pearson's r score.
- Score: 5.224306534441244
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper presents a novel machine learning framework to consistently
detect, localize and rate congenital cleft lip anomalies in human faces. The
goal is to provide a universal, objective measure of facial differences and
reconstructive surgical outcomes that matches human judgments. The proposed
method employs the StyleGAN2 generative adversarial network with model
adaptation to produce normalized transformations of cleft-affected faces in
order to allow for subsequent measurement of deformity using a pixel-wise
subtraction approach. The complete pipeline of the proposed framework consists
of the following steps: image preprocessing, face normalization, color
transformation, morphological erosion, heat-map generation and abnormality
scoring. Heatmaps that finely discern anatomic anomalies are proposed by
exploiting the features of the considered framework. The proposed framework is
validated through computer simulations and surveys containing human ratings.
The anomaly scores yielded by the proposed computer model correlate closely
with the human ratings of facial differences, leading to 0.942 Pearson's r
score.
Related papers
- Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection.
Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels.
Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z) - Forgery-aware Adaptive Transformer for Generalizable Synthetic Image
Detection [106.39544368711427]
We study the problem of generalizable synthetic image detection, aiming to detect forgery images from diverse generative methods.
We present a novel forgery-aware adaptive transformer approach, namely FatFormer.
Our approach tuned on 4-class ProGAN data attains an average of 98% accuracy to unseen GANs, and surprisingly generalizes to unseen diffusion models with 95% accuracy.
arXiv Detail & Related papers (2023-12-27T17:36:32Z) - Heterogeneous Face Frontalization via Domain Agnostic Learning [74.86585699909459]
We propose a domain agnostic learning-based generative adversarial network (DAL-GAN) which can synthesize frontal views in the visible domain from thermal faces with pose variations.
DAL-GAN consists of a generator with an auxiliary classifier and two discriminators which capture both local and global texture discriminations for better synthesis.
arXiv Detail & Related papers (2021-07-17T20:41:41Z) - Bridge the Gap Between Model-based and Model-free Human Reconstruction [10.818838437018682]
We present an end-to-end neural network that simultaneously predicts the pixel-aligned implicit surface and the explicit mesh model built by graph convolutional neural network.
Experiments on DeepHuman dataset showed that our approach is effective.
arXiv Detail & Related papers (2021-06-11T11:13:42Z) - High Resolution Face Editing with Masked GAN Latent Code Optimization [0.0]
Face editing is a popular research topic in the computer vision community.
Recent proposed methods are based on either training a conditional encoder-decoder Generative Adversarial Network (GAN) in an end-to-end fashion or on defining an operation in the latent space of a pre-trained vanilla GAN generator model.
We propose a GAN embedding optimization procedure with spatial and semantic constraints.
arXiv Detail & Related papers (2021-03-20T08:39:41Z) - Bilateral Asymmetry Guided Counterfactual Generating Network for
Mammogram Classification [48.4619620405991]
Mammogram benign or malignant classification with only image-level labels is challenging due to the absence of lesion annotations.
Motivated by the symmetric prior, we can explore a counterfactual problem that how would the features have behaved if there were no lesions in the image.
We derive a new theoretical result for counterfactual generation based on the symmetric prior.
arXiv Detail & Related papers (2020-09-30T03:15:30Z) - Deep Variational Network Toward Blind Image Restoration [60.45350399661175]
Blind image restoration is a common yet challenging problem in computer vision.
We propose a novel blind image restoration method, aiming to integrate both the advantages of them.
Experiments on two typical blind IR tasks, namely image denoising and super-resolution, demonstrate that the proposed method achieves superior performance over current state-of-the-arts.
arXiv Detail & Related papers (2020-08-25T03:30:53Z) - Appearance Consensus Driven Self-Supervised Human Mesh Recovery [67.20942777949793]
We present a self-supervised human mesh recovery framework to infer human pose and shape from monocular images.
We achieve state-of-the-art results on the standard model-based 3D pose estimation benchmarks.
The resulting colored mesh prediction opens up the usage of our framework for a variety of appearance-related tasks beyond the pose and shape estimation.
arXiv Detail & Related papers (2020-08-04T05:40:39Z) - Validation and generalization of pixel-wise relevance in convolutional
neural networks trained for face classification [0.0]
We show how relevance measures vary with and generalize across key model parameters.
Using relevance-based image masking, we find that relevance maps for face classification prove generally stable.
Fine-grained analyses of relevance maps across models revealed asymmetries in generalization that point to specific benefits of choice parameters.
arXiv Detail & Related papers (2020-06-16T23:20:40Z) - Detection of Pitt-Hopkins Syndrome based on morphological facial
features [0.0]
This work describes an automatic methodology to discriminate between individuals with Pitt-Hopkins syndrome (PTHS) and healthy individuals.
The methodology was tested on 71 individuals with PTHS and 55 healthy controls.
arXiv Detail & Related papers (2020-03-18T14:01:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.