Facial Anatomical Landmark Detection using Regularized Transfer Learning
with Application to Fetal Alcohol Syndrome Recognition
- URL: http://arxiv.org/abs/2109.05485v1
- Date: Sun, 12 Sep 2021 11:05:06 GMT
- Title: Facial Anatomical Landmark Detection using Regularized Transfer Learning
with Application to Fetal Alcohol Syndrome Recognition
- Authors: Zeyu Fu, Jianbo Jiao, Michael Suttie and J. Alison Noble
- Abstract summary: Fetal alcohol syndrome (FAS) caused by prenatal alcohol exposure can result in a series of cranio-facial anomalies.
Anatomical landmark detection is important to detect the presence of FAS associated facial anomalies.
Current deep learning-based heatmap regression methods designed for facial landmark detection in natural images assume availability of large datasets.
We develop a new regularized transfer learning approach that exploits the knowledge of a network learned on large facial recognition datasets.
- Score: 24.27777060287004
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Fetal alcohol syndrome (FAS) caused by prenatal alcohol exposure can result
in a series of cranio-facial anomalies, and behavioral and neurocognitive
problems. Current diagnosis of FAS is typically done by identifying a set of
facial characteristics, which are often obtained by manual examination.
Anatomical landmark detection, which provides rich geometric information, is
important to detect the presence of FAS associated facial anomalies. This
imaging application is characterized by large variations in data appearance and
limited availability of labeled data. Current deep learning-based heatmap
regression methods designed for facial landmark detection in natural images
assume availability of large datasets and are therefore not wellsuited for this
application. To address this restriction, we develop a new regularized transfer
learning approach that exploits the knowledge of a network learned on large
facial recognition datasets. In contrast to standard transfer learning which
focuses on adjusting the pre-trained weights, the proposed learning approach
regularizes the model behavior. It explicitly reuses the rich visual semantics
of a domain-similar source model on the target task data as an additional
supervisory signal for regularizing landmark detection optimization.
Specifically, we develop four regularization constraints for the proposed
transfer learning, including constraining the feature outputs from
classification and intermediate layers, as well as matching activation
attention maps in both spatial and channel levels. Experimental evaluation on a
collected clinical imaging dataset demonstrate that the proposed approach can
effectively improve model generalizability under limited training samples, and
is advantageous to other approaches in the literature.
Related papers
- Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images [0.8793721044482612]
This study introduces a novel application of denoising diffusion probabilistic models (DDPMs) to the landmark detection task.
Our key innovation lies in leveraging DDPMs for self-supervised pre-training in landmark detection.
This method enables accurate landmark detection with minimal annotated training data.
arXiv Detail & Related papers (2024-07-25T15:32:59Z) - Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image [63.59114880750643]
We introduce a novel Spatial-aware Attention Generative Adrialversa Network (SAGAN) for one-class semi-supervised generation of health images.
SAGAN generates high-quality health images corresponding to unlabeled data, guided by the reconstruction of normal images and restoration of pseudo-anomaly images.
Extensive experiments on three medical datasets demonstrate that the proposed SAGAN outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2024-05-21T15:41:34Z) - Enhancing AI Diagnostics: Autonomous Lesion Masking via Semi-Supervised Deep Learning [1.4053129774629076]
This study presents an unsupervised domain adaptation method aimed at autonomously generating image masks outlining regions of interest (ROIs) for differentiating breast lesions in breast ultrasound (US) imaging.
Our semi-supervised learning approach utilizes a primitive model trained on a small public breast US dataset with true annotations.
This model is then iteratively refined for the domain adaptation task, generating pseudo-masks for our private, unannotated breast US dataset.
arXiv Detail & Related papers (2024-04-18T18:25:00Z) - Genetic InfoMax: Exploring Mutual Information Maximization in
High-Dimensional Imaging Genetics Studies [50.11449968854487]
Genome-wide association studies (GWAS) are used to identify relationships between genetic variations and specific traits.
Representation learning for imaging genetics is largely under-explored due to the unique challenges posed by GWAS.
We introduce a trans-modal learning framework Genetic InfoMax (GIM) to address the specific challenges of GWAS.
arXiv Detail & Related papers (2023-09-26T03:59:21Z) - Leveraging Semi-Supervised Graph Learning for Enhanced Diabetic
Retinopathy Detection [0.0]
Diabetic Retinopathy (DR) is a significant cause of blindness globally, highlighting the urgent need for early detection and effective treatment.
Recent advancements in Machine Learning (ML) techniques have shown promise in DR detection, but the availability of labeled data often limits their performance.
This research proposes a novel Semi-Supervised Graph Learning SSGL algorithm tailored for DR detection.
arXiv Detail & Related papers (2023-09-02T04:42:08Z) - Towards Intrinsic Common Discriminative Features Learning for Face
Forgery Detection using Adversarial Learning [59.548960057358435]
We propose a novel method which utilizes adversarial learning to eliminate the negative effect of different forgery methods and facial identities.
Our face forgery detection model learns to extract common discriminative features through eliminating the effect of forgery methods and facial identities.
arXiv Detail & Related papers (2022-07-08T09:23:59Z) - Generative Residual Attention Network for Disease Detection [51.60842580044539]
We present a novel approach for disease generation in X-rays using a conditional generative adversarial learning.
We generate a corresponding radiology image in a target domain while preserving the identity of the patient.
We then use the generated X-ray image in the target domain to augment our training to improve the detection performance.
arXiv Detail & Related papers (2021-10-25T14:15:57Z) - Assessing glaucoma in retinal fundus photographs using Deep Feature
Consistent Variational Autoencoders [63.391402501241195]
glaucoma is challenging to detect since it remains asymptomatic until the symptoms are severe.
Early identification of glaucoma is generally made based on functional, structural, and clinical assessments.
Deep learning methods have partially solved this dilemma by bypassing the marker identification stage and analyzing high-level information directly to classify the data.
arXiv Detail & Related papers (2021-10-04T16:06:49Z) - Margin-Aware Intra-Class Novelty Identification for Medical Images [2.647674705784439]
We propose a hybrid model - Transformation-based Embedding learning for Novelty Detection (TEND)
With a pre-trained autoencoder as image feature extractor, TEND learns to discriminate the feature embeddings of in-distribution data from the transformed counterparts as fake out-of-distribution inputs.
arXiv Detail & Related papers (2021-07-31T00:10:26Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Cross-Task Representation Learning for Anatomical Landmark Detection [20.079451546446712]
We propose to regularize the knowledge transfer across source and target tasks through cross-task representation learning.
The proposed method is demonstrated for extracting facial anatomical landmarks which facilitate the diagnosis of fetal alcohol syndrome.
We present two approaches for the proposed representation learning by constraining either final or intermediate model features on the target model.
arXiv Detail & Related papers (2020-09-28T21:22:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.