Neonatal Face and Facial Landmark Detection from Video Recordings
- URL: http://arxiv.org/abs/2302.04341v1
- Date: Wed, 8 Feb 2023 21:18:18 GMT
- Title: Neonatal Face and Facial Landmark Detection from Video Recordings
- Authors: Ethan Grooby, Chiranjibi Sitaula, Soodeh Ahani, Liisa Holsti, Atul
Malhotra, Guy A. Dumont, Faezeh Marzbanrad
- Abstract summary: This paper explores automated face and facial landmark detection of neonates.
It is an important first step in many video-based neonatal health applications, such as vital sign estimation, pain assessment, sleep-wake classification, and jaundice detection.
- Score: 2.6091909702028584
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper explores automated face and facial landmark detection of neonates,
which is an important first step in many video-based neonatal health
applications, such as vital sign estimation, pain assessment, sleep-wake
classification, and jaundice detection. Utilising three publicly available
datasets of neonates in the clinical environment, 366 images (258 subjects) and
89 (66 subjects) were annotated for training and testing, respectively.
Transfer learning was applied to two YOLO-based models, with input training
images augmented with random horizontal flipping, photo-metric colour
distortion, translation and scaling during each training epoch. Additionally,
the re-orientation of input images and fusion of trained deep learning models
was explored. Our proposed model based on YOLOv7Face outperformed existing
methods with a mean average precision of 84.8% for face detection, and a
normalised mean error of 0.072 for facial landmark detection. Overall, this
will assist in the development of fully automated neonatal health assessment
algorithms.
Related papers
- Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection.
Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels.
Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z) - Towards early prediction of neurodevelopmental disorders: Computational
model for Face Touch and Self-adaptors in Infants [0.0]
evaluating a baby's movements is key to understanding possible risks of developmental disorders in their growth.
Previous research in psychology has shown that measuring specific movements or gestures such as face touches in babies is essential to analyse how babies understand themselves and their context.
This research proposes the first automatic approach that detects face touches from video recordings by tracking infants' movements and gestures.
arXiv Detail & Related papers (2023-01-07T18:08:43Z) - Neurodevelopmental Phenotype Prediction: A State-of-the-Art Deep
Learning Model [0.0]
We apply a deep neural network to analyse the cortical surface data of neonates.
Our goal is to identify neurodevelopmental biomarkers and to predict gestational age at birth based on these biomarkers.
arXiv Detail & Related papers (2022-11-16T11:15:23Z) - Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic
Disorders [55.41644538483948]
Automated classification and similarity retrieval aid physicians in decision-making to diagnose possible genetic conditions as early as possible.
Previous work has addressed the problem as a classification problem and used deep learning methods.
In this study, we used a facial recognition model trained on a large corpus of healthy individuals as a pre-task and transferred it to facial phenotype recognition.
arXiv Detail & Related papers (2022-10-23T11:52:57Z) - Robust Neonatal Face Detection in Real-world Clinical Settings [4.263900348596098]
Current face detection algorithms are extremely generalized and can obtain decent accuracy when detecting the adult faces.
By training a state-of-the-art face detection model, You-Only-Look-Once, on a proprietary dataset containing labelled neonate faces in a clinical setting, this work achieves near real time neonate face detection.
Preliminary findings show an accuracy of 68.7%, compared to the off the shelf solution which detected neonate faces with an accuracy of 7.37%.
arXiv Detail & Related papers (2022-04-01T18:50:47Z) - FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in
the Wild [50.8865921538953]
We propose a method to explicitly incorporate facial semantics into age estimation.
We design a face parsing-based network to learn semantic information at different scales.
We show that our method consistently outperforms all existing age estimation methods.
arXiv Detail & Related papers (2021-06-21T14:31:32Z) - Wide & Deep neural network model for patch aggregation in CNN-based
prostate cancer detection systems [51.19354417900591]
Prostate cancer (PCa) is one of the leading causes of death among men, with almost 1.41 million new cases and around 375,000 deaths in 2020.
To perform an automatic diagnosis, prostate tissue samples are first digitized into gigapixel-resolution whole-slide images.
Small subimages called patches are extracted and predicted, obtaining a patch-level classification.
arXiv Detail & Related papers (2021-05-20T18:13:58Z) - Deep learning in the ultrasound evaluation of neonatal respiratory
status [11.308283140003676]
Lung ultrasound imaging is reaching growing interest from the scientific community.
Image analysis and pattern recognition approaches have proven their ability to fully exploit the rich information contained in these data.
We present a thorough analysis of recent deep learning networks and training strategies carried out on a vast and challenging multicenter dataset.
arXiv Detail & Related papers (2020-10-31T18:57:55Z) - Human Recognition Using Face in Computed Tomography [26.435782518817295]
We propose an automatic processing pipeline that first detects facial landmarks in 3D for ROI extraction and then generates aligned 2D depth images, which are used for automatic recognition.
Our method achieves a 1:56 identification accuracy of 92.53% and a 1:1 verification accuracy of 96.12%, outperforming other competing approaches.
arXiv Detail & Related papers (2020-05-28T18:59:59Z) - Y-Net for Chest X-Ray Preprocessing: Simultaneous Classification of
Geometry and Segmentation of Annotations [70.0118756144807]
This work introduces a general pre-processing step for chest x-ray input into machine learning algorithms.
A modified Y-Net architecture based on the VGG11 encoder is used to simultaneously learn geometric orientation and segmentation of radiographs.
Results were evaluated by expert clinicians, with acceptable geometry in 95.8% and annotation mask in 96.2%, compared to 27.0% and 34.9% respectively in control images.
arXiv Detail & Related papers (2020-05-08T02:16:17Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.