Estimation of BMI from Facial Images using Semantic Segmentation based
Region-Aware Pooling
- URL: http://arxiv.org/abs/2104.04733v1
- Date: Sat, 10 Apr 2021 10:53:21 GMT
- Title: Estimation of BMI from Facial Images using Semantic Segmentation based
Region-Aware Pooling
- Authors: Nadeem Yousaf, Sarfaraz Hussein, Waqas Sultani
- Abstract summary: Body-Mass-Index (BMI) conveys important information about one's life such as health and socio-economic conditions.
Recent works have employed hand-crafted geometrical face features or face-level deep convolutional neural network features for face to BMI prediction.
We propose to use deep features that are pooled from different face regions.
- Score: 3.889462292853575
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Body-Mass-Index (BMI) conveys important information about one's life such as
health and socio-economic conditions. Large-scale automatic estimation of BMIs
can help predict several societal behaviors such as health, job opportunities,
friendships, and popularity. The recent works have either employed hand-crafted
geometrical face features or face-level deep convolutional neural network
features for face to BMI prediction. The hand-crafted geometrical face feature
lack generalizability and face-level deep features don't have detailed local
information. Although useful, these methods missed the detailed local
information which is essential for exact BMI prediction. In this paper, we
propose to use deep features that are pooled from different face regions (eye,
nose, eyebrow, lips, etc.,) and demonstrate that this explicit pooling from
face regions can significantly boost the performance of BMI prediction. To
address the problem of accurate and pixel-level face regions localization, we
propose to use face semantic segmentation in our framework. Extensive
experiments are performed using different Convolutional Neural Network (CNN)
backbones including FaceNet and VGG-face on three publicly available datasets:
VisualBMI, Bollywood and VIP attributes. Experimental results demonstrate that,
as compared to the recent works, the proposed Reg-GAP gives a percentage
improvement of 22.4\% on VIP-attribute, 3.3\% on VisualBMI, and 63.09\% on the
Bollywood dataset.
Related papers
- BMI Prediction from Handwritten English Characters Using a Convolutional Neural Network [0.0]
No previous research established a clear connection between deep learning techniques for handwriting analysis and BMI prediction.
This article addresses this research gap with a deep learning approach to estimating BMI from handwritten characters by developing a convolutional neural network (CNN)
A dataset containing samples from 48 people in lowercase English scripts is successfully captured for the BMI prediction task.
arXiv Detail & Related papers (2024-09-04T10:06:42Z) - Multi-Branch Deep Radial Basis Function Networks for Facial Emotion
Recognition [80.35852245488043]
We propose a CNN based architecture enhanced with multiple branches formed by radial basis function (RBF) units.
RBF units capture local patterns shared by similar instances using an intermediate representation.
We show it is the incorporation of local information what makes the proposed model competitive.
arXiv Detail & Related papers (2021-09-07T21:05:56Z) - Facial expression and attributes recognition based on multi-task
learning of lightweight neural networks [9.162936410696409]
We examine the multi-task training of lightweight convolutional neural networks for face identification and classification of facial attributes.
It is shown that it is still necessary to fine-tune these networks in order to predict facial expressions.
Several models are presented based on MobileNet, EfficientNet and RexNet architectures.
arXiv Detail & Related papers (2021-03-31T14:21:04Z) - An Efficient Multitask Neural Network for Face Alignment, Head Pose
Estimation and Face Tracking [9.39854778804018]
We propose an efficient multitask face alignment, face tracking and head pose estimation network (ATPN)
ATPN achieves improved performance compared to previous state-of-the-art methods while having less number of parameters and FLOPS.
arXiv Detail & Related papers (2021-03-13T04:41:15Z) - Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture
Recognition [9.131161856493486]
We propose a novel end-to-end textbfRegional Attention Network (RAN), which is a fully Convolutional Neural Network (CNN)
Our regions consist of one or more consecutive cells and are adapted from the strategies used in computing HOG (Histogram of Oriented Gradient) descriptor.
The proposed approach outperforms the state-of-the-art by a considerable margin in different metrics.
arXiv Detail & Related papers (2021-01-17T10:14:28Z) - PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image
Segmentation [87.50205728818601]
We propose a PriorGuided Local (PGL) self-supervised model that learns the region-wise local consistency in the latent feature space.
Our PGL model learns the distinctive representations of local regions, and hence is able to retain structural information.
arXiv Detail & Related papers (2020-11-25T11:03:11Z) - AI-based BMI Inference from Facial Images: An Application to Weight
Monitoring [3.4601380631551146]
We evaluate and compare the performance of five different CNN architectures for BMI inference from facial images.
Experimental results suggest the efficacy of the deep learning methods in BMI inference from face images with minimum Mean Absolute Error (MAE) of $1.04$ obtained using ResNet50.
arXiv Detail & Related papers (2020-10-15T00:00:40Z) - Semi-Siamese Training for Shallow Face Learning [78.7386209619276]
We introduce a novel training method named Semi-Siamese Training (SST)
A pair of Semi-Siamese networks constitute the forward propagation structure, and the training loss is computed with an updating gallery queue.
Our method is developed without extra-dependency, thus can be flexibly integrated with the existing loss functions and network architectures.
arXiv Detail & Related papers (2020-07-16T15:20:04Z) - A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation [50.261472059743845]
Action Units (AUs) are geometrically-based atomic facial muscle movements.
We propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity.
A Heatmap models whether an AU occurs or not at a given spatial location.
arXiv Detail & Related papers (2020-04-14T16:51:13Z) - DotFAN: A Domain-transferred Face Augmentation Network for Pose and
Illumination Invariant Face Recognition [94.96686189033869]
We propose a 3D model-assisted domain-transferred face augmentation network (DotFAN)
DotFAN can generate a series of variants of an input face based on the knowledge distilled from existing rich face datasets collected from other domains.
Experiments show that DotFAN is beneficial for augmenting small face datasets to improve their within-class diversity.
arXiv Detail & Related papers (2020-02-23T08:16:34Z) - It's Written All Over Your Face: Full-Face Appearance-Based Gaze
Estimation [82.16380486281108]
We propose an appearance-based method that only takes the full face image as input.
Our method encodes the face image using a convolutional neural network with spatial weights applied on the feature maps.
We show that our full-face method significantly outperforms the state of the art for both 2D and 3D gaze estimation.
arXiv Detail & Related papers (2016-11-27T15:00:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.