Facial Age Estimation using Convolutional Neural Networks
- URL: http://arxiv.org/abs/2105.06746v1
- Date: Fri, 14 May 2021 10:09:47 GMT
- Title: Facial Age Estimation using Convolutional Neural Networks
- Authors: Adrian Kj{\ae}rran and Christian Bakke Venner{\o}d and Erling Stray
Bugge
- Abstract summary: This paper is a part of a student project in Machine Learning at the Norwegian University of Science and Technology.
A deep convolutional neural network with five convolutional layers and three fully-connected layers is presented to estimate the ages of individuals based on images.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper is a part of a student project in Machine Learning at the
Norwegian University of Science and Technology. In this paper, a deep
convolutional neural network with five convolutional layers and three
fully-connected layers is presented to estimate the ages of individuals based
on images. The model is in its entirety trained from scratch, where a
combination of three different datasets is used as training data. These
datasets are the APPA dataset, UTK dataset, and the IMDB dataset. The images
were preprocessed using a proprietary face-recognition software. Our model is
evaluated on both a held-out test set, and on the Adience benchmark. On the
test set, our model achieves a categorical accuracy of 52%. On the Adience
benchmark, our model proves inferior compared with other leading models, with
an exact accuray of 30%, and an one-off accuracy of 46%. Furthermore, a script
was created, allowing users to estimate their age directly using their web
camera. The script, alongside all other code, is located in our GitHub
repository: AgeNet.
Related papers
- LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content [62.816876067499415]
We propose LiveXiv: a scalable evolving live benchmark based on scientific ArXiv papers.
LiveXiv accesses domain-specific manuscripts at any given timestamp and proposes to automatically generate visual question-answer pairs.
We benchmark multiple open and proprietary Large Multi-modal Models (LMMs) on the first version of our benchmark, showing its challenging nature and exposing the models true abilities.
arXiv Detail & Related papers (2024-10-14T17:51:23Z) - Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA
Cats and Dogs Dataset [0.0]
The ASSIRA Cats & Dogs dataset is being used in this research for its overall acceptance and benchmark standards.
A comparison of various pre-trained models is demonstrated by using different types of benchmarkings and loss functions.
From this experiment, the highest accuracy which is 99.65% is gained using the NASNet Large.
arXiv Detail & Related papers (2024-01-09T16:48:11Z) - Uncertainty in AI: Evaluating Deep Neural Networks on
Out-of-Distribution Images [0.0]
This paper investigates the uncertainty of various deep neural networks, including ResNet-50, VGG16, DenseNet121, AlexNet, and GoogleNet, when dealing with perturbed data.
While ResNet-50 was the most accurate single model for OOD images, the ensemble performed even better, correctly classifying all images.
arXiv Detail & Related papers (2023-09-04T22:46:59Z) - Effective Robustness against Natural Distribution Shifts for Models with
Different Training Data [113.21868839569]
"Effective robustness" measures the extra out-of-distribution robustness beyond what can be predicted from the in-distribution (ID) performance.
We propose a new evaluation metric to evaluate and compare the effective robustness of models trained on different data.
arXiv Detail & Related papers (2023-02-02T19:28:41Z) - MATE: Masked Autoencoders are Online 3D Test-Time Learners [63.3907730920114]
MATE is the first Test-Time-Training (TTT) method designed for 3D data.
It makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data.
arXiv Detail & Related papers (2022-11-21T13:19:08Z) - Portuguese Man-of-War Image Classification with Convolutional Neural
Networks [58.720142291102135]
Portuguese man-of-war (PMW) is a gelatinous organism with long tentacles capable of causing severe burns.
This paper reports on the use of convolutional neural networks for recognizing PMW images from the Instagram social media.
arXiv Detail & Related papers (2022-07-04T03:06:45Z) - Assessing Dataset Bias in Computer Vision [0.0]
biases have the tendency to propagate to the models that train on them, often leading to a poor performance in the minority class.
We will apply several augmentation techniques on a sample of the UTKFace dataset, such as undersampling, geometric transformations, variational autoencoders (VAEs), and generative adversarial networks (GANs)
We were able to show that our model has a better overall performance and consistency on age and ethnicity classification on multiple datasets when compared with the FairFace model.
arXiv Detail & Related papers (2022-05-03T22:45:49Z) - What Stops Learning-based 3D Registration from Working in the Real
World? [53.68326201131434]
This work identifies the sources of 3D point cloud registration failures, analyze the reasons behind them, and propose solutions.
Ultimately, this translates to a best-practice 3D registration network (BPNet), constituting the first learning-based method able to handle previously-unseen objects in real-world data.
Our model generalizes to real data without any fine-tuning, reaching an accuracy of up to 67% on point clouds of unseen objects obtained with a commercial sensor.
arXiv Detail & Related papers (2021-11-19T19:24:27Z) - EEGEyeNet: a Simultaneous Electroencephalography and Eye-tracking
Dataset and Benchmark for Eye Movement Prediction [5.10183147987411]
We present a new dataset with the goal of advancing research in the intersection of brain activities and eye movements.
EEGEyeNet consists of simultaneous Electroencephalography (EEG) and Eye-tracking (ET) recordings from 356 different subjects.
We also propose a benchmark to evaluate gaze prediction from EEG measurements.
arXiv Detail & Related papers (2021-11-06T15:41:27Z) - Application of Facial Recognition using Convolutional Neural Networks
for Entry Access Control [0.0]
The paper focuses on solving the supervised classification problem of taking images of people as input and classifying the person in the image as one of the authors or not.
Two approaches are proposed: (1) building and training a neural network called WoodNet from scratch and (2) leveraging transfer learning by utilizing a network pre-trained on the ImageNet database.
The results are two models classifying the individuals in the dataset with high accuracy, achieving over 99% accuracy on held-out test data.
arXiv Detail & Related papers (2020-11-23T07:55:24Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.