FOSS: Multi-Person Age Estimation with Focusing on Objects and Still
Seeing Surroundings
- URL: http://arxiv.org/abs/2010.07544v2
- Date: Fri, 19 Feb 2021 03:47:55 GMT
- Title: FOSS: Multi-Person Age Estimation with Focusing on Objects and Still
Seeing Surroundings
- Authors: Masakazu Yoshimura and Satoshi Ogata
- Abstract summary: In some situations, age estimation in the wild and for multi-person is needed.
We propose a method that can detect and estimate the age of multi-person with a single model.
We also adapted our proposed model to commonly used single person photographed age estimation datasets.
- Score: 1.370633147306388
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Age estimation from images can be used in many practical scenes. Most of the
previous works targeted on the estimation from images in which only one face
exists. Also, most of the open datasets for age estimation contain images like
that. However, in some situations, age estimation in the wild and for
multi-person is needed. Usually, such situations were solved by two separate
models; one is a face detector model which crops facial regions and the other
is an age estimation model which estimates from cropped images. In this work,
we propose a method that can detect and estimate the age of multi-person with a
single model which estimates age with focusing on faces and still seeing
surroundings. Also, we propose a training method which enables the model to
estimate multi-person well despite trained with images in which only one face
is photographed. In the experiments, we evaluated our proposed method compared
with the traditional approach using two separate models. As the result, the
accuracy could be enhanced with our proposed method. We also adapted our
proposed model to commonly used single person photographed age estimation
datasets and it is proved that our method is also effective to those images and
outperforms the state of the art accuracy.
Related papers
- Evaluating Multiview Object Consistency in Humans and Image Models [68.36073530804296]
We leverage an experimental design from the cognitive sciences which requires zero-shot visual inferences about object shape.
We collect 35K trials of behavioral data from over 500 participants.
We then evaluate the performance of common vision models.
arXiv Detail & Related papers (2024-09-09T17:59:13Z) - MiVOLO: Multi-input Transformer for Age and Gender Estimation [0.0]
We present MiVOLO, a straightforward approach for age and gender estimation using the latest vision transformer.
Our method integrates both tasks into a unified dual input/output model.
We compare our model's age recognition performance with human-level accuracy and demonstrate that it significantly outperforms humans across a majority of age ranges.
arXiv Detail & Related papers (2023-07-10T14:58:10Z) - PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching [51.142988196855484]
We propose PoseMatcher, an accurate model free one-shot object pose estimator.
We create a new training pipeline for object to image matching based on a three-view system.
To enable PoseMatcher to attend to distinct input modalities, an image and a pointcloud, we introduce IO-Layer.
arXiv Detail & Related papers (2023-04-03T21:14:59Z) - NOPE: Novel Object Pose Estimation from a Single Image [67.11073133072527]
We propose an approach that takes a single image of a new object as input and predicts the relative pose of this object in new images without prior knowledge of the object's 3D model.
We achieve this by training a model to directly predict discriminative embeddings for viewpoints surrounding the object.
This prediction is done using a simple U-Net architecture with attention and conditioned on the desired pose, which yields extremely fast inference.
arXiv Detail & Related papers (2023-03-23T18:55:43Z) - Unsupervised Deep Learning Meets Chan-Vese Model [77.24463525356566]
We propose an unsupervised image segmentation approach that integrates the Chan-Vese (CV) model with deep neural networks.
Our basic idea is to apply a deep neural network that maps the image into a latent space to alleviate the violation of the piecewise constant assumption in image space.
arXiv Detail & Related papers (2022-04-14T13:23:57Z) - FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in
the Wild [50.8865921538953]
We propose a method to explicitly incorporate facial semantics into age estimation.
We design a face parsing-based network to learn semantic information at different scales.
We show that our method consistently outperforms all existing age estimation methods.
arXiv Detail & Related papers (2021-06-21T14:31:32Z) - Reducing Racial Bias in Facial Age Prediction using Unsupervised Domain
Adaptation in Regression [0.0]
We propose an approach for unsupervised domain adaptation for the task of estimating someone's age from a given face image.
In order to avoid the propagation of racial bias in most publicly available face image datasets, we perform domain adaptation to motivate the predictor to learn features that are invariant to ethnicity.
arXiv Detail & Related papers (2021-04-05T05:31:12Z) - Hierarchical Attention-based Age Estimation and Bias Estimation [16.335191345543063]
We propose a novel deep-learning approach for age estimation based on face images.
Our scheme is shown to outperform contemporary schemes and provide a new state-of-the-art age estimation accuracy.
arXiv Detail & Related papers (2021-03-17T19:41:34Z) - Chronological age estimation of lateral cephalometric radiographs with
deep learning [0.0]
The proposed saliency map enhancements chronological age estimation method of lateral cephalometric radiographs can work well in chronological age estimation task.
Our method was tested on 3014 LC images from 4 to 40 years old.
The MEA of the experimental result is 1.250, which is less than the result of the state-of-the-art benchmark.
arXiv Detail & Related papers (2021-01-28T03:43:24Z) - Investigating Bias in Deep Face Analysis: The KANFace Dataset and
Empirical Study [67.3961439193994]
We introduce the most comprehensive, large-scale dataset of facial images and videos to date.
The data are manually annotated in terms of identity, exact age, gender and kinship.
A method to debias network embeddings is introduced and tested on the proposed benchmarks.
arXiv Detail & Related papers (2020-05-15T00:14:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.