Towards a Real-Time Facial Analysis System
- URL: http://arxiv.org/abs/2109.10393v1
- Date: Tue, 21 Sep 2021 18:27:15 GMT
- Title: Towards a Real-Time Facial Analysis System
- Authors: Bishwo Adhikari, Xingyang Ni, Esa Rahtu, Heikki Huttunen
- Abstract summary: We present a system-level design of a real-time facial analysis system.
With a collection of deep neural networks for object detection, classification, and regression, the system recognizes age, gender, facial expression, and facial similarity for each person that appears in the camera view.
Results on common off-the-shelf architecture show that the system's accuracy is comparable to the state-of-the-art methods, and the recognition speed satisfies real-time requirements.
- Score: 13.649384403827359
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Facial analysis is an active research area in computer vision, with many
practical applications. Most of the existing studies focus on addressing one
specific task and maximizing its performance. For a complete facial analysis
system, one needs to solve these tasks efficiently to ensure a smooth
experience. In this work, we present a system-level design of a real-time
facial analysis system. With a collection of deep neural networks for object
detection, classification, and regression, the system recognizes age, gender,
facial expression, and facial similarity for each person that appears in the
camera view. We investigate the parallelization and interplay of individual
tasks. Results on common off-the-shelf architecture show that the system's
accuracy is comparable to the state-of-the-art methods, and the recognition
speed satisfies real-time requirements. Moreover, we propose a multitask
network for jointly predicting the first three attributes, i.e., age, gender,
and facial expression. Source code and trained models are available at
https://github.com/mahehu/TUT-live-age-estimator.
Related papers
- DeepFace-Attention: Multimodal Face Biometrics for Attention Estimation with Application to e-Learning [18.36413246876648]
This work introduces an innovative method for estimating attention levels (cognitive load) using an ensemble of facial analysis techniques applied to webcam videos.
Our approach adapts state-of-the-art facial analysis technologies to quantify the users' cognitive load in the form of high or low attention.
Our method outperforms existing state-of-the-art accuracies using the public mEBAL2 benchmark.
arXiv Detail & Related papers (2024-08-10T11:39:11Z) - Task-adaptive Q-Face [75.15668556061772]
We propose a novel task-adaptive multi-task face analysis method named as Q-Face.
Q-Face simultaneously performs multiple face analysis tasks with a unified model.
Our method achieves state-of-the-art performance on face expression recognition, action unit detection, face attribute analysis, age estimation, and face pose estimation.
arXiv Detail & Related papers (2024-05-15T03:13:11Z) - FaceXFormer: A Unified Transformer for Facial Analysis [59.94066615853198]
FaceXformer is an end-to-end unified transformer model for a range of facial analysis tasks.
Our model effectively handles images "in-the-wild," demonstrating its robustness and generalizability across eight different tasks.
arXiv Detail & Related papers (2024-03-19T17:58:04Z) - Faceptor: A Generalist Model for Face Perception [52.8066001012464]
Faceptor is proposed to adopt a well-designed single-encoder dual-decoder architecture.
Layer-Attention into Faceptor enables the model to adaptively select features from optimal layers to perform the desired tasks.
Our training framework can also be applied to auxiliary supervised learning, significantly improving performance in data-sparse tasks such as age estimation and expression recognition.
arXiv Detail & Related papers (2024-03-14T15:42:31Z) - A Real-Time Multi-Task Learning System for Joint Detection of Face,
Facial Landmark and Head Pose [3.661587008381534]
Extreme head postures pose a common challenge across a spectrum of facial analysis tasks.
This paper focuses on the integration of these tasks, particularly when addressing the complexities posed by large-angle face poses.
The primary contribution of this study is the proposal of a real-time multi-task detection system.
arXiv Detail & Related papers (2023-09-21T04:15:26Z) - SwinFace: A Multi-task Transformer for Face Recognition, Expression
Recognition, Age Estimation and Attribute Estimation [60.94239810407917]
This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation based on a single Swin Transformer.
To address the conflicts among multiple tasks, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis.
Experiments show that the proposed model has a better understanding of the face and achieves excellent performance for all tasks.
arXiv Detail & Related papers (2023-08-22T15:38:39Z) - Robustness Disparities in Face Detection [64.71318433419636]
We present the first of its kind detailed benchmark of face detection systems, specifically examining the robustness to noise of commercial and academic models.
Across all the datasets and systems, we generally find that photos of individuals who are $textitmasculine presenting$, of $textitolder$, of $textitdarker skin type$, or have $textitdim lighting$ are more susceptible to errors than their counterparts in other identities.
arXiv Detail & Related papers (2022-11-29T05:22:47Z) - Pre-training strategies and datasets for facial representation learning [58.8289362536262]
We show how to find a universal face representation that can be adapted to several facial analysis tasks and datasets.
We systematically investigate two ways of large-scale representation learning applied to faces: supervised and unsupervised pre-training.
Our main two findings are: Unsupervised pre-training on completely in-the-wild, uncurated data provides consistent and, in some cases, significant accuracy improvements.
arXiv Detail & Related papers (2021-03-30T17:57:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.