AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses
- URL: http://arxiv.org/abs/2007.03221v3
- Date: Sat, 6 Feb 2021 07:11:57 GMT
- Title: AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses
- Authors: Zixuan Xu, Banghuai Li, Miao Geng, Ye Yuan
- Abstract summary: We propose a split-and-aggregate strategy to address the problem of facial landmark localization across large poses.
Our proposed approach, named AnchorFace, obtains state-of-the-art results with extremely efficient inference speed on four challenging benchmarks.
- Score: 12.291368376851231
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Facial landmark localization aims to detect the predefined points of human
faces, and the topic has been rapidly improved with the recent development of
neural network based methods. However, it remains a challenging task when
dealing with faces in unconstrained scenarios, especially with large pose
variations. In this paper, we target the problem of facial landmark
localization across large poses and address this task based on a
split-and-aggregate strategy. To split the search space, we propose a set of
anchor templates as references for regression, which well addresses the large
variations of face poses. Based on the prediction of each anchor template, we
propose to aggregate the results, which can reduce the landmark uncertainty due
to the large poses. Overall, our proposed approach, named AnchorFace, obtains
state-of-the-art results with extremely efficient inference speed on four
challenging benchmarks, i.e. AFLW, 300W, Menpo, and WFLW dataset. Code will be
available at https://github.com/nothingelse92/AnchorFace.
Related papers
- GLACE: Global Local Accelerated Coordinate Encoding [66.87005863868181]
Scene coordinate regression methods are effective in small-scale scenes but face significant challenges in large-scale scenes.
We propose GLACE, which integrates pre-trained global and local encodings and enables SCR to scale to large scenes with only a single small-sized network.
Our method achieves state-of-the-art results on large-scale scenes with a low-map-size model.
arXiv Detail & Related papers (2024-06-06T17:59:50Z) - COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection [56.7599217711363]
Face forgery recognition methods can only process one face at a time.
Most face forgery recognition methods can only process one face at a time.
We propose COMICS, an end-to-end framework for multi-face forgery detection.
arXiv Detail & Related papers (2023-08-03T03:37:13Z) - Shape Preserving Facial Landmarks with Graph Attention Networks [3.996275177789895]
We propose a model based on the combination of a CNN with a cascade of Graph Attention Network regressors.
We introduce an encoding that jointly represents the appearance and location of facial landmarks and an attention mechanism to weigh the information according to its reliability.
Experiments confirm that the proposed model learns a global representation of the structure of the face, achieving top performance in popular benchmarks on head pose and landmark estimation.
arXiv Detail & Related papers (2022-10-13T17:58:02Z) - Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object
Detection [0.0]
We propose a mixed-scale triplet network, bf ZoomNet, which mimics the behavior of humans when observing vague images.
Specifically, our ZoomNet employs the zoom strategy to learn the discriminative mixed-scale semantics by the designed scale integration unit and hierarchical mixed-scale unit.
Our proposed highly task-friendly model consistently surpasses the existing 23 state-of-the-art methods on four public datasets.
arXiv Detail & Related papers (2022-03-05T09:13:52Z) - Locally Aware Piecewise Transformation Fields for 3D Human Mesh
Registration [67.69257782645789]
We propose piecewise transformation fields that learn 3D translation vectors to map any query point in posed space to its correspond position in rest-pose space.
We show that fitting parametric models with poses by our network results in much better registration quality, especially for extreme poses.
arXiv Detail & Related papers (2021-04-16T15:16:09Z) - Sub-pixel face landmarks using heatmaps and a bag of tricks [0.0]
We present our heatmap regression approach to accurately localize face landmarks.
We use five na"ive face landmarks from a publicly available face detector to position and align the face.
We also show that it is possible to reduce the upscaling complexity by using a mixture of deconvolution and pixel-shuffle layers.
arXiv Detail & Related papers (2021-03-04T14:34:20Z) - Deep Structured Prediction for Facial Landmark Detection [59.60946775628646]
This paper proposes a method for deep structured facial landmark detection based on combining a deep Convolutional Network with a Conditional Random Field.
We demonstrate its superior performance to existing state-of-the-art techniques in facial landmark detection.
arXiv Detail & Related papers (2020-10-18T17:09:24Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z) - A Transfer Learning approach to Heatmap Regression for Action Unit
intensity estimation [50.261472059743845]
Action Units (AUs) are geometrically-based atomic facial muscle movements.
We propose a novel AU modelling problem that consists of jointly estimating their localisation and intensity.
A Heatmap models whether an AU occurs or not at a given spatial location.
arXiv Detail & Related papers (2020-04-14T16:51:13Z) - Multistage Model for Robust Face Alignment Using Deep Neural Networks [8.504539228134082]
A multistage model is proposed which takes advantage of spatial transformer networks, hourglass networks and exemplar-based shape constraints.
Experiments are performed to demonstrate the superior performance of the proposed method over other state-of-the-art methods.
arXiv Detail & Related papers (2020-02-04T01:13:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.