Robust Face Alignment by Multi-order High-precision Hourglass Network
- URL: http://arxiv.org/abs/2010.08722v1
- Date: Sat, 17 Oct 2020 05:40:30 GMT
- Title: Robust Face Alignment by Multi-order High-precision Hourglass Network
- Authors: Jun Wan, Zhihui Lai, Jun Liu, Jie Zhou, Can Gao
- Abstract summary: This paper proposes a heatmap subpixel regression (HSR) method and a multi-order cross geometry-aware (MCG) model.
The HSR method is proposed to achieve high-precision landmark detection by a well-designed subpixel detection loss (SDL) and subpixel detection technology (SDT)
At the same time, the MCG model is able to use the proposed multi-order cross information to learn more discriminative representations for enhancing facial geometric constraints and context information.
- Score: 44.94500006611075
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Heatmap regression (HR) has become one of the mainstream approaches for face
alignment and has obtained promising results under constrained environments.
However, when a face image suffers from large pose variations, heavy occlusions
and complicated illuminations, the performances of HR methods degrade greatly
due to the low resolutions of the generated landmark heatmaps and the exclusion
of important high-order information that can be used to learn more
discriminative features. To address the alignment problem for faces with
extremely large poses and heavy occlusions, this paper proposes a heatmap
subpixel regression (HSR) method and a multi-order cross geometry-aware (MCG)
model, which are seamlessly integrated into a novel multi-order high-precision
hourglass network (MHHN). The HSR method is proposed to achieve high-precision
landmark detection by a well-designed subpixel detection loss (SDL) and
subpixel detection technology (SDT). At the same time, the MCG model is able to
use the proposed multi-order cross information to learn more discriminative
representations for enhancing facial geometric constraints and context
information. To the best of our knowledge, this is the first study to explore
heatmap subpixel regression for robust and high-precision face alignment. The
experimental results from challenging benchmark datasets demonstrate that our
approach outperforms state-of-the-art methods in the literature.
Related papers
- Multi-Head Attention Residual Unfolded Network for Model-Based Pansharpening [2.874893537471256]
Unfolding fusion methods integrate the powerful representation capabilities of deep learning with the robustness of model-based approaches.
In this paper, we propose a model-based deep unfolded method for satellite image fusion.
Experimental results on PRISMA, Quickbird, and WorldView2 datasets demonstrate the superior performance of our method.
arXiv Detail & Related papers (2024-09-04T13:05:00Z) - Meet-in-the-middle: Multi-scale upsampling and matching for
cross-resolution face recognition [8.330506641637793]
We propose a method that combines face super-resolution, resolution matching, and multi-scale template accumulation to reliably recognize faces from surveillance footage.
The proposed approach does not require training or fine-tuning on the target dataset of real surveillance images.
arXiv Detail & Related papers (2022-11-28T11:23:32Z) - Semantic Encoder Guided Generative Adversarial Face Ultra-Resolution
Network [15.102899995465041]
We propose a novel face super-resolution method, namely Semantic guided Generative Adversarial Face Ultra-Resolution Network (SEGA-FURN)
The proposed network is composed of a novel semantic encoder that has the ability to capture the embedded semantics to guide adversarial learning and a novel generator that uses a hierarchical architecture named Residual in Internal Block (RIDB)
Experiments on large face datasets have proved that the proposed method can achieve superior super-resolution results and significantly outperform other state-of-the-art methods in both qualitative and quantitative comparisons.
arXiv Detail & Related papers (2022-11-18T23:16:57Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Deep Posterior Distribution-based Embedding for Hyperspectral Image
Super-resolution [75.24345439401166]
This paper focuses on how to embed the high-dimensional spatial-spectral information of hyperspectral (HS) images efficiently and effectively.
We formulate HS embedding as an approximation of the posterior distribution of a set of carefully-defined HS embedding events.
Then, we incorporate the proposed feature embedding scheme into a source-consistent super-resolution framework that is physically-interpretable.
Experiments over three common benchmark datasets demonstrate that PDE-Net achieves superior performance over state-of-the-art methods.
arXiv Detail & Related papers (2022-05-30T06:59:01Z) - Uncertainty-Aware Deep Multi-View Photometric Stereo [100.97116470055273]
Photometric stereo (PS) is excellent at recovering high-frequency surface details, whereas multi-view stereo (MVS) can help remove the low-frequency distortion due to PS and retain the global shape.
This paper proposes an approach that can effectively utilize such complementary strengths of PS and MVS.
We estimate per-pixel surface normals and depth using an uncertainty-aware deep-PS network and deep-MVS network, respectively.
arXiv Detail & Related papers (2022-02-26T05:45:52Z) - High-resolution Depth Maps Imaging via Attention-based Hierarchical
Multi-modal Fusion [84.24973877109181]
We propose a novel attention-based hierarchical multi-modal fusion network for guided DSR.
We show that our approach outperforms state-of-the-art methods in terms of reconstruction accuracy, running speed and memory efficiency.
arXiv Detail & Related papers (2021-04-04T03:28:33Z) - Robust Facial Landmark Detection by Multi-order Multi-constraint Deep
Networks [35.19368350816032]
We propose a Multi-order Multi-constraint Deep Network (MMDN) for more powerful feature correlations and shape constraints learning.
An Implicit Multi-order Correlating Geometry-aware (IMCG) model is proposed to introduce the multi-order spatial correlations and multi-order channel correlations.
An Explicit Probability-based Boundary-adaptive Regression (EPBR) method is developed to enhance the global shape constraints.
arXiv Detail & Related papers (2020-12-09T09:11:47Z) - Pyramidal Edge-maps and Attention based Guided Thermal Super-resolution [28.798966778371145]
Guided super-resolution (GSR) of thermal images using visible range images is challenging because of the difference in the spectral-range between the images.
We propose a novel algorithm for GSR based on pyramidal edge-maps extracted from the visible image.
Our model outperforms the state-of-the-art GSR methods, both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-13T12:11:26Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.