Multistage Model for Robust Face Alignment Using Deep Neural Networks
- URL: http://arxiv.org/abs/2002.01075v1
- Date: Tue, 4 Feb 2020 01:13:58 GMT
- Title: Multistage Model for Robust Face Alignment Using Deep Neural Networks
- Authors: Huabin Wang and Rui Cheng and Jian Zhou and Liang Tao and Hon Keung
Kwan
- Abstract summary: A multistage model is proposed which takes advantage of spatial transformer networks, hourglass networks and exemplar-based shape constraints.
Experiments are performed to demonstrate the superior performance of the proposed method over other state-of-the-art methods.
- Score: 8.504539228134082
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An ability to generalize unconstrained conditions such as severe occlusions
and large pose variations remains a challenging goal to achieve in face
alignment. In this paper, a multistage model based on deep neural networks is
proposed which takes advantage of spatial transformer networks, hourglass
networks and exemplar-based shape constraints. First, a spatial transformer -
generative adversarial network which consists of convolutional layers and
residual units is utilized to solve the initialization issues caused by face
detectors, such as rotation and scale variations, to obtain improved face
bounding boxes for face alignment. Then, stacked hourglass network is employed
to obtain preliminary locations of landmarks as well as their corresponding
scores. In addition, an exemplar-based shape dictionary is designed to
determine landmarks with low scores based on those with high scores. By
incorporating face shape constraints, misaligned landmarks caused by occlusions
or cluttered backgrounds can be considerably improved. Extensive experiments
based on challenging benchmark datasets are performed to demonstrate the
superior performance of the proposed method over other state-of-the-art
methods.
Related papers
- Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers [59.0181939916084]
Traditional 3D networks mainly focus on local geometric details and ignore the topological structure between local geometries.
We propose a novel Priors Distillation (RPD) method to extract priors from the well-trained transformers on massive images.
Experiments on the PointDA-10 and the Sim-to-Real datasets verify that the proposed method consistently achieves the state-of-the-art performance of UDA for point cloud classification.
arXiv Detail & Related papers (2024-07-26T06:29:09Z) - Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with
Latent Geometric-Consistent Learning [52.825441454264585]
We propose an arbitrary-scale Point cloud Upsampling framework using Voxel-based Network (textbfPU-VoxelNet)
Thanks to the completeness and regularity inherited from the voxel representation, voxel-based networks are capable of providing predefined grid space to approximate 3D surface.
A density-guided grid resampling method is developed to generate high-fidelity points while effectively avoiding sampling outliers.
arXiv Detail & Related papers (2024-03-08T07:31:14Z) - Deformation-Invariant Neural Network and Its Applications in Distorted
Image Restoration and Analysis [8.009077765403287]
Images degraded by geometric distortions pose a significant challenge to imaging and computer vision tasks such as object recognition.
Deep learning-based imaging models usually fail to give accurate performance for geometrically distorted images.
We propose the deformation-invariant neural network (DINN), a framework to address the problem of imaging tasks for geometrically distorted images.
arXiv Detail & Related papers (2023-10-04T08:01:36Z) - Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution [91.3781512926942]
Image super-resolution (SR) has witnessed extensive neural network designs from CNN to transformer architectures.
This work investigates the potential of network pruning for super-resolution iteration to take advantage of off-the-shelf network designs and reduce the underlying computational overhead.
We propose a novel Iterative Soft Shrinkage-Percentage (ISS-P) method by optimizing the sparse structure of a randomly network at each and tweaking unimportant weights with a small amount proportional to the magnitude scale on-the-fly.
arXiv Detail & Related papers (2023-03-16T21:06:13Z) - Parallax-Tolerant Unsupervised Deep Image Stitching [57.76737888499145]
We propose UDIS++, a parallax-tolerant unsupervised deep image stitching technique.
First, we propose a robust and flexible warp to model the image registration from global homography to local thin-plate spline motion.
To further eliminate the parallax artifacts, we propose to composite the stitched image seamlessly by unsupervised learning for seam-driven composition masks.
arXiv Detail & Related papers (2023-02-16T10:40:55Z) - Sign-Agnostic CONet: Learning Implicit Surface Reconstructions by
Sign-Agnostic Optimization of Convolutional Occupancy Networks [39.65056638604885]
We learn implicit surface reconstruction by sign-agnostic optimization of convolutional occupancy networks.
We show this goal can be effectively achieved by a simple yet effective design.
arXiv Detail & Related papers (2021-05-08T03:35:32Z) - ProAlignNet : Unsupervised Learning for Progressively Aligning Noisy
Contours [12.791313859673187]
"ProAlignNet" accounts for large scale misalignments and complex transformations between the contour shapes.
It learns by training with a novel loss function which is derived an upperbound of a proximity-sensitive and local shape-dependent similarity metric.
In two real-world applications, the proposed models consistently perform superior to state-of-the-art methods.
arXiv Detail & Related papers (2020-05-23T14:56:14Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z) - Beyond Dropout: Feature Map Distortion to Regularize Deep Neural
Networks [107.77595511218429]
In this paper, we investigate the empirical Rademacher complexity related to intermediate layers of deep neural networks.
We propose a feature distortion method (Disout) for addressing the aforementioned problem.
The superiority of the proposed feature map distortion for producing deep neural network with higher testing performance is analyzed and demonstrated.
arXiv Detail & Related papers (2020-02-23T13:59:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.