A Detailed Look At CNN-based Approaches In Facial Landmark Detection
- URL: http://arxiv.org/abs/2005.08649v1
- Date: Fri, 8 May 2020 16:17:42 GMT
- Title: A Detailed Look At CNN-based Approaches In Facial Landmark Detection
- Authors: Chih-Fan Hsu, Chia-Ching Lin, Ting-Yang Hung, Chin-Laung Lei and
Kuan-Ta Chen
- Abstract summary: CNN-based approaches can be divided into regression and heatmap approaches.
In this paper, we investigate both CNN-based approaches, generalize their advantages and disadvantages, and introduce a variation of the heatmap approach.
A comprehensive evaluation is conducted and the result shows that the proposed model outperforms other models in all tested datasets.
- Score: 5.774786149181392
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Facial landmark detection has been studied over decades. Numerous neural
network (NN)-based approaches have been proposed for detecting landmarks,
especially the convolutional neural network (CNN)-based approaches. In general,
CNN-based approaches can be divided into regression and heatmap approaches.
However, no research systematically studies the characteristics of different
approaches. In this paper, we investigate both CNN-based approaches, generalize
their advantages and disadvantages, and introduce a variation of the heatmap
approach, a pixel-wise classification (PWC) model. To the best of our
knowledge, using the PWC model to detect facial landmarks have not been
comprehensively studied. We further design a hybrid loss function and a
discrimination network for strengthening the landmarks' interrelationship
implied in the PWC model to improve the detection accuracy without modifying
the original model architecture. Six common facial landmark datasets, AFW,
Helen, LFPW, 300-W, IBUG, and COFW are adopted to train or evaluate our model.
A comprehensive evaluation is conducted and the result shows that the proposed
model outperforms other models in all tested datasets.
Related papers
- Model Evaluation and Anomaly Detection in Temporal Complex Networks using Deep Learning Methods [0.0]
This paper proposes an automatic approach based on deep learning to handle the issue of results evaluation for temporal network models.
In addition to an evaluation method, the proposed method can also be used for anomaly detection in evolving networks.
arXiv Detail & Related papers (2024-06-15T09:19:09Z) - Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective [64.04617968947697]
We introduce a novel data-model co-design perspective: to promote superior weight sparsity.
Specifically, customized Visual Prompts are mounted to upgrade neural Network sparsification in our proposed VPNs framework.
arXiv Detail & Related papers (2023-12-03T13:50:24Z) - Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on
Bidirectional Prediction [22.894810893732416]
The paper proposes a bidirectional correspondence prediction network with a point-wise attention-aware mechanism.
Our key insight is that the correlations between each model point and scene point provide essential information for learning point-pair matches.
Experimental results on the public datasets of LineMOD, YCB-Video, and Occ-LineMOD show that the proposed method achieves better performance than other state-of-the-art methods.
arXiv Detail & Related papers (2023-08-16T17:13:45Z) - Energy-based Out-of-Distribution Detection for Graph Neural Networks [76.0242218180483]
We propose a simple, powerful and efficient OOD detection model for GNN-based learning on graphs, which we call GNNSafe.
GNNSafe achieves up to $17.0%$ AUROC improvement over state-of-the-arts and it could serve as simple yet strong baselines in such an under-developed area.
arXiv Detail & Related papers (2023-02-06T16:38:43Z) - Multi-Branch Deep Radial Basis Function Networks for Facial Emotion
Recognition [80.35852245488043]
We propose a CNN based architecture enhanced with multiple branches formed by radial basis function (RBF) units.
RBF units capture local patterns shared by similar instances using an intermediate representation.
We show it is the incorporation of local information what makes the proposed model competitive.
arXiv Detail & Related papers (2021-09-07T21:05:56Z) - Anomaly Detection on Attributed Networks via Contrastive Self-Supervised
Learning [50.24174211654775]
We present a novel contrastive self-supervised learning framework for anomaly detection on attributed networks.
Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair.
A graph neural network-based contrastive learning model is proposed to learn informative embedding from high-dimensional attributes and local structure.
arXiv Detail & Related papers (2021-02-27T03:17:20Z) - Firearm Detection via Convolutional Neural Networks: Comparing a
Semantic Segmentation Model Against End-to-End Solutions [68.8204255655161]
Threat detection of weapons and aggressive behavior from live video can be used for rapid detection and prevention of potentially deadly incidents.
One way for achieving this is through the use of artificial intelligence and, in particular, machine learning for image analysis.
We compare a traditional monolithic end-to-end deep learning model and a previously proposed model based on an ensemble of simpler neural networks detecting fire-weapons via semantic segmentation.
arXiv Detail & Related papers (2020-12-17T15:19:29Z) - Identification of Probability weighted ARX models with arbitrary domains [75.91002178647165]
PieceWise Affine models guarantees universal approximation, local linearity and equivalence to other classes of hybrid system.
In this work, we focus on the identification of PieceWise Auto Regressive with eXogenous input models with arbitrary regions (NPWARX)
The architecture is conceived following the Mixture of Expert concept, developed within the machine learning field.
arXiv Detail & Related papers (2020-09-29T12:50:33Z) - Adaptive Exploitation of Pre-trained Deep Convolutional Neural Networks
for Robust Visual Tracking [14.627458410954628]
This paper provides a comprehensive analysis of four commonly used CNN models to determine the best feature maps of each model.
With the aid of analysis results as attribute dictionaries, adaptive exploitation of deep features is proposed to improve the accuracy and robustness of visual trackers.
arXiv Detail & Related papers (2020-08-29T17:09:43Z) - Critically Examining the Claimed Value of Convolutions over User-Item
Embedding Maps for Recommender Systems [14.414055798999764]
In recent years, algorithm research in the area of recommender systems has shifted from matrix factorization techniques to neural approaches.
We show through analytical considerations and empirical evaluations that the claimed gains reported in the literature cannot be attributed to the ability of CNNs to model embedding correlations.
arXiv Detail & Related papers (2020-07-23T10:03:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.