KPNet: Towards Minimal Face Detector
- URL: http://arxiv.org/abs/2003.07543v1
- Date: Tue, 17 Mar 2020 05:37:45 GMT
- Title: KPNet: Towards Minimal Face Detector
- Authors: Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng
Yuan
- Abstract summary: In this work, we find that the appearance feature of a generic face is discriminative enough for a tiny and shallow neural network to verify from the background.
The proposed KPNet detects small facial keypoints instead of the whole face by in a bottom-up manner.
- Score: 27.68740638740399
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The small receptive field and capacity of minimal neural networks limit their
performance when using them to be the backbone of detectors. In this work, we
find that the appearance feature of a generic face is discriminative enough for
a tiny and shallow neural network to verify from the background. And the
essential barriers behind us are 1) the vague definition of the face bounding
box and 2) tricky design of anchor-boxes or receptive field. Unlike most
top-down methods for joint face detection and alignment, the proposed KPNet
detects small facial keypoints instead of the whole face by in a bottom-up
manner. It first predicts the facial landmarks from a low-resolution image via
the well-designed fine-grained scale approximation and scale adaptive
soft-argmax operator. Finally, the precise face bounding boxes, no matter how
we define it, can be inferred from the keypoints. Without any complex head
architecture or meticulous network designing, the KPNet achieves
state-of-the-art accuracy on generic face detection and alignment benchmarks
with only $\sim1M$ parameters, which runs at 1000fps on GPU and is easy to
perform real-time on most modern front-end chips.
Related papers
- Learning to Make Keypoints Sub-Pixel Accurate [80.55676599677824]
This work addresses the challenge of sub-pixel accuracy in detecting 2D local features.
We propose a novel network that enhances any detector with sub-pixel precision by learning an offset vector for detected features.
arXiv Detail & Related papers (2024-07-16T12:39:56Z) - Efficient Verification-Based Face Identification [50.616875565173274]
We study the problem of performing face verification with an efficient neural model $f$.
Our model leads to a substantially small $f$ requiring only 23k parameters and 5M floating point operations (FLOPS)
We use six face verification datasets to demonstrate that our method is on par or better than state-of-the-art models.
arXiv Detail & Related papers (2023-12-20T18:08:02Z) - Filter-Pruning of Lightweight Face Detectors Using a Geometric Median
Criterion [9.284740716447342]
We implement filter pruning on two already small and compact face detectors, named EXTD and EResFD.
The proposed approach has the potential to further reduce the model size of already lightweight face detectors.
arXiv Detail & Related papers (2023-11-28T09:02:38Z) - SqueezerFaceNet: Reducing a Small Face Recognition CNN Even More Via
Filter Pruning [55.84746218227712]
We develop SqueezerFaceNet, a light face recognition network which less than 1M parameters.
We show that it can be further reduced (up to 40%) without an appreciable loss in performance.
arXiv Detail & Related papers (2023-07-20T08:38:50Z) - A new face swap method for image and video domains: a technical report [60.47144478048589]
We introduce a new face swap pipeline that is based on FaceShifter architecture.
New eye loss function, super-resolution block, and Gaussian-based face mask generation leads to improvements in quality.
arXiv Detail & Related papers (2022-02-07T10:15:50Z) - End2End Occluded Face Recognition by Masking Corrupted Features [82.27588990277192]
State-of-the-art general face recognition models do not generalize well to occluded face images.
This paper presents a novel face recognition method that is robust to occlusions based on a single end-to-end deep neural network.
Our approach, named FROM (Face Recognition with Occlusion Masks), learns to discover the corrupted features from the deep convolutional neural networks, and clean them by the dynamically learned masks.
arXiv Detail & Related papers (2021-08-21T09:08:41Z) - An Efficient Multitask Neural Network for Face Alignment, Head Pose
Estimation and Face Tracking [9.39854778804018]
We propose an efficient multitask face alignment, face tracking and head pose estimation network (ATPN)
ATPN achieves improved performance compared to previous state-of-the-art methods while having less number of parameters and FLOPS.
arXiv Detail & Related papers (2021-03-13T04:41:15Z) - Sub-pixel face landmarks using heatmaps and a bag of tricks [0.0]
We present our heatmap regression approach to accurately localize face landmarks.
We use five na"ive face landmarks from a publicly available face detector to position and align the face.
We also show that it is possible to reduce the upscaling complexity by using a mixture of deconvolution and pixel-shuffle layers.
arXiv Detail & Related papers (2021-03-04T14:34:20Z) - Long-distance tiny face detection based on enhanced YOLOv3 for unmanned
system [10.856903504701712]
We propose an enhanced network model (YOLOv3-C) based on the YOLOv3 algorithm for unmanned platform.
The enhanced model improves the accuracy of tiny face detection in the cases of long-distance and high-density crowds.
arXiv Detail & Related papers (2020-10-09T08:12:58Z) - Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the
Wild [104.61677518999976]
We propose Pixel-in-Pixel Net (PIPNet) for facial landmark detection.
The proposed model is equipped with a novel detection head based on heatmap regression.
To further improve the cross-domain generalization capability of PIPNet, we propose self-training with curriculum.
arXiv Detail & Related papers (2020-03-08T12:23:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.