ASFD: Automatic and Scalable Face Detector
- URL: http://arxiv.org/abs/2201.10781v1
- Date: Wed, 26 Jan 2022 07:11:51 GMT
- Title: ASFD: Automatic and Scalable Face Detector
- Authors: Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, ZhenYu Zhang, Chengjie
Wang, Jilin Li, Xiaoming Huang, Yili Xia
- Abstract summary: We propose to search an effective FAE architecture, termed AutoFAE, which outperforms all existing FAE modules in face detection with a considerable margin.
In particular, our strong ASFD-D6 outperforms the best competitor with AP 96.7/96.2/92.1 on WIDER Face test, and the lightweight ASFD-D0 costs about 3.1 ms, more than 320 FPS.
- Score: 59.31799101216593
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Along with current multi-scale based detectors, Feature Aggregation and
Enhancement (FAE) modules have shown superior performance gains for
cutting-edge object detection. However, these hand-crafted FAE modules show
inconsistent improvements on face detection, which is mainly due to the
significant distribution difference between its training and applying corpus,
COCO vs. WIDER Face. To tackle this problem, we essentially analyse the effect
of data distribution, and consequently propose to search an effective FAE
architecture, termed AutoFAE by a differentiable architecture search, which
outperforms all existing FAE modules in face detection with a considerable
margin. Upon the found AutoFAE and existing backbones, a supernet is further
built and trained, which automatically obtains a family of detectors under the
different complexity constraints. Extensive experiments conducted on popular
benchmarks, WIDER Face and FDDB, demonstrate the state-of-the-art
performance-efficiency trade-off for the proposed automatic and scalable face
detector (ASFD) family. In particular, our strong ASFD-D6 outperforms the best
competitor with AP 96.7/96.2/92.1 on WIDER Face test, and the lightweight
ASFD-D0 costs about 3.1 ms, more than 320 FPS, on the V100 GPU with
VGA-resolution images.
Related papers
- Towards Data-Centric Face Anti-Spoofing: Improving Cross-domain Generalization via Physics-based Data Synthesis [64.46312434121455]
Face Anti-Spoofing (FAS) research is challenged by the cross-domain problem, where there is a domain gap between the training and testing data.
We propose task-specific FAS data augmentation (FAS-Aug), which increases data diversity by synthesizing data of artifacts.
We also propose Spoofing Attack Risk Equalization (SARE) to prevent models from relying on certain types of artifacts and improve the generalization performance.
arXiv Detail & Related papers (2024-09-04T01:45:18Z) - Efficient Facial Landmark Detection for Embedded Systems [1.0878040851638]
This paper introduces the Efficient Facial Landmark Detection (EFLD) model, specifically designed for edge devices confronted with the challenges related to power consumption and time latency.
EFLD features a lightweight backbone and a flexible detection head, each significantly enhancing operational efficiency on resource-constrained devices.
We propose a cross-format training strategy to enhance the model's generalizability and robustness, without increasing inference costs.
arXiv Detail & Related papers (2024-07-14T14:49:20Z) - FMM-X3D: FPGA-based modeling and mapping of X3D for Human Action
Recognition [10.385864925381384]
This paper addresses the problem of mapping X3D, a state-of-the-art model in Human Action Recognition, onto any FPGA device.
The proposed toolflow generates an optimised stream-based hardware system, taking into account the available resources and off-chip memory characteristics of the FPGA device.
arXiv Detail & Related papers (2023-05-29T11:17:51Z) - EResFD: Rediscovery of the Effectiveness of Standard Convolution for
Lightweight Face Detection [13.357235715178584]
We re-examine the effectiveness of the standard convolutional block as a lightweight backbone architecture for face detection.
We show that heavily channel-pruned standard convolution layers can achieve better accuracy and inference speed.
Our proposed detector EResFD obtained 80.4% mAP on WIDER FACE Hard subset which only takes 37.7 ms for VGA image inference on CPU.
arXiv Detail & Related papers (2022-04-04T02:30:43Z) - A Synthesis-Based Approach for Thermal-to-Visible Face Verification [105.63410428506536]
This paper presents an algorithm that achieves state-of-the-art performance on the ARL-VTF and TUFTS multi-spectral face datasets.
We also present MILAB-VTF(B), a challenging multi-spectral face dataset composed of paired thermal and visible videos.
arXiv Detail & Related papers (2021-08-21T17:59:56Z) - Face Anti-Spoofing with Human Material Perception [76.4844593082362]
Face anti-spoofing (FAS) plays a vital role in securing the face recognition systems from presentation attacks.
We rephrase face anti-spoofing as a material recognition problem and combine it with classical human material perception.
We propose the Bilateral Convolutional Networks (BCN), which is able to capture intrinsic material-based patterns.
arXiv Detail & Related papers (2020-07-04T18:25:53Z) - ACFD: Asymmetric Cartoon Face Detector [72.60983975604145]
ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge.
Our ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge under the constraints of model size 200MB, inference time 50ms per image, and without any pretrained models.
arXiv Detail & Related papers (2020-07-02T05:57:34Z) - ASFD: Automatic and Scalable Face Detector [129.82350993748258]
We propose a novel Automatic and Scalable Face Detector (ASFD)
ASFD is based on a combination of neural architecture search techniques as well as a new loss design.
Our ASFD-D6 outperforms the prior strong competitors, and our lightweight ASFD-D0 runs at more than 120 FPS with Mobilenet for VGA-resolution images.
arXiv Detail & Related papers (2020-03-25T06:00:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.