ACFD: Asymmetric Cartoon Face Detector
- URL: http://arxiv.org/abs/2007.00899v1
- Date: Thu, 2 Jul 2020 05:57:34 GMT
- Title: ACFD: Asymmetric Cartoon Face Detector
- Authors: Bin Zhang, Jian Li, Yabiao Wang, Zhipeng Cui, Yili Xia, Chengjie Wang,
Jilin Li, Feiyue Huang
- Abstract summary: ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge.
Our ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge under the constraints of model size 200MB, inference time 50ms per image, and without any pretrained models.
- Score: 72.60983975604145
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cartoon face detection is a more challenging task than human face detection
due to many difficult scenarios is involved. Aiming at the characteristics of
cartoon faces, such as huge differences within the intra-faces, in this paper,
we propose an asymmetric cartoon face detector, named ACFD. Specifically, it
consists of the following modules: a novel backbone VoVNetV3 comprised of
several asymmetric one-shot aggregation modules (AOSA), asymmetric
bi-directional feature pyramid network (ABi-FPN), dynamic anchor match strategy
(DAM) and the corresponding margin binary classification loss (MBC). In
particular, to generate features with diverse receptive fields, multi-scale
pyramid features are extracted by VoVNetV3, and then fused and enhanced
simultaneously by ABi-FPN for handling the faces in some extreme poses and have
disparate aspect ratios. Besides, DAM is used to match enough high-quality
anchors for each face, and MBC is for the strong power of discrimination. With
the effectiveness of these modules, our ACFD achieves the 1st place on the
detection track of 2020 iCartoon Face Challenge under the constraints of model
size 200MB, inference time 50ms per image, and without any pretrained models.
Related papers
- Unified Physical-Digital Face Attack Detection [66.14645299430157]
Face Recognition (FR) systems can suffer from physical (i.e., print photo) and digital (i.e., DeepFake) attacks.
Previous related work rarely considers both situations at the same time.
We propose a Unified Attack Detection framework based on Vision-Language Models (VLMs)
arXiv Detail & Related papers (2024-01-31T09:38:44Z) - Learning Dense Correspondence for NeRF-Based Face Reenactment [24.072019889495966]
We propose a novel framework, which adopts tri-planes as fundamental NeRF representation and decomposes face tri-planes into three components: canonical tri-planes, identity deformations, and motion.
Our framework is the first method that achieves one-shot multi-view face reenactment without a 3D parametric model prior.
arXiv Detail & Related papers (2023-12-16T11:31:34Z) - A Closer Look at Geometric Temporal Dynamics for Face Anti-Spoofing [13.725319422213623]
Face anti-spoofing (FAS) is indispensable for a face recognition system.
We propose Geometry-Aware Interaction Network (GAIN) to distinguish between normal and abnormal movements of live and spoof presentations.
Our approach achieves state-of-the-art performance in the standard intra- and cross-dataset evaluations.
arXiv Detail & Related papers (2023-06-25T18:59:52Z) - Surveillance Face Anti-spoofing [81.50018853811895]
Face Anti-spoofing (FAS) is essential to secure face recognition systems from various physical attacks.
We propose a Contrastive Quality-Invariance Learning (CQIL) network to alleviate the performance degradation caused by image quality.
A large number of experiments verify the quality of the SuHiFiMask dataset and the superiority of the proposed CQIL.
arXiv Detail & Related papers (2023-01-03T07:09:57Z) - Dynamic Prototype Mask for Occluded Person Re-Identification [88.7782299372656]
Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part.
We propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge.
Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously.
arXiv Detail & Related papers (2022-07-19T03:31:13Z) - Decoupled Multi-task Learning with Cyclical Self-Regulation for Face
Parsing [71.19528222206088]
We propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation for face parsing.
Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection.
Our method achieves the new state-of-the-art performance on the Helen, CelebA-HQ, and LapaMask datasets.
arXiv Detail & Related papers (2022-03-28T02:12:30Z) - Two-stream Convolutional Networks for Multi-frame Face Anti-spoofing [1.9890930069402575]
We propose an efficient two-stream model to capture the key differences between live and spoof faces.
We evaluate the proposed method on the datasets of Siw, Oulu-NPU, CASIA-MFSD and Replay-Attack.
arXiv Detail & Related papers (2021-08-09T13:35:30Z) - Wide Aspect Ratio Matching for Robust Face Detection [11.593495085674345]
The max IoUs between anchors and extreme aspect ratio faces are still lower than fixed sampling threshold.
We propose a Wide Aspect Ratio Matching (WARM) strategy to collect more representative positive anchors from ground-truth faces.
We also present a novel feature enhancement module, named Receptive Field Diversity (RFD) module, to provide diverse receptive field corresponding to different aspect ratios.
arXiv Detail & Related papers (2021-03-10T11:05:38Z) - Face Anti-Spoofing with Human Material Perception [76.4844593082362]
Face anti-spoofing (FAS) plays a vital role in securing the face recognition systems from presentation attacks.
We rephrase face anti-spoofing as a material recognition problem and combine it with classical human material perception.
We propose the Bilateral Convolutional Networks (BCN), which is able to capture intrinsic material-based patterns.
arXiv Detail & Related papers (2020-07-04T18:25:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.