Long-distance tiny face detection based on enhanced YOLOv3 for unmanned
system
- URL: http://arxiv.org/abs/2010.04421v1
- Date: Fri, 9 Oct 2020 08:12:58 GMT
- Title: Long-distance tiny face detection based on enhanced YOLOv3 for unmanned
system
- Authors: Jia-Yi Chang, Yan-Feng Lu, Ya-Jun Liu, Bo Zhou, Hong Qiao
- Abstract summary: We propose an enhanced network model (YOLOv3-C) based on the YOLOv3 algorithm for unmanned platform.
The enhanced model improves the accuracy of tiny face detection in the cases of long-distance and high-density crowds.
- Score: 10.856903504701712
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Remote tiny face detection applied in unmanned system is a challeng-ing work.
The detector cannot obtain sufficient context semantic information due to the
relatively long distance. The received poor fine-grained features make the face
detection less accurate and robust. To solve the problem of long-distance
detection of tiny faces, we propose an enhanced network model (YOLOv3-C) based
on the YOLOv3 algorithm for unmanned platform. In this model, we bring in
multi-scale features from feature pyramid networks and make the features
fu-sion to adjust prediction feature map of the output, which improves the
sensitivity of the entire algorithm for tiny target faces. The enhanced model
improves the accuracy of tiny face detection in the cases of long-distance and
high-density crowds. The experimental evaluation results demonstrated the
superior perfor-mance of the proposed YOLOv3-C in comparison with other
relevant detectors in remote tiny face detection. It is worth mentioning that
our proposed method achieves comparable performance with the state of the art
YOLOv4[1] in the tiny face detection tasks.
Related papers
- EfficientSRFace: An Efficient Network with Super-Resolution Enhancement
for Accurate Face Detection [18.977044046941813]
In face detection, low-resolution faces, such as numerous small faces of a human group in a crowded scene, are common in dense face prediction tasks.
We develop an efficient detector termed EfficientSRFace by introducing a feature-level super-resolution reconstruction network.
This module plays an auxiliary role in the training process, and can be removed during the inference without increasing the inference time.
arXiv Detail & Related papers (2023-06-04T06:49:44Z) - An advanced YOLOv3 method for small object detection [2.906551456030129]
This paper introduces an improved YOLOv3 algorithm for small object detection.
In the proposed method, the dilated convolutions mish (DCM) module is introduced into the backbone network of YOLOv3.
In the neck network of YOLOv3, the convolutional block attention module (CBAM) and multi-level fusion module are introduced.
arXiv Detail & Related papers (2022-12-06T07:58:21Z) - YOLO-FaceV2: A Scale and Occlusion Aware Face Detector [3.632920443532506]
We propose a real-time face detector based on the one-stage detector YOLOv5, named YOLO-FaceV2.
We use a Receptive Field Enhancement module called RFE to enhance receptive field of small face, and use NWD Loss to make up for the sensitivity of IoU to the location deviation of tiny objects.
We show that our face detector outperforms YOLO and its variants can be find in all easy, medium and hard subsets.
arXiv Detail & Related papers (2022-08-03T12:40:00Z) - Enhanced Single-shot Detector for Small Object Detection in Remote
Sensing Images [33.84369068593722]
We propose image pyramid single-shot detector (IPSSD) for small-scale object detection.
In IPSSD, single-shot detector is adopted combined with an image pyramid network to extract semantically strong features for generating candidate regions.
The proposed network can enhance the small-scale features from a feature pyramid network.
arXiv Detail & Related papers (2022-05-12T07:35:07Z) - SALISA: Saliency-based Input Sampling for Efficient Video Object
Detection [58.22508131162269]
We propose SALISA, a novel non-uniform SALiency-based Input SAmpling technique for video object detection.
We show that SALISA significantly improves the detection of small objects.
arXiv Detail & Related papers (2022-04-05T17:59:51Z) - EResFD: Rediscovery of the Effectiveness of Standard Convolution for
Lightweight Face Detection [13.357235715178584]
We re-examine the effectiveness of the standard convolutional block as a lightweight backbone architecture for face detection.
We show that heavily channel-pruned standard convolution layers can achieve better accuracy and inference speed.
Our proposed detector EResFD obtained 80.4% mAP on WIDER FACE Hard subset which only takes 37.7 ms for VGA image inference on CPU.
arXiv Detail & Related papers (2022-04-04T02:30:43Z) - The KFIoU Loss for Rotated Object Detection [115.334070064346]
In this paper, we argue that one effective alternative is to devise an approximate loss who can achieve trend-level alignment with SkewIoU loss.
Specifically, we model the objects as Gaussian distribution and adopt Kalman filter to inherently mimic the mechanism of SkewIoU.
The resulting new loss called KFIoU is easier to implement and works better compared with exact SkewIoU.
arXiv Detail & Related papers (2022-01-29T10:54:57Z) - Delving into Localization Errors for Monocular 3D Object Detection [85.77319416168362]
Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving.
In this work, we quantify the impact introduced by each sub-task and find the localization error' is the vital factor in restricting monocular 3D detection.
arXiv Detail & Related papers (2021-03-30T10:38:01Z) - PLUME: Efficient 3D Object Detection from Stereo Images [95.31278688164646]
Existing methods tackle the problem in two steps: first depth estimation is performed, a pseudo LiDAR point cloud representation is computed from the depth estimates, and then object detection is performed in 3D space.
We propose a model that unifies these two tasks in the same metric space.
Our approach achieves state-of-the-art performance on the challenging KITTI benchmark, with significantly reduced inference time compared with existing methods.
arXiv Detail & Related papers (2021-01-17T05:11:38Z) - AOT: Appearance Optimal Transport Based Identity Swapping for Forgery
Detection [76.7063732501752]
We provide a new identity swapping algorithm with large differences in appearance for face forgery detection.
The appearance gaps mainly arise from the large discrepancies in illuminations and skin colors.
A discriminator is introduced to distinguish the fake parts from a mix of real and fake image patches.
arXiv Detail & Related papers (2020-11-05T06:17:04Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.