An advanced YOLOv3 method for small object detection
- URL: http://arxiv.org/abs/2212.02809v3
- Date: Wed, 22 Mar 2023 04:08:48 GMT
- Title: An advanced YOLOv3 method for small object detection
- Authors: Baokai Liu, Fengjie He, Shiqiang Du, Jiacheng Li, Wenjie Liu
- Abstract summary: This paper introduces an improved YOLOv3 algorithm for small object detection.
In the proposed method, the dilated convolutions mish (DCM) module is introduced into the backbone network of YOLOv3.
In the neck network of YOLOv3, the convolutional block attention module (CBAM) and multi-level fusion module are introduced.
- Score: 2.906551456030129
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Small object detection has important application value in the fields of
autonomous driving and drone scene analysis. As one of the most advanced object
detection algorithms, YOLOv3 suffers some challenges when detecting small
objects, such as the problem of detection failure of small objects and occluded
objects. To solve these problems, an improved YOLOv3 algorithm for small object
detection is proposed. In the proposed method, the dilated convolutions mish
(DCM) module is introduced into the backbone network of YOLOv3 to improve the
feature expression ability by fusing the feature maps of different receptive
fields. In the neck network of YOLOv3, the convolutional block attention module
(CBAM) and multi-level fusion module are introduced to select the important
information for small object detection in the shallow network, suppress the
uncritical information, and use the fusion module to fuse the feature maps of
different scales, so as to improve the detection accuracy of the algorithm. In
addition, the Soft-NMS and Complete-IoU (CloU) strategies are applied to
candidate frame screening, which improves the accuracy of the algorithm for the
detection of occluded objects. The ablation experiment of the MS COCO2017
object detection task proves the effectiveness of several modules introduced in
this paper for small object detection. The experimental results on the MS
COCO2017, VOC2007, and VOC2012 datasets show that the Average Precision (AP) of
this method is 16.5%, 8.71%, and 9.68% higher than that of YOLOv3,
respectively.
Related papers
- SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes [1.3812010983144802]
Small Object Detection YOLOv8 (SOD-YOLOv8) is designed for scenarios involving numerous small objects.
SOD-YOLOv8 significantly improves small object detection, surpassing widely used models in various metrics.
In dynamic real-world traffic scenes, SOD-YOLOv8 demonstrated notable improvements in diverse conditions.
arXiv Detail & Related papers (2024-08-08T23:05:25Z) - DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection [6.635903943457569]
The original YOLO algorithm has low overall detection accuracy due to its weak ability to perceive targets of different scales.
This paper proposes a dynamic-attention scale-sequence fusion algorithm (DASSF) for small target detection in aerial images.
Experimental results show that when the DASSF method is applied to YOLOv8, compared to YOLOv8n, the model shows an increase of 9.2% and 2.4% in the mean average precision (mAP)
arXiv Detail & Related papers (2024-06-18T05:26:44Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - YOLOv3 with Spatial Pyramid Pooling for Object Detection with Unmanned
Aerial Vehicles [0.0]
We aim to improve the performance of the one-stage detector YOLOv3 by adding a Spatial Pyramid Pooling layer on the end of the backbone darknet-53.
We also conducted an evaluation study on different versions of YOLOv3 methods.
arXiv Detail & Related papers (2023-05-21T04:41:52Z) - A Tri-Layer Plugin to Improve Occluded Detection [100.99802831241583]
We propose a simple '' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects.
The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object.
We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects.
arXiv Detail & Related papers (2022-10-18T17:59:51Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - Delving into Localization Errors for Monocular 3D Object Detection [85.77319416168362]
Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving.
In this work, we quantify the impact introduced by each sub-task and find the localization error' is the vital factor in restricting monocular 3D detection.
arXiv Detail & Related papers (2021-03-30T10:38:01Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - Real-time object detection method based on improved YOLOv4-tiny [0.0]
YOLOv4-tiny is proposed based on YOLOv4 to simple the network structure and reduce parameters, which makes it be suitable for developing on the mobile and embedded devices.
It firstly uses two ResBlock-D modules in ResNet-D network instead of two CSPBlock modules in Yolov4-tiny, which reduces the computation complexity.
In the design of auxiliary network, two consecutive 3x3 convolutions are used to obtain 5x5 receptive fields to extract global features, and channel attention and spatial attention are also used to extract more effective information.
arXiv Detail & Related papers (2020-11-09T08:26:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.