Underwater target detection based on improved YOLOv7
- URL: http://arxiv.org/abs/2302.06939v1
- Date: Tue, 14 Feb 2023 09:50:52 GMT
- Title: Underwater target detection based on improved YOLOv7
- Authors: Kaiyue Liu, Qi Sun, Daming Sun, Mengduo Yang, Nizhuan Wang
- Abstract summary: This study proposes an improved YOLOv7 network (YOLOv7-AC) for underwater target detection.
The proposed network utilizes an ACmixBlock module to replace the 3x3 convolution block in the E-ELAN structure.
A ResNet-ACmix module is designed to avoid feature information loss and reduce computation.
- Score: 7.264267222876267
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Underwater target detection is a crucial aspect of ocean exploration.
However, conventional underwater target detection methods face several
challenges such as inaccurate feature extraction, slow detection speed and lack
of robustness in complex underwater environments. To address these limitations,
this study proposes an improved YOLOv7 network (YOLOv7-AC) for underwater
target detection. The proposed network utilizes an ACmixBlock module to replace
the 3x3 convolution block in the E-ELAN structure, and incorporates jump
connections and 1x1 convolution architecture between ACmixBlock modules to
improve feature extraction and network reasoning speed. Additionally, a
ResNet-ACmix module is designed to avoid feature information loss and reduce
computation, while a Global Attention Mechanism (GAM) is inserted in the
backbone and head parts of the model to improve feature extraction.
Furthermore, the K-means++ algorithm is used instead of K-means to obtain
anchor boxes and enhance model accuracy. Experimental results show that the
improved YOLOv7 network outperforms the original YOLOv7 model and other popular
underwater target detection methods. The proposed network achieved a mean
average precision (mAP) value of 89.6% and 97.4% on the URPC dataset and
Brackish dataset, respectively, and demonstrated a higher frame per second
(FPS) compared to the original YOLOv7 model. The source code for this study is
publicly available at https://github.com/NZWANG/YOLOV7-AC. In conclusion, the
improved YOLOv7 network proposed in this study represents a promising solution
for underwater target detection and holds great potential for practical
applications in various underwater tasks.
Related papers
- LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO [0.9062164411594178]
LAM-YOLO is an object detection model specifically designed for drone-based images.
We introduce a light-occlusion attention mechanism to enhance the visibility of small targets under different lighting conditions.
Second, we utilize an improved SIB-IoU as the regression loss function to accelerate model convergence and enhance localization accuracy.
arXiv Detail & Related papers (2024-11-01T10:00:48Z) - YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection [0.0]
Existing detection methods for insulator defect identification from unmanned aerial vehicles struggle with complex background scenes and small objects.
This paper proposes a new attention-based foundation architecture, YOLO-ELA, to address this issue.
Experimental results on high-resolution UAV images show that our method achieved a state-of-the-art performance of 96.9% mAP0.5 and a real-time detection speed of 74.63 frames per second.
arXiv Detail & Related papers (2024-10-15T16:00:01Z) - PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection [59.355022416218624]
integration of point and voxel representations is becoming more common in LiDAR-based 3D object detection.
We propose a novel two-stage 3D object detector, called Point-Voxel Attention Fusion Network (PVAFN)
PVAFN uses a multi-pooling strategy to integrate both multi-scale and region-specific information effectively.
arXiv Detail & Related papers (2024-08-26T19:43:01Z) - Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization [11.969138981034247]
This paper presents an enhanced lightweight YOLOv5 technique customized for mobile devices.
The proposed model achieves a 1% increase in detection accuracy, a 13% reduction in FLOPs, and a 26% decrease in model parameters compared to the existing YOLOv5.
arXiv Detail & Related papers (2024-03-20T16:07:04Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - An advanced YOLOv3 method for small object detection [2.906551456030129]
This paper introduces an improved YOLOv3 algorithm for small object detection.
In the proposed method, the dilated convolutions mish (DCM) module is introduced into the backbone network of YOLOv3.
In the neck network of YOLOv3, the convolutional block attention module (CBAM) and multi-level fusion module are introduced.
arXiv Detail & Related papers (2022-12-06T07:58:21Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - The KFIoU Loss for Rotated Object Detection [115.334070064346]
In this paper, we argue that one effective alternative is to devise an approximate loss who can achieve trend-level alignment with SkewIoU loss.
Specifically, we model the objects as Gaussian distribution and adopt Kalman filter to inherently mimic the mechanism of SkewIoU.
The resulting new loss called KFIoU is easier to implement and works better compared with exact SkewIoU.
arXiv Detail & Related papers (2022-01-29T10:54:57Z) - AGSFCOS: Based on attention mechanism and Scale-Equalizing pyramid
network of object detection [10.824032219531095]
Our model has a certain improvement in accuracy compared with the current popular detection models on the COCO dataset.
Our optimal model can get 39.5% COCO AP under the background of ResNet50.
arXiv Detail & Related papers (2021-05-20T08:41:02Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.