Photovoltaic Panel Defect Detection Based on Ghost Convolution with
BottleneckCSP and Tiny Target Prediction Head Incorporating YOLOv5
- URL: http://arxiv.org/abs/2303.00886v1
- Date: Thu, 2 Mar 2023 01:06:35 GMT
- Title: Photovoltaic Panel Defect Detection Based on Ghost Convolution with
BottleneckCSP and Tiny Target Prediction Head Incorporating YOLOv5
- Authors: Longlong Li, Zhifeng Wang, Tingting Zhang
- Abstract summary: This paper proposes an approach named Ghost convolution with BottleneckCSP and a tiny target prediction head for PV panel defect detection.
The BottleneckCSP module is introduced to add a prediction head for tiny target detection to alleviate tiny defect misses.
The proposed PV panel surface-defect detection network improves the mAP performance by at least 27.8%.
- Score: 5.632384612137748
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Photovoltaic (PV) panel surface-defect detection technology is crucial for
the PV industry to perform smart maintenance. Using computer vision technology
to detect PV panel surface defects can ensure better accuracy while reducing
the workload of traditional worker field inspections. However, multiple tiny
defects on the PV panel surface and the high similarity between different
defects make it challenging to {accurately identify and detect such defects}.
This paper proposes an approach named Ghost convolution with BottleneckCSP and
a tiny target prediction head incorporating YOLOv5 (GBH-YOLOv5) for PV panel
defect detection. To ensure better accuracy on multiscale targets, the
BottleneckCSP module is introduced to add a prediction head for tiny target
detection to alleviate tiny defect misses, using Ghost convolution to improve
the model inference speed and reduce the number of parameters. First, the
original image is compressed and cropped to enlarge the defect size physically.
Then, the processed images are input into GBH-YOLOv5, and the depth features
are extracted through network processing based on Ghost convolution, the
application of the BottleneckCSP module, and the prediction head of tiny
targets. Finally, the extracted features are classified by a Feature Pyramid
Network (FPN) and a Path Aggregation Network (PAN) structure. Meanwhile, we
compare our method with state-of-the-art methods to verify the effectiveness of
the proposed method. The proposed PV panel surface-defect detection network
improves the mAP performance by at least 27.8%.
Related papers
- YOLO-pdd: A Novel Multi-scale PCB Defect Detection Method Using Deep Representations with Sequential Images [7.368447690189973]
This paper proposes a high-precision, robust, and real-time end-to-end method for PCB defect detection based on CNN.
We propose a novel approach combining YOLOv5 and multiscale modules for hierarchical residual-like connections.
Experiments on a large-scale PCB dataset demonstrate significant improvements in precision, recall, and F1-score compared to existing methods.
arXiv Detail & Related papers (2024-07-22T07:08:22Z) - MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery Detection [54.545054873239295]
Deepfakes have recently raised significant trust issues and security concerns among the public.
ViT-based methods take advantage of the expressivity of transformers, achieving superior detection performance.
This work introduces Mixture-of-Experts modules for Face Forgery Detection (MoE-FFD), a generalized yet parameter-efficient ViT-based approach.
arXiv Detail & Related papers (2024-04-12T13:02:08Z) - Global Context Aggregation Network for Lightweight Saliency Detection of
Surface Defects [70.48554424894728]
We develop a Global Context Aggregation Network (GCANet) for lightweight saliency detection of surface defects on the encoder-decoder structure.
First, we introduce a novel transformer encoder on the top layer of the lightweight backbone, which captures global context information through a novel Depth-wise Self-Attention (DSA) module.
The experimental results on three public defect datasets demonstrate that the proposed network achieves a better trade-off between accuracy and running efficiency compared with other 17 state-of-the-art methods.
arXiv Detail & Related papers (2023-09-22T06:19:11Z) - CINFormer: Transformer network with multi-stage CNN feature injection
for surface defect segmentation [73.02218479926469]
We propose a transformer network with multi-stage CNN feature injection for surface defect segmentation.
CINFormer presents a simple yet effective feature integration mechanism that injects the multi-level CNN features of the input image into different stages of the transformer network in the encoder.
In addition, CINFormer presents a Top-K self-attention module to focus on tokens with more important information about the defects.
arXiv Detail & Related papers (2023-09-22T06:12:02Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Lightweight wood panel defect detection method incorporating attention
mechanism and feature fusion network [9.775181958901326]
We propose a lightweight wood panel defect detection method called YOLOv5-LW, which incorporates attention mechanisms and a feature fusion network.
The proposed method achieves a 92.8% accuracy rate, reduces the number of parameters by 27.78%, compresses computational volume by 41.25%, improves detection inference speed by 10.16%.
arXiv Detail & Related papers (2023-06-21T08:55:45Z) - Adaptive Sparse Convolutional Networks with Global Context Enhancement
for Faster Object Detection on Drone Images [26.51970603200391]
This paper investigates optimizing the detection head based on the sparse convolution.
It suffers from inadequate integration of contextual information of tiny objects.
We propose a novel global context-enhanced adaptive sparse convolutional network.
arXiv Detail & Related papers (2023-03-25T14:42:50Z) - Concealed Object Detection for Passive Millimeter-Wave Security Imaging
Based on Task-Aligned Detection Transformer [6.524763502003648]
This paper proposes a Task-Aligned Detection Transformer network, named PMMW-DETR.
In the first stage, a Denoising Coarse-to-Fine Transformer (DCFT) backbone is designed to extract long- and short-range features in the different scales.
In the second stage, we propose the Query Selection module to introduce learned spatial features into the network as prior knowledge.
In the third stage, aiming to improve the classification performance, we perform a Task-Aligned Dual-Head block to decouple the classification and regression tasks.
arXiv Detail & Related papers (2022-12-01T07:03:29Z) - Efficient Decoder-free Object Detection with Transformers [75.00499377197475]
Vision transformers (ViTs) are changing the landscape of object detection approaches.
We propose a decoder-free fully transformer-based (DFFT) object detector.
DFFT_SMALL achieves high efficiency in both training and inference stages.
arXiv Detail & Related papers (2022-06-14T13:22:19Z) - A Multi-Stage model based on YOLOv3 for defect detection in PV panels
based on IR and Visible Imaging by Unmanned Aerial Vehicle [65.99880594435643]
We propose a novel model to detect panel defects on aerial images captured by unmanned aerial vehicle.
The model combines detections of panels and defects to refine its accuracy.
The proposed model has been validated on two big PV plants in the south of Italy.
arXiv Detail & Related papers (2021-11-23T08:04:32Z) - BAF-Detector: An Efficient CNN-Based Detector for Photovoltaic Cell
Defect Detection [7.151552353494974]
In this paper, attention-based top-down and bottom-up architecture is developed to accomplish multi-scale feature fusion.
A novel object detector is proposed, called BAF-Detector, which embeds BAFPN into Region Proposal Network (RPN) in Faster RCNN+FPN.
The experimental results on a large-scale EL dataset including 3629 images, 2129 of which are defective, show that the proposed method achieves 98.70% (F-measure), 88.07% (mAP), and 73.29% (IoU) in terms of multi-scale defects classification and
arXiv Detail & Related papers (2020-12-19T08:48:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.