Harmonizing output imbalance for defect segmentation on
extremely-imbalanced photovoltaic module cells images
- URL: http://arxiv.org/abs/2211.05295v4
- Date: Tue, 24 Oct 2023 10:08:10 GMT
- Title: Harmonizing output imbalance for defect segmentation on
extremely-imbalanced photovoltaic module cells images
- Authors: Jianye Yi, Xiaopin Zhong, Weixiang Liu, Zongze Wu, Yuanlong Deng and
Zhengguang Wu
- Abstract summary: When learning to segment defect regions in PV module cell images, Tiny Hidden Cracks (THC) lead to extremely-imbalanced samples.
This paper proposes an explicit measure for output imbalance; it generalizes a distribution-based loss that can handle different types of output imbalances; and it introduces a compound loss.
The proposed method is evaluated on four widely-used deep learning architectures and four datasets with varying degrees of input imbalance.
- Score: 17.472820798324143
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The continuous development of the photovoltaic (PV) industry has raised high
requirements for the quality of monocrystalline of PV module cells. When
learning to segment defect regions in PV module cell images, Tiny Hidden Cracks
(THC) lead to extremely-imbalanced samples. The ratio of defect pixels to
normal pixels can be as low as 1:2000. This extreme imbalance makes it
difficult to segment the THC of PV module cells, which is also a challenge for
semantic segmentation. To address the problem of segmenting defects on
extremely-imbalanced THC data, the paper makes contributions from three
aspects: (1) it proposes an explicit measure for output imbalance; (2) it
generalizes a distribution-based loss that can handle different types of output
imbalances; and (3) it introduces a compound loss with our adaptive
hyperparameter selection algorithm that can keep the consistency of training
and inference for harmonizing the output imbalance on extremelyimbalanced input
data. The proposed method is evaluated on four widely-used deep learning
architectures and four datasets with varying degrees of input imbalance. The
experimental results show that the proposed method outperforms existing
methods.
Related papers
- Misalignment-Robust Frequency Distribution Loss for Image Transformation [51.0462138717502]
This paper aims to address a common challenge in deep learning-based image transformation methods, such as image enhancement and super-resolution.
We introduce a novel and simple Frequency Distribution Loss (FDL) for computing distribution distance within the frequency domain.
Our method is empirically proven effective as a training constraint due to the thoughtful utilization of global information in the frequency domain.
arXiv Detail & Related papers (2024-02-28T09:27:41Z) - An extended asymmetric sigmoid with Perceptron (SIGTRON) for imbalanced linear classification [0.0]
This article presents a new parameterized sigmoid called SIGTRON, and its companion convex model called SIGTRON-imbalanced classification (SIC) model.
In contrast to the conventional $pi$-weighted cost-sensitive learning model, the SIC model does not have an external $pi$-weight on the loss function.
We show that the proposed SIC model is more adaptive to variations of the dataset.
arXiv Detail & Related papers (2023-12-26T13:14:17Z) - DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud
Registration [73.37538551605712]
Point Cloud Registration (PCR) estimates the relative rigid transformation between two point clouds.
We propose formulating PCR as a denoising diffusion probabilistic process, mapping noisy transformations to the ground truth.
Our experiments showcase the effectiveness of our DiffusionPCR, yielding state-of-the-art registration recall rates (95.3%/81.6%) on 3D and 3DLoMatch.
arXiv Detail & Related papers (2023-12-05T18:59:41Z) - Investigating Shift Equivalence of Convolutional Neural Networks in
Industrial Defect Segmentation [3.843350895842836]
In industrial defect segmentation tasks, output consistency (also referred to equivalence) of the model is often overlooked.
A novel pair of down/upsampling layers called component attention polyphase sampling (CAPS) is proposed as a replacement for the conventional sampling layers in CNNs.
The experimental results on the micro surface defect (MSD) dataset and four real-world industrial defect datasets demonstrate that the proposed method exhibits higher equivalence and segmentation performance.
arXiv Detail & Related papers (2023-09-29T00:04:47Z) - Fault diagnosis for PV arrays considering dust impact based on
transformed graphical feature of characteristic curves and convolutional
neural network with CBAM modules [3.8256083307758804]
A novel fault diagnosis method for PV arrays considering dust impact is proposed.
The model of convolutional neural network (CNN) with convolutional block attention modules (CBAM) is designed to extract fault differentiation information.
The developed method for PV arrays with different blocking diodes configurations under various operating conditions has high fault diagnosis accuracy and reliability.
arXiv Detail & Related papers (2023-03-24T09:55:11Z) - Meta-Causal Feature Learning for Out-of-Distribution Generalization [71.38239243414091]
This paper presents a balanced meta-causal learner (BMCL), which includes a balanced task generation module (BTG) and a meta-causal feature learning module (MCFL)
BMCL effectively identifies the class-invariant visual regions for classification and may serve as a general framework to improve the performance of the state-of-the-art methods.
arXiv Detail & Related papers (2022-08-22T09:07:02Z) - Uncovering the Over-smoothing Challenge in Image Super-Resolution: Entropy-based Quantification and Contrastive Optimization [67.99082021804145]
We propose an explicit solution to the COO problem, called Detail Enhanced Contrastive Loss (DECLoss)
DECLoss utilizes the clustering property of contrastive learning to directly reduce the variance of the potential high-resolution distribution.
We evaluate DECLoss on multiple super-resolution benchmarks and demonstrate that it improves the perceptual quality of PSNR-oriented models.
arXiv Detail & Related papers (2022-01-04T08:30:09Z) - CSformer: Bridging Convolution and Transformer for Compressive Sensing [65.22377493627687]
This paper proposes a hybrid framework that integrates the advantages of leveraging detailed spatial information from CNN and the global context provided by transformer for enhanced representation learning.
The proposed approach is an end-to-end compressive image sensing method, composed of adaptive sampling and recovery.
The experimental results demonstrate the effectiveness of the dedicated transformer-based architecture for compressive sensing.
arXiv Detail & Related papers (2021-12-31T04:37:11Z) - L3C-Stereo: Lossless Compression for Stereo Images [24.6995551743392]
A large number of autonomous driving tasks need high-definition stereo images, which requires a large amount of storage space.
To tackle this, we propose L3C-Stereo, a multi-scale lossless compression model consisting of two main modules.
In the experiments, our method outperforms the hand-crafted compression methods and the learning-based method on all three datasets used.
arXiv Detail & Related papers (2021-08-21T02:36:15Z) - Segmentation of cell-level anomalies in electroluminescence images of
photovoltaic modules [0.0]
We propose an end-to-end deep learning pipeline that detects, locates and segments cell-level anomalies from entire photovoltaic modules.
The proposed modular pipeline combines three deep learning techniques: 1. object detection (modified Faster-RNN), 2. image classification (EfficientNet) and 3. weakly supervised segmentation (autoencoder)
arXiv Detail & Related papers (2021-06-21T10:17:40Z) - Learnable Cost Volume Using the Cayley Representation [67.19770048548232]
We propose a learnable cost volume (LCV) for optical flow estimation models.
The proposed LCV is a lightweight module and can be easily plugged into existing models to replace the vanilla cost volume.
Experimental results show that the LCV module not only improves the accuracy of state-of-the-art models on standard benchmarks, but also promotes their robustness against illumination change, noises, and adversarial perturbations of the input signals.
arXiv Detail & Related papers (2020-07-21T01:59:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.