Related papers: Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild

Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild

URL: http://arxiv.org/abs/2203.16782v2
Date: Mon, 3 Apr 2023 03:07:56 GMT
Title: Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild
Authors: Sheng Huang and Wenhao Tang and Guixin Huang and Luwen Huangfu and Dan Yang
Abstract summary: We present Weakly Supervised Patch Label Inference Networks (WSPLIN) for efficiently addressing pavement image classification tasks. WSPLIN transforms the fully supervised pavement image classification problem into a weakly supervised pavement patch classification problem. We evaluate our method on a large-scale bituminous pavement distress dataset.
Score: 14.16549562799135
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Automatic image-based pavement distress detection and recognition are vital for pavement maintenance and management. However, existing deep learning-based methods largely omit the specific characteristics of pavement images, such as high image resolution and low distress area ratio, and are not end-to-end trainable. In this paper, we present a series of simple yet effective end-to-end deep learning approaches named Weakly Supervised Patch Label Inference Networks (WSPLIN) for efficiently addressing these tasks under various application settings. WSPLIN transforms the fully supervised pavement image classification problem into a weakly supervised pavement patch classification problem for solutions. Specifically, WSPLIN first divides the pavement image under different scales into patches with different collection strategies and then employs a Patch Label Inference Network (PLIN) to infer the labels of these patches to fully exploit the resolution and scale information. Notably, we design a patch label sparsity constraint based on the prior knowledge of distress distribution and leverage the Comprehensive Decision Network (CDN) to guide the training of PLIN in a weakly supervised way. Therefore, the patch labels produced by PLIN provide interpretable intermediate information, such as the rough location and the type of distress. We evaluate our method on a large-scale bituminous pavement distress dataset named CQU-BPDD and the augmented Crack500 (Crack500-PDD) dataset, which is a newly constructed pavement distress detection dataset augmented from the Crack500. Extensive results demonstrate the superiority of our method over baselines in both performance and efficiency. The source codes of WSPLIN are released on https://github.com/DearCaat/wsplin.

Related papers

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix [71.4908268136439]
Current semi-supervised 3D object detection methods typically use a teacher to generate pseudo labels for a student. We propose PatchTeacher, which focuses on partial scene 3D object detection to provide high-quality pseudo labels for the student. We introduce three key techniques, i.e., Patch Normalizer, Quadrant Align, and Fovea Selection, to improve the performance of PatchTeacher.
arXiv Detail & Related papers (2024-07-13T06:58:49Z)
Learning to Rank Patches for Unbiased Image Redundancy Reduction [80.93989115541966]
Images suffer from heavy spatial redundancy because pixels in neighboring regions are spatially correlated. Existing approaches strive to overcome this limitation by reducing less meaningful image regions. We propose a self-supervised framework for image redundancy reduction called Learning to Rank Patches.
arXiv Detail & Related papers (2024-03-31T13:12:41Z)
PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions [119.17602768128806]
It is hard to manually label edges accurately, especially for large datasets. This paper proposes to learn Pixel-level NoiseTransitions to model the label-corruption process.
arXiv Detail & Related papers (2023-07-26T09:45:17Z)
Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition [9.927965682734069]
A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of CNN models.
arXiv Detail & Related papers (2022-12-04T10:40:40Z)
PicT: A Slim Weakly Supervised Vision Transformer for Pavement Distress Classification [10.826472503315912]
We present a vision Transformer named textbfPavement textbfImage textbfClassification textbfPicT for pavement distress classification. textbfPicT outperforms the second-best performed model by a large margin.
arXiv Detail & Related papers (2022-09-21T02:33:49Z)
Unsupervised Domain Adaptive Salient Object Detection Through Uncertainty-Aware Pseudo-Label Learning [104.00026716576546]
We propose to learn saliency from synthetic but clean labels, which naturally has higher pixel-labeling quality without the effort of manual annotations. We show that our proposed method outperforms the existing state-of-the-art deep unsupervised SOD methods on several benchmark datasets.
arXiv Detail & Related papers (2022-02-26T16:03:55Z)
Patch-Based Stochastic Attention for Image Editing [4.8201607588546]
We propose an efficient attention layer based on the algorithm PatchMatch, which is used for determining approximate nearest neighbors. We demonstrate the usefulness of PSAL on several image editing tasks, such as image inpainting, guided image colorization, and single-image super-resolution.
arXiv Detail & Related papers (2022-02-07T13:42:00Z)
Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision [66.56535902642085]
This paper tackles the problem of fine-grained region detection in deformed clothes using only a depth image. We define up to 6 semantic regions of varying extent, including edges on the neckline, sleeve cuffs, and hem, plus top and bottom grasping points. We introduce a U-net based network to segment and label these parts. We show that training our network solely with synthetic data and the proposed DA yields results competitive with models trained on real data.
arXiv Detail & Related papers (2021-10-06T16:31:20Z)
An Iteratively Optimized Patch Label Inference Network for Automatic Pavement Distress Detection [12.89160593375335]
We present a novel deep learning framework named the Iteratively optimized Patch Label Inference Network (IOPLIN) for automatically detecting various pavement distresses. IOPLIN can be iteratively trained with only the image label via the Expectation-Maximization Inspired Patch Label Distillation strategy. It is able to handle images in different resolutions, and sufficiently utilize image information particularly for the high-resolution ones.
arXiv Detail & Related papers (2020-05-27T11:56:38Z)
Solving Missing-Annotation Object Detection with Background Recalibration Loss [49.42997894751021]
This paper focuses on a novel and challenging detection scenario: A majority of true objects/instances is unlabeled in the datasets. Previous art has proposed to use soft sampling to re-weight the gradients of RoIs based on the overlaps with positive instances, while their method is mainly based on the two-stage detector. In this paper, we introduce a superior solution called Background Recalibration Loss (BRL) that can automatically re-calibrate the loss signals according to the pre-defined IoU threshold and input image.
arXiv Detail & Related papers (2020-02-12T23:11:46Z)
Localizing Interpretable Multi-scale informative Patches Derived from Media Classification Task [12.447143226347922]
We construct an interpretable AnchorNet equipped with our carefully designed RFs and linearly spatial aggregation. We show that localized patches can indeed retain the most semantics and evidences of the original inputs.
arXiv Detail & Related papers (2020-01-31T10:04:18Z)

This list is automatically generated from the titles and abstracts of the papers in this site.

This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.