An Iteratively Optimized Patch Label Inference Network for Automatic
Pavement Distress Detection
- URL: http://arxiv.org/abs/2005.13298v3
- Date: Thu, 8 Sep 2022 07:09:12 GMT
- Title: An Iteratively Optimized Patch Label Inference Network for Automatic
Pavement Distress Detection
- Authors: Wenhao Tang and Sheng Huang and Qiming Zhao and Ren Li and Luwen
Huangfu
- Abstract summary: We present a novel deep learning framework named the Iteratively optimized Patch Label Inference Network (IOPLIN) for automatically detecting various pavement distresses.
IOPLIN can be iteratively trained with only the image label via the Expectation-Maximization Inspired Patch Label Distillation strategy.
It is able to handle images in different resolutions, and sufficiently utilize image information particularly for the high-resolution ones.
- Score: 12.89160593375335
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: We present a novel deep learning framework named the Iteratively Optimized
Patch Label Inference Network (IOPLIN) for automatically detecting various
pavement distresses that are not solely limited to specific ones, such as
cracks and potholes. IOPLIN can be iteratively trained with only the image
label via the Expectation-Maximization Inspired Patch Label Distillation
(EMIPLD) strategy, and accomplish this task well by inferring the labels of
patches from the pavement images. IOPLIN enjoys many desirable properties over
the state-of-the-art single branch CNN models such as GoogLeNet and
EfficientNet. It is able to handle images in different resolutions, and
sufficiently utilize image information particularly for the high-resolution
ones, since IOPLIN extracts the visual features from unrevised image patches
instead of the resized entire image. Moreover, it can roughly localize the
pavement distress without using any prior localization information in the
training phase. In order to better evaluate the effectiveness of our method in
practice, we construct a large-scale Bituminous Pavement Disease Detection
dataset named CQU-BPDD consisting of 60,059 high-resolution pavement images,
which are acquired from different areas at different times. Extensive results
on this dataset demonstrate the superiority of IOPLIN over the state-of-the-art
image classification approaches in automatic pavement distress detection. The
source codes of IOPLIN are released on
\url{https://github.com/DearCaat/ioplin}, and the CQU-BPDD dataset is able to
be accessed on \url{https://dearcaat.github.io/CQU-BPDD/}.
Related papers
- Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model [92.61216319417208]
We propose a novel diffusion model (DM)-based framework, dubbed ours, for image deblurring.<n>ours performs DM to generate the prior knowledge that aids in recovering the textures of blurry images.<n>To fully exploit the generated texture priors, we present the Texture Transfer Transformer layer (TTformer)
arXiv Detail & Related papers (2025-07-18T01:50:31Z) - Semi-supervised 3D Object Detection with PatchTeacher and PillarMix [71.4908268136439]
Current semi-supervised 3D object detection methods typically use a teacher to generate pseudo labels for a student.
We propose PatchTeacher, which focuses on partial scene 3D object detection to provide high-quality pseudo labels for the student.
We introduce three key techniques, i.e., Patch Normalizer, Quadrant Align, and Fovea Selection, to improve the performance of PatchTeacher.
arXiv Detail & Related papers (2024-07-13T06:58:49Z) - CLIP-Guided Source-Free Object Detection in Aerial Images [17.26407623526735]
High-resolution aerial images often require substantial storage space and may not be readily accessible to the public.
We propose a novel Source-Free Object Detection (SFOD) method to address these challenges.
To alleviate the noisy labels in self-training, we utilize Contrastive Language-Image Pre-training (CLIP) to guide the generation of pseudo-labels.
By leveraging CLIP's zero-shot classification capability, we aggregate its scores with the original predicted bounding boxes, enabling us to obtain refined scores for the pseudo-labels.
arXiv Detail & Related papers (2024-01-10T14:03:05Z) - Probabilistic Deep Metric Learning for Hyperspectral Image
Classification [91.5747859691553]
This paper proposes a probabilistic deep metric learning framework for hyperspectral image classification.
It aims to predict the category of each pixel for an image captured by hyperspectral sensors.
Our framework can be readily applied to existing hyperspectral image classification methods.
arXiv Detail & Related papers (2022-11-15T17:57:12Z) - Towards Effective Image Manipulation Detection with Proposal Contrastive
Learning [61.5469708038966]
We propose Proposal Contrastive Learning (PCL) for effective image manipulation detection.
Our PCL consists of a two-stream architecture by extracting two types of global features from RGB and noise views respectively.
Our PCL can be easily adapted to unlabeled data in practice, which can reduce manual labeling costs and promote more generalizable features.
arXiv Detail & Related papers (2022-10-16T13:30:13Z) - PicT: A Slim Weakly Supervised Vision Transformer for Pavement Distress
Classification [10.826472503315912]
We present a vision Transformer named textbfPavement textbfImage textbfClassification textbfPicT for pavement distress classification.
textbfPicT outperforms the second-best performed model by a large margin.
arXiv Detail & Related papers (2022-09-21T02:33:49Z) - Weakly Supervised Patch Label Inference Networks for Efficient Pavement
Distress Detection and Recognition in the Wild [14.16549562799135]
We present Weakly Supervised Patch Label Inference Networks (WSPLIN) for efficiently addressing pavement image classification tasks.
WSPLIN transforms the fully supervised pavement image classification problem into a weakly supervised pavement patch classification problem.
We evaluate our method on a large-scale bituminous pavement distress dataset.
arXiv Detail & Related papers (2022-03-31T04:01:02Z) - Patch-Based Stochastic Attention for Image Editing [4.8201607588546]
We propose an efficient attention layer based on the algorithm PatchMatch, which is used for determining approximate nearest neighbors.
We demonstrate the usefulness of PSAL on several image editing tasks, such as image inpainting, guided image colorization, and single-image super-resolution.
arXiv Detail & Related papers (2022-02-07T13:42:00Z) - Maximize the Exploration of Congeneric Semantics for Weakly Supervised
Semantic Segmentation [27.155133686127474]
We construct a graph neural network (P-GNN) based on the self-detected patches from different images that contain the same class labels.
We conduct experiments on the popular PASCAL VOC 2012 benchmarks, and our model yields state-of-the-art performance.
arXiv Detail & Related papers (2021-10-08T08:59:16Z) - DSNet: A Dual-Stream Framework for Weakly-Supervised Gigapixel Pathology
Image Analysis [78.78181964748144]
We present a novel weakly-supervised framework for classifying whole slide images (WSIs)
WSIs are commonly processed by patch-wise classification with patch-level labels.
With image-level labels only, patch-wise classification would be sub-optimal due to inconsistency between the patch appearance and image-level label.
arXiv Detail & Related papers (2021-09-13T09:10:43Z) - Semi-Supervised Domain Adaptation with Prototypical Alignment and
Consistency Learning [86.6929930921905]
This paper studies how much it can help address domain shifts if we further have a few target samples labeled.
To explore the full potential of landmarks, we incorporate a prototypical alignment (PA) module which calculates a target prototype for each class from the landmarks.
Specifically, we severely perturb the labeled images, making PA non-trivial to achieve and thus promoting model generalizability.
arXiv Detail & Related papers (2021-04-19T08:46:08Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.