Kernel Inversed Pyramidal Resizing Network for Efficient Pavement
Distress Recognition
- URL: http://arxiv.org/abs/2212.01790v1
- Date: Sun, 4 Dec 2022 10:40:40 GMT
- Title: Kernel Inversed Pyramidal Resizing Network for Efficient Pavement
Distress Recognition
- Authors: Rong Qin and Luwen Huangfu and Devon Hood and James Ma and Sheng Huang
- Abstract summary: A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing.
In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information.
Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of CNN models.
- Score: 9.927965682734069
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Pavement Distress Recognition (PDR) is an important step in pavement
inspection and can be powered by image-based automation to expedite the process
and reduce labor costs. Pavement images are often in high-resolution with a low
ratio of distressed to non-distressed areas. Advanced approaches leverage these
properties via dividing images into patches and explore discriminative features
in the scale space. However, these approaches usually suffer from information
loss during image resizing and low efficiency due to complex learning
frameworks. In this paper, we propose a novel and efficient method for PDR. A
light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is
introduced for image resizing, and can be flexibly plugged into the image
classification network as a pre-network to exploit resolution and scale
information. In KIPRN, pyramidal convolution and kernel inversed convolution
are specifically designed to mine discriminative information across different
feature granularities and scales. The mined information is passed along to the
resized images to yield an informative image pyramid to assist the image
classification network for PDR. We applied our method to three well-known
Convolutional Neural Networks (CNNs), and conducted an evaluation on a
large-scale pavement image dataset named CQU-BPDD. Extensive results
demonstrate that KIPRN can generally improve the pavement distress recognition
of these CNN models and show that the simple combination of KIPRN and
EfficientNet-B3 significantly outperforms the state-of-the-art patch-based
method in both performance and efficiency.
Related papers
- AMSA-UNet: An Asymmetric Multiple Scales U-net Based on Self-attention for Deblurring [7.00986132499006]
asymmetric multiple scales U-net based on self-attention (AMSA-UNet) is proposed to improve the accuracy and computational complexity.
By introducing a multiple-scales U shape architecture, the network can focus on blurry regions at the global level and better recover image details at the local level.
arXiv Detail & Related papers (2024-06-13T11:39:02Z) - Parameter-Inverted Image Pyramid Networks [49.35689698870247]
We propose a novel network architecture known as the Inverted Image Pyramid Networks (PIIP)
Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid.
PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification.
arXiv Detail & Related papers (2024-06-06T17:59:10Z) - EPNet: An Efficient Pyramid Network for Enhanced Single-Image
Super-Resolution with Reduced Computational Requirements [12.439807086123983]
Single-image super-resolution (SISR) has seen significant advancements through the integration of deep learning.
This paper introduces a new Efficient Pyramid Network (EPNet) that harmoniously merges an Edge Split Pyramid Module (ESPM) with a Panoramic Feature Extraction Module (PFEM) to overcome the limitations of existing methods.
arXiv Detail & Related papers (2023-12-20T19:56:53Z) - PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded
Diffraction Patterns Phase Retrieval [6.982256124089]
Phase retrieval is a challenge nonlinear inverse problem in computational imaging and image processing.
We have developed PRISTA-Net, a deep unfolding network based on the first-order iterative threshold threshold algorithm (ISTA)
All parameters in the proposed PRISTA-Net framework, including the nonlinear transformation, threshold, and step size, are learned-to-end instead of being set.
arXiv Detail & Related papers (2023-09-08T07:37:15Z) - Deep Multi-Threshold Spiking-UNet for Image Processing [51.88730892920031]
This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture.
To achieve an efficient Spiking-UNet, we face two primary challenges: ensuring high-fidelity information propagation through the network via spikes and formulating an effective training strategy.
Experimental results show that, on image segmentation and denoising, our Spiking-UNet achieves comparable performance to its non-spiking counterpart.
arXiv Detail & Related papers (2023-07-20T16:00:19Z) - Rank-Enhanced Low-Dimensional Convolution Set for Hyperspectral Image
Denoising [50.039949798156826]
This paper tackles the challenging problem of hyperspectral (HS) image denoising.
We propose rank-enhanced low-dimensional convolution set (Re-ConvSet)
We then incorporate Re-ConvSet into the widely-used U-Net architecture to construct an HS image denoising method.
arXiv Detail & Related papers (2022-07-09T13:35:12Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Deep Triplet Hashing Network for Case-based Medical Image Retrieval [33.21919320742157]
We propose an end-to-end framework, called Attention-based Triplet Hashing (ATH) network, to learn low-dimensional hash codes.
We show that our proposed ATH can further improve the retrieval performance compared to the state-of-the-art deep hashing methods.
arXiv Detail & Related papers (2021-01-29T01:35:46Z) - FDRN: A Fast Deformable Registration Network for Medical Images [3.3504365823045044]
We propose a fast convolutional neural network to boost the registration performance in both accuracy and runtime.
We show FDRN outperforms the existing state-of-the-art registration methods for brain MR images by resorting to the compact network structure and efficient learning.
arXiv Detail & Related papers (2020-11-04T14:09:51Z) - Cross-Scale Internal Graph Neural Network for Image Super-Resolution [147.77050877373674]
Non-local self-similarity in natural images has been well studied as an effective prior in image restoration.
For single image super-resolution (SISR), most existing deep non-local methods only exploit similar patches within the same scale of the low-resolution (LR) input image.
This is achieved using a novel cross-scale internal graph neural network (IGNN)
arXiv Detail & Related papers (2020-06-30T10:48:40Z) - Deep Adaptive Inference Networks for Single Image Super-Resolution [72.7304455761067]
Single image super-resolution (SISR) has witnessed tremendous progress in recent years owing to the deployment of deep convolutional neural networks (CNNs)
In this paper, we take a step forward to address this issue by leveraging the adaptive inference networks for deep SISR (AdaDSR)
Our AdaDSR involves an SISR model as backbone and a lightweight adapter module which takes image features and resource constraint as input and predicts a map of local network depth.
arXiv Detail & Related papers (2020-04-08T10:08:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.