One-Stage Deep Edge Detection Based on Dense-Scale Feature Fusion and
Pixel-Level Imbalance Learning
- URL: http://arxiv.org/abs/2203.09387v1
- Date: Thu, 17 Mar 2022 15:26:00 GMT
- Title: One-Stage Deep Edge Detection Based on Dense-Scale Feature Fusion and
Pixel-Level Imbalance Learning
- Authors: Dawei Dai, Chunjie Wang, Shuyin Xia, Yingge Liu, Guoyin Wang
- Abstract summary: We propose a one-stage neural network model that can generate high-quality edge images without postprocessing.
The proposed model adopts a classic encoder-decoder framework in which a pre-trained neural model is used as the encoder.
We propose a new loss function that addresses the pixel-level imbalance in the edge image.
- Score: 5.370848116287344
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Edge detection, a basic task in the field of computer vision, is an important
preprocessing operation for the recognition and understanding of a visual
scene. In conventional models, the edge image generated is ambiguous, and the
edge lines are also very thick, which typically necessitates the use of
non-maximum suppression (NMS) and morphological thinning operations to generate
clear and thin edge images. In this paper, we aim to propose a one-stage neural
network model that can generate high-quality edge images without
postprocessing. The proposed model adopts a classic encoder-decoder framework
in which a pre-trained neural model is used as the encoder and a
multi-feature-fusion mechanism that merges the features of each level with each
other functions as a learnable decoder. Further, we propose a new loss function
that addresses the pixel-level imbalance in the edge image by suppressing the
false positive (FP) edge information near the true positive (TP) edge and the
false negative (FN) non-edge. The results of experiments conducted on several
benchmark datasets indicate that the proposed method achieves state-of-the-art
results without using NMS and morphological thinning operations.
Related papers
- Deep Learning Based Speckle Filtering for Polarimetric SAR Images. Application to Sentinel-1 [51.404644401997736]
We propose a complete framework to remove speckle in polarimetric SAR images using a convolutional neural network.
Experiments show that the proposed approach offers exceptional results in both speckle reduction and resolution preservation.
arXiv Detail & Related papers (2024-08-28T10:07:17Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Pixel-Inconsistency Modeling for Image Manipulation Localization [59.968362815126326]
Digital image forensics plays a crucial role in image authentication and manipulation localization.
This paper presents a generalized and robust manipulation localization model through the analysis of pixel inconsistency artifacts.
Experiments show that our method successfully extracts inherent pixel-inconsistency forgery fingerprints.
arXiv Detail & Related papers (2023-09-30T02:54:51Z) - PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded
Diffraction Patterns Phase Retrieval [6.982256124089]
Phase retrieval is a challenge nonlinear inverse problem in computational imaging and image processing.
We have developed PRISTA-Net, a deep unfolding network based on the first-order iterative threshold threshold algorithm (ISTA)
All parameters in the proposed PRISTA-Net framework, including the nonlinear transformation, threshold, and step size, are learned-to-end instead of being set.
arXiv Detail & Related papers (2023-09-08T07:37:15Z) - Single Image Depth Prediction Made Better: A Multivariate Gaussian Take [163.14849753700682]
We introduce an approach that performs continuous modeling of per-pixel depth.
Our method's accuracy (named MG) is among the top on the KITTI depth-prediction benchmark leaderboard.
arXiv Detail & Related papers (2023-03-31T16:01:03Z) - ISSTAD: Incremental Self-Supervised Learning Based on Transformer for
Anomaly Detection and Localization [12.975540251326683]
We introduce a novel approach based on the Transformer backbone network.
We train a Masked Autoencoder (MAE) model solely on normal images.
In the subsequent stage, we apply pixel-level data augmentation techniques to generate corrupted normal images.
This process allows the model to learn how to repair corrupted regions and classify the status of each pixel.
arXiv Detail & Related papers (2023-03-30T13:11:26Z) - Hyperspectral Remote Sensing Image Classification Based on Multi-scale
Cross Graphic Convolution [20.42582692786715]
New multi-scale feature-mining learning algorithm (MGRNet) is proposed.
MGRNet uses principal component analysis to reduce the dimensionality of the original hyperspectral image (HSI) to retain 99.99% of its semantic information.
Experiments on three common hyperspectral datasets showed the MGRNet algorithm proposed in this paper to be superior to traditional methods in recognition accuracy.
arXiv Detail & Related papers (2021-06-28T15:28:09Z) - Blind microscopy image denoising with a deep residual and multiscale
encoder/decoder network [0.0]
Deep multiscale convolutional encoder-decoder neural network is proposed.
The proposed model reaches on average 38.38 of PSNR and 0.98 of SSIM on a test set of 57458 images.
arXiv Detail & Related papers (2021-05-01T14:54:57Z) - Image Inpainting with Edge-guided Learnable Bidirectional Attention Maps [85.67745220834718]
We present an edge-guided learnable bidirectional attention map (Edge-LBAM) for improving image inpainting of irregular holes.
Our Edge-LBAM method contains dual procedures,including structure-aware mask-updating guided by predict edges.
Extensive experiments show that our Edge-LBAM is effective in generating coherent image structures and preventing color discrepancy and blurriness.
arXiv Detail & Related papers (2021-04-25T07:25:16Z) - Hierarchical Convolutional Neural Network with Feature Preservation and
Autotuned Thresholding for Crack Detection [5.735035463793008]
Drone imagery is increasingly used in automated inspection for infrastructure surface defects.
This paper proposes a deep learning approach using hierarchical convolutional neural networks with feature preservation.
The proposed technique is then applied to identify surface cracks on the surface of roads, bridges or pavements.
arXiv Detail & Related papers (2021-04-21T13:07:58Z) - DWDN: Deep Wiener Deconvolution Network for Non-Blind Image Deblurring [66.91879314310842]
We propose an explicit deconvolution process in a feature space by integrating a classical Wiener deconvolution framework with learned deep features.
A multi-scale cascaded feature refinement module then predicts the deblurred image from the deconvolved deep features.
We show that the proposed deep Wiener deconvolution network facilitates deblurred results with visibly fewer artifacts and quantitatively outperforms state-of-the-art non-blind image deblurring methods by a wide margin.
arXiv Detail & Related papers (2021-03-18T00:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.