Beyond single receptive field: A receptive field
fusion-and-stratification network for airborne laser scanning point cloud
classification
- URL: http://arxiv.org/abs/2207.10278v1
- Date: Thu, 21 Jul 2022 03:10:35 GMT
- Title: Beyond single receptive field: A receptive field
fusion-and-stratification network for airborne laser scanning point cloud
classification
- Authors: Yongqiang Mao, Kaiqiang Chen, Wenhui Diao, Xian Sun, Xiaonan Lu, Kun
Fu, Martin Weinmann
- Abstract summary: We propose a novel receptive field fusion-and-stratification network (RFFS-Net)
RFFS-Net is more adaptable to the classification of regions with complex structures and extreme scale variations in large-scale ALS point clouds.
Experiments on the LASDU dataset and the 2019 IEEE-GRSS Data Fusion Contest dataset show that RFFS-Net achieves a new state-of-the-art classification performance.
- Score: 14.706139194001773
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The classification of airborne laser scanning (ALS) point clouds is a
critical task of remote sensing and photogrammetry fields. Although recent deep
learning-based methods have achieved satisfactory performance, they have
ignored the unicity of the receptive field, which makes the ALS point cloud
classification remain challenging for the distinguishment of the areas with
complex structures and extreme scale variations. In this article, for the
objective of configuring multi-receptive field features, we propose a novel
receptive field fusion-and-stratification network (RFFS-Net). With a novel
dilated graph convolution (DGConv) and its extension annular dilated
convolution (ADConv) as basic building blocks, the receptive field fusion
process is implemented with the dilated and annular graph fusion (DAGFusion)
module, which obtains multi-receptive field feature representation through
capturing dilated and annular graphs with various receptive regions. The
stratification of the receptive fields with point sets of different resolutions
as the calculation bases is performed with Multi-level Decoders nested in
RFFS-Net and driven by the multi-level receptive field aggregation loss
(MRFALoss) to drive the network to learn in the direction of the supervision
labels with different resolutions. With receptive field
fusion-and-stratification, RFFS-Net is more adaptable to the classification of
regions with complex structures and extreme scale variations in large-scale ALS
point clouds. Evaluated on the ISPRS Vaihingen 3D dataset, our RFFS-Net
significantly outperforms the baseline approach by 5.3% on mF1 and 5.4% on
mIoU, accomplishing an overall accuracy of 82.1%, an mF1 of 71.6%, and an mIoU
of 58.2%. Furthermore, experiments on the LASDU dataset and the 2019 IEEE-GRSS
Data Fusion Contest dataset show that RFFS-Net achieves a new state-of-the-art
classification performance.
Related papers
- PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection [59.355022416218624]
integration of point and voxel representations is becoming more common in LiDAR-based 3D object detection.
We propose a novel two-stage 3D object detector, called Point-Voxel Attention Fusion Network (PVAFN)
PVAFN uses a multi-pooling strategy to integrate both multi-scale and region-specific information effectively.
arXiv Detail & Related papers (2024-08-26T19:43:01Z) - ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection [65.59969454655996]
We propose an efficient change detection framework, ELGC-Net, which leverages rich contextual information to precisely estimate change regions.
Our proposed ELGC-Net sets a new state-of-the-art performance in remote sensing change detection benchmarks.
We also introduce ELGC-Net-LW, a lighter variant with significantly reduced computational complexity, suitable for resource-constrained settings.
arXiv Detail & Related papers (2024-03-26T17:46:25Z) - Improved Multi-Scale Grid Rendering of Point Clouds for Radar Object
Detection Networks [3.3787383461150045]
The transfer from irregular point cloud data to a dense grid structure is often associated with a loss of information.
We propose a novel architecture, multi-scale KPPillarsBEV, that aims to mitigate the negative effects of grid rendering.
arXiv Detail & Related papers (2023-05-25T08:26:42Z) - Semantic Segmentation of Radar Detections using Convolutions on Point
Clouds [59.45414406974091]
We introduce a deep-learning based method to convolve radar detections into point clouds.
We adapt this algorithm to radar-specific properties through distance-dependent clustering and pre-processing of input point clouds.
Our network outperforms state-of-the-art approaches that are based on PointNet++ on the task of semantic segmentation of radar point clouds.
arXiv Detail & Related papers (2023-05-22T07:09:35Z) - Spatial Layout Consistency for 3D Semantic Segmentation [0.7614628596146599]
We introduce a novel deep convolutional neural network (DCNN) technique for achieving voxel-based semantic segmentation of the ALTM's point clouds.
The suggested deep learning method, Semantic Utility Network (SUNet) is a multi-dimensional and multi-resolution network.
Our experiments demonstrated that SUNet's spatial layout consistency and a multi-resolution feature aggregation could significantly improve performance.
arXiv Detail & Related papers (2023-03-02T03:24:21Z) - Multi-scale Feature Aggregation for Crowd Counting [84.45773306711747]
We propose a multi-scale feature aggregation network (MSFANet)
MSFANet consists of two feature aggregation modules: the short aggregation (ShortAgg) and the skip aggregation (SkipAgg)
arXiv Detail & Related papers (2022-08-10T10:23:12Z) - SAR Despeckling Using Overcomplete Convolutional Networks [53.99620005035804]
despeckling is an important problem in remote sensing as speckle degrades SAR images.
Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods.
This study employs an overcomplete CNN architecture to focus on learning low-level features by restricting the receptive field.
We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images.
arXiv Detail & Related papers (2022-05-31T15:55:37Z) - Semantic Segmentation for Point Cloud Scenes via Dilated Graph Feature
Aggregation and Pyramid Decoders [15.860648472852597]
We propose a graph convolutional network DGFA-Net rooted in dilated graph feature aggregation (DGFA)
Experiments on S3DIS, ShapeNetPart and Toronto-3D show that DGFA-Net outperforms the baseline approach.
arXiv Detail & Related papers (2022-04-11T08:41:01Z) - TC-Net: Triple Context Network for Automated Stroke Lesion Segmentation [0.5482532589225552]
We propose a new network, Triple Context Network (TC-Net), with the capture of spatial contextual information as the core.
Our network is evaluated on the open dataset ATLAS, achieving the highest score of 0.594, Hausdorff distance of 27.005 mm, and average symmetry surface distance of 7.137 mm.
arXiv Detail & Related papers (2022-02-28T11:12:16Z) - An Attention-Fused Network for Semantic Segmentation of
Very-High-Resolution Remote Sensing Imagery [26.362854938949923]
We propose a novel convolutional neural network architecture, named attention-fused network (AFNet)
We achieve state-of-the-art performance with an overall accuracy of 91.7% and a mean F1 score of 90.96% on the ISPRS Vaihingen 2D dataset and the ISPRS Potsdam 2D dataset.
arXiv Detail & Related papers (2021-05-10T06:23:27Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.