LACV-Net: Semantic Segmentation of Large-Scale Point Cloud Scene via
Local Adaptive and Comprehensive VLAD
- URL: http://arxiv.org/abs/2210.05870v1
- Date: Wed, 12 Oct 2022 02:11:00 GMT
- Title: LACV-Net: Semantic Segmentation of Large-Scale Point Cloud Scene via
Local Adaptive and Comprehensive VLAD
- Authors: Ziyin Zeng, Yongyang Xu, Zhong Xie, Wei Tang, Jie Wan and Weichao Wu
- Abstract summary: We propose an end-to-end deep neural network called LACV-Net for large-scale point cloud semantic segmentation.
The proposed network contains three main components: 1) a local adaptive feature augmentation module (LAFA) to adaptively learn the similarity of centroids and neighboring points to augment the local context; 2) a comprehensive VLAD module that fuses local features with multi-layer, multi-scale, and multi-resolution to represent a comprehensive global description vector; and 3) an aggregation loss function to effectively optimize the segmentation boundaries by constraining the adaptive weight from the LAFA module.
- Score: 13.907586081922345
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Large-scale point cloud semantic segmentation is an important task in 3D
computer vision, which is widely applied in autonomous driving, robotics, and
virtual reality. Current large-scale point cloud semantic segmentation methods
usually use down-sampling operations to improve computation efficiency and
acquire point clouds with multi-resolution. However, this may cause the problem
of missing local information. Meanwhile, it is difficult for networks to
capture global information in large-scale distributed contexts. To capture
local and global information effectively, we propose an end-to-end deep neural
network called LACV-Net for large-scale point cloud semantic segmentation. The
proposed network contains three main components: 1) a local adaptive feature
augmentation module (LAFA) to adaptively learn the similarity of centroids and
neighboring points to augment the local context; 2) a comprehensive VLAD module
(C-VLAD) that fuses local features with multi-layer, multi-scale, and
multi-resolution to represent a comprehensive global description vector; and 3)
an aggregation loss function to effectively optimize the segmentation
boundaries by constraining the adaptive weight from the LAFA module. Compared
to state-of-the-art networks on several large-scale benchmark datasets,
including S3DIS, Toronto3D, and SensatUrban, we demonstrated the effectiveness
of the proposed network.
Related papers
- Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation [7.199090922071512]
We propose the Local Split Attention Pooling (LSAP) mechanism to effectively expand the receptive field through a series of local split operations.
We put forth a novel framework, designated as LSNet, for large-scale point cloud semantic segmentation.
LSNet demonstrated superior performance compared to state-of-the-art semantic segmentation networks on three benchmark datasets.
arXiv Detail & Related papers (2024-09-03T07:10:20Z) - PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection [59.355022416218624]
integration of point and voxel representations is becoming more common in LiDAR-based 3D object detection.
We propose a novel two-stage 3D object detector, called Point-Voxel Attention Fusion Network (PVAFN)
PVAFN uses a multi-pooling strategy to integrate both multi-scale and region-specific information effectively.
arXiv Detail & Related papers (2024-08-26T19:43:01Z) - SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation [10.328077317786342]
We propose a Similarity-Weighted Convolution and local-global Fusion Network, named SWCF-Net.
Our method achieves a competitive result with less computational cost, and is able to handle large-scale point clouds efficiently.
arXiv Detail & Related papers (2024-06-17T11:54:46Z) - PointeNet: A Lightweight Framework for Effective and Efficient Point
Cloud Analysis [28.54939134635978]
PointeNet is a network designed specifically for point cloud analysis.
Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks.
Experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNet KITTI, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis.
arXiv Detail & Related papers (2023-12-20T03:34:48Z) - LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
Propagation in Transformers [60.51925353387151]
We propose a novel module named Local Context Propagation (LCP) to exploit the message passing between neighboring local regions.
We use the overlap points of adjacent local regions as intermediaries, then re-weight the features of these shared points from different local regions before passing them to the next layers.
The proposed method is applicable to different tasks and outperforms various transformer-based methods in benchmarks including 3D shape classification and dense prediction tasks.
arXiv Detail & Related papers (2022-10-23T15:43:01Z) - SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud
Representation [65.4396959244269]
The paper tackles the challenge by designing a general framework to construct 3D learning architectures.
The proposed approach can be applied to general backbones like PointNet and DGCNN.
Experiments on ModelNet40, ShapeNet, and the real-world dataset ScanObjectNN, demonstrated that the method achieves a great trade-off between efficiency, rotation, and accuracy.
arXiv Detail & Related papers (2022-09-13T12:12:19Z) - Multi-scale Network with Attentional Multi-resolution Fusion for Point
Cloud Semantic Segmentation [2.964101313270572]
We present a comprehensive point cloud semantic segmentation network that aggregates both local and global multi-scale information.
We introduce an Angle Correlation Point Convolution module to effectively learn the local shapes of points.
Third, inspired by HRNet which has excellent performance on 2D image vision tasks, we build an HRNet customized for point cloud to learn global multi-scale context.
arXiv Detail & Related papers (2022-06-27T21:03:33Z) - SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation [94.11915008006483]
We propose SemAffiNet for point cloud semantic segmentation.
We conduct extensive experiments on the ScanNetV2 and NYUv2 datasets.
arXiv Detail & Related papers (2022-05-26T17:00:23Z) - PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis [56.91758845045371]
We propose a novel framework named Point Relation-Aware Network (PRA-Net)
It is composed of an Intra-region Structure Learning (ISL) module and an Inter-region Relation Learning (IRL) module.
Experiments on several 3D benchmarks covering shape classification, keypoint estimation, and part segmentation have verified the effectiveness and the ability of PRA-Net.
arXiv Detail & Related papers (2021-12-09T13:24:43Z) - Learning Semantic Segmentation of Large-Scale Point Clouds with Random
Sampling [52.464516118826765]
We introduce RandLA-Net, an efficient and lightweight neural architecture to infer per-point semantics for large-scale point clouds.
The key to our approach is to use random point sampling instead of more complex point selection approaches.
Our RandLA-Net can process 1 million points in a single pass up to 200x faster than existing approaches.
arXiv Detail & Related papers (2021-07-06T05:08:34Z) - Efficient 3D Point Cloud Feature Learning for Large-Scale Place
Recognition [21.818744369503197]
We develop an efficient point cloud learning network ( EPC-Net) to form a global descriptor for visual place recognition.
Our proposed method can achieve state-of-the-art performance with lower parameters, FLOPs, and runtime per frame.
arXiv Detail & Related papers (2021-01-07T05:15:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.