DLA-Net: Learning Dual Local Attention Features for Semantic
Segmentation of Large-Scale Building Facade Point Clouds
- URL: http://arxiv.org/abs/2106.00376v1
- Date: Tue, 1 Jun 2021 10:39:11 GMT
- Title: DLA-Net: Learning Dual Local Attention Features for Semantic
Segmentation of Large-Scale Building Facade Point Clouds
- Authors: Yanfei Su, Weiquan Liu, Zhimin Yuan, Ming Cheng, Zhihong Zhang, Xuelun
Shen, Cheng Wang
- Abstract summary: We construct the first large-scale building facade point clouds benchmark dataset for semantic segmentation.
We propose a learnable attention module that learns Dual Local Attention features, called DLA in this paper.
- Score: 14.485540292321257
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic segmentation of building facade is significant in various
applications, such as urban building reconstruction and damage assessment. As
there is a lack of 3D point clouds datasets related to the fine-grained
building facade, we construct the first large-scale building facade point
clouds benchmark dataset for semantic segmentation. The existing methods of
semantic segmentation cannot fully mine the local neighborhood information of
point clouds. Addressing this problem, we propose a learnable attention module
that learns Dual Local Attention features, called DLA in this paper. The
proposed DLA module consists of two blocks, including the self-attention block
and attentive pooling block, which both embed an enhanced position encoding
block. The DLA module could be easily embedded into various network
architectures for point cloud segmentation, naturally resulting in a new 3D
semantic segmentation network with an encoder-decoder architecture, called
DLA-Net in this work. Extensive experimental results on our constructed
building facade dataset demonstrate that the proposed DLA-Net achieves better
performance than the state-of-the-art methods for semantic segmentation.
Related papers
- SegPoint: Segment Any Point Cloud via Large Language Model [62.69797122055389]
We propose a model, called SegPoint, to produce point-wise segmentation masks across a diverse range of tasks.
SegPoint is the first model to address varied segmentation tasks within a single framework.
arXiv Detail & Related papers (2024-07-18T17:58:03Z) - A Framework for Building Point Cloud Cleaning, Plane Detection and
Semantic Segmentation [0.5439020425818999]
We focus in the cleaning stage on removing outliers from the acquired point cloud data.
Following the cleaning process, we perform plane detection using the robust RANSAC paradigm.
The resulting segments can generate accurate and detailed point clouds representing the building's architectural elements.
arXiv Detail & Related papers (2024-02-01T15:50:40Z) - PotholeGuard: A Pothole Detection Approach by Point Cloud Semantic
Segmentation [0.0]
3D Semantic Pothole research often overlooks point cloud sparsity, leading to suboptimal local feature capture and segmentation accuracy.
Our model efficiently identifies hidden features and uses a feedback mechanism to enhance local characteristics.
Our approach offers a promising solution for robust and accurate 3D pothole segmentation, with applications in road maintenance and safety.
arXiv Detail & Related papers (2023-11-05T12:57:05Z) - Few-Shot 3D Point Cloud Semantic Segmentation via Stratified
Class-Specific Attention Based Transformer Network [22.9434434107516]
We develop a new multi-layer transformer network for few-shot point cloud semantic segmentation.
Our method achieves the new state-of-the-art performance, with 15% less inference time, over existing few-shot 3D point cloud segmentation models.
arXiv Detail & Related papers (2023-03-28T00:27:54Z) - CLR-GAM: Contrastive Point Cloud Learning with Guided Augmentation and
Feature Mapping [12.679625717350113]
We present CLR-GAM, a contrastive learning-based framework with Guided Augmentation (GA) for efficient dynamic exploration strategy.
We empirically demonstrate that the proposed approach achieves state-of-the-art performance on both simulated and real-world 3D point cloud datasets.
arXiv Detail & Related papers (2023-02-28T04:38:52Z) - PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained
Image-Language Models [56.324516906160234]
Generalizable 3D part segmentation is important but challenging in vision and robotics.
This paper explores an alternative way for low-shot part segmentation of 3D point clouds by leveraging a pretrained image-language model, GLIP.
We transfer the rich knowledge from 2D to 3D through GLIP-based part detection on point cloud rendering and a novel 2D-to-3D label lifting algorithm.
arXiv Detail & Related papers (2022-12-03T06:59:01Z) - PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis [56.91758845045371]
We propose a novel framework named Point Relation-Aware Network (PRA-Net)
It is composed of an Intra-region Structure Learning (ISL) module and an Inter-region Relation Learning (IRL) module.
Experiments on several 3D benchmarks covering shape classification, keypoint estimation, and part segmentation have verified the effectiveness and the ability of PRA-Net.
arXiv Detail & Related papers (2021-12-09T13:24:43Z) - PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud
Segmentation [0.9137554315375922]
We propose a inception based deep network architecture called PIG-Net, that effectively characterizes the local and global geometric details of the point clouds.
We perform an exhaustive experimental analysis of the PIG-Net architecture on two state-of-the-art datasets.
arXiv Detail & Related papers (2021-01-28T13:27:55Z) - Boundary-Aware Segmentation Network for Mobile and Web Applications [60.815545591314915]
Boundary-Aware Network (BASNet) is integrated with a predict-refine architecture and a hybrid loss for highly accurate image segmentation.
BASNet runs at over 70 fps on a single GPU which benefits many potential real applications.
Based on BASNet, we further developed two (close to) commercial applications: AR COPY & PASTE, in which BASNet is augmented reality for "COPY" and "PASTING" real-world objects, and OBJECT CUT, which is a web-based tool for automatic object background removal.
arXiv Detail & Related papers (2021-01-12T19:20:26Z) - Improving Point Cloud Semantic Segmentation by Learning 3D Object
Detection [102.62963605429508]
Point cloud semantic segmentation plays an essential role in autonomous driving.
Current 3D semantic segmentation networks focus on convolutional architectures that perform great for well represented classes.
We propose a novel Aware 3D Semantic Detection (DASS) framework that explicitly leverages localization features from an auxiliary 3D object detection task.
arXiv Detail & Related papers (2020-09-22T14:17:40Z) - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical
Understanding of Outdoor Scene [76.4183572058063]
We present a richly-annotated 3D point cloud dataset for multiple outdoor scene understanding tasks.
The dataset has been point-wisely annotated with both hierarchical and instance-based labels.
We formulate a hierarchical learning problem for 3D point cloud segmentation and propose a measurement evaluating consistency across various hierarchies.
arXiv Detail & Related papers (2020-08-11T19:10:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.