Related papers: Deep Learning Based 3D Segmentation: A Survey

Deep Learning Based 3D Segmentation: A Survey

URL: http://arxiv.org/abs/2103.05423v3
Date: Wed, 26 Jul 2023 08:14:39 GMT
Title: Deep Learning Based 3D Segmentation: A Survey
Authors: Yong He, Hongshan Yu, Xiaoyan Liu, Zhengeng Yang, Wei Sun, Ajmal Mian
Abstract summary: 3D segmentation is a fundamental problem in computer vision with applications in autonomous driving, robotics, augmented reality and medical image analysis. Deep learning techniques have recently become the tool of choice for 3D segmentation tasks. This paper fills the gap and provides a comprehensive survey of the recent progress made in deep learning based 3D segmentation.
Score: 29.402585297221457
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: 3D segmentation is a fundamental and challenging problem in computer vision with applications in autonomous driving, robotics, augmented reality and medical image analysis. It has received significant attention from the computer vision, graphics and machine learning communities. Conventional methods for 3D segmentation, based on hand-crafted features and machine learning classifiers, lack generalization ability. Driven by their success in 2D computer vision, deep learning techniques have recently become the tool of choice for 3D segmentation tasks. This has led to an influx of a large number of methods in the literature that have been evaluated on different benchmark datasets. Whereas survey papers on RGB-D and point cloud segmentation exist, there is a lack of an in-depth and recent survey that covers all 3D data modalities and application domains. This paper fills the gap and provides a comprehensive survey of the recent progress made in deep learning based 3D segmentation. It covers over 180 works, analyzes their strengths and limitations and discusses their competitive results on benchmark datasets. The survey provides a summary of the most commonly used pipelines and finally highlights promising research directions for the future.

Related papers

E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models [78.1674905950243]
We present the first comprehensive benchmark for 3D geometric foundation models (GFMs)<n>GFMs directly predict dense 3D representations in a single feed-forward pass, eliminating the need for slow or unavailable precomputed camera parameters.<n>We evaluate 16 state-of-the-art GFMs, revealing their strengths and limitations across tasks and domains.<n>All code, evaluation scripts, and processed data will be publicly released to accelerate research in 3D spatial intelligence.
arXiv Detail & Related papers (2025-06-02T17:53:09Z)
3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data [0.0]
2D region based convolutional neural networks (Mask R-CNN) deep learning model with point based rending module is adapted to integrate with depth information to recognize and segment 3D instances of objects. In order to generate 3D point cloud coordinates, segmented 2D pixels of recognized object regions in the RGB image are merged into (u, v) points of the depth image.
arXiv Detail & Related papers (2024-06-19T08:00:35Z)
Deep Learning-Based 3D Instance and Semantic Segmentation: A Review [0.0]
3D segmentation is challenging with point cloud data due to substantial redundancy, fluctuating sample density and lack of organization. Deep learning has been successfully used to a spectrum of 2D vision domains as a prevailing A.I. methods. This study examines many strategies that have been presented to 3D instance and semantic segmentation.
arXiv Detail & Related papers (2024-06-19T07:56:14Z)
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System [39.244727514293324]
TS40K is a 3D point cloud dataset that encompasses more than 40,000 Km on electrical transmission systems situated in European rural terrain. This is not only a novel problem for the research community that can aid in the high-risk mission of power-grid inspection, but it also offers 3D point clouds with distinct characteristics from those in self-driving and indoor 3D data. We evaluate the performance of state-of-the-art methods on our dataset concerning 3D semantic segmentation and 3D object detection.
arXiv Detail & Related papers (2024-05-22T20:53:23Z)
SAI3D: Segment Any Instance in 3D Scenes [68.57002591841034]
We introduce SAI3D, a novel zero-shot 3D instance segmentation approach. Our method partitions a 3D scene into geometric primitives, which are then progressively merged into 3D instance segmentations. Empirical evaluations on ScanNet, Matterport3D and the more challenging ScanNet++ datasets demonstrate the superiority of our approach.
arXiv Detail & Related papers (2023-12-17T09:05:47Z)
SAM-guided Graph Cut for 3D Instance Segmentation [60.75119991853605]
This paper addresses the challenge of 3D instance segmentation by simultaneously leveraging 3D geometric and multi-view image information. We introduce a novel 3D-to-2D query framework to effectively exploit 2D segmentation models for 3D instance segmentation. Our method achieves robust segmentation performance and can generalize across different types of scenes.
arXiv Detail & Related papers (2023-12-13T18:59:58Z)
Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation [67.07112533415116]
We present a novel framework that adapts various foundational models for the 3D point cloud segmentation task. Our approach involves making initial predictions of 2D semantic masks using different large vision models. To generate robust 3D semantic pseudo labels, we introduce a semantic label fusion strategy that effectively combines all the results via voting.
arXiv Detail & Related papers (2023-11-03T15:41:15Z)
Towards Open Set 3D Learning: A Benchmark on Object Point Clouds [17.145309633743747]
This paper provides the first broad study on Open Set 3D learning. We introduce a novel testbed with settings of increasing difficulty in terms of category semantic shift. We investigate the related out-of-distribution and Open Set 2D literature to understand if and how their most recent approaches are effective on 3D data.
arXiv Detail & Related papers (2022-07-23T17:00:45Z)
Improving Point Cloud Semantic Segmentation by Learning 3D Object Detection [102.62963605429508]
Point cloud semantic segmentation plays an essential role in autonomous driving. Current 3D semantic segmentation networks focus on convolutional architectures that perform great for well represented classes. We propose a novel Aware 3D Semantic Detection (DASS) framework that explicitly leverages localization features from an auxiliary 3D object detection task.
arXiv Detail & Related papers (2020-09-22T14:17:40Z)
Deep Learning for 3D Point Cloud Understanding: A Survey [16.35767262996978]
The development of practical applications, such as autonomous driving and robotics, has brought increasing attention to 3D point cloud understanding. Deep learning has achieved remarkable success on image-based tasks, but there are many unique challenges faced by deep neural networks in processing massive, unstructured and noisy 3D points. This paper summarizes recent remarkable research contributions in this area from several different directions.
arXiv Detail & Related papers (2020-09-18T16:34:12Z)
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding [107.02479689909164]
In this work, we aim at facilitating research on 3D representation learning. We measure the effect of unsupervised pre-training on a large source set of 3D scenes.
arXiv Detail & Related papers (2020-07-21T17:59:22Z)

This list is automatically generated from the titles and abstracts of the papers in this site.