Are We Hungry for 3D LiDAR Data for Semantic Segmentation? A Survey and
Experimental Study
- URL: http://arxiv.org/abs/2006.04307v2
- Date: Mon, 30 Nov 2020 12:58:20 GMT
- Title: Are We Hungry for 3D LiDAR Data for Semantic Segmentation? A Survey and
Experimental Study
- Authors: Biao Gao, Yancheng Pan, Chengkun Li, Sibo Geng, Huijing Zhao
- Abstract summary: 3D semantic segmentation is a fundamental task for robotic and autonomous driving applications.
Recent works have been focused on using deep learning techniques, whereas developing fine-annotated 3D LiDAR datasets is extremely labor intensive.
The performance limitation caused by insufficient datasets is called data hunger problem.
- Score: 5.6780397318769245
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 3D semantic segmentation is a fundamental task for robotic and autonomous
driving applications. Recent works have been focused on using deep learning
techniques, whereas developing fine-annotated 3D LiDAR datasets is extremely
labor intensive and requires professional skills. The performance limitation
caused by insufficient datasets is called data hunger problem. This research
provides a comprehensive survey and experimental study on the question: are we
hungry for 3D LiDAR data for semantic segmentation? The studies are conducted
at three levels. First, a broad review to the main 3D LiDAR datasets is
conducted, followed by a statistical analysis on three representative datasets
to gain an in-depth view on the datasets' size and diversity, which are the
critical factors in learning deep models. Second, a systematic review to the
state-of-the-art 3D semantic segmentation is conducted, followed by experiments
and cross examinations of three representative deep learning methods to find
out how the size and diversity of the datasets affect deep models' performance.
Finally, a systematic survey to the existing efforts to solve the data hunger
problem is conducted on both methodological and dataset's viewpoints, followed
by an insightful discussion of remaining problems and open questions To the
best of our knowledge, this is the first work to analyze the data hunger
problem for 3D semantic segmentation using deep learning techniques that are
addressed in the literature review, statistical analysis, and cross-dataset and
cross-algorithm experiments. We share findings and discussions, which may lead
to potential topics in future works.
Related papers
- Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey [3.929140365559557]
This paper provides a comprehensive overview of the latest techniques for understanding 3DPC using deep transfer learning (DTL) and domain adaptation (DA)
The paper covers various applications, such as 3DPC object detection, semantic labeling, segmentation, classification, registration, downsampling/upsampling, and denoising.
arXiv Detail & Related papers (2024-07-25T08:47:27Z) - Deep Learning-Based 3D Instance and Semantic Segmentation: A Review [0.0]
3D segmentation is challenging with point cloud data due to substantial redundancy, fluctuating sample density and lack of organization.
Deep learning has been successfully used to a spectrum of 2D vision domains as a prevailing A.I. methods.
This study examines many strategies that have been presented to 3D instance and semantic segmentation.
arXiv Detail & Related papers (2024-06-19T07:56:14Z) - Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding [50.448520056844885]
We propose a generative Bayesian network to produce diverse synthetic scenes with real-world patterns.
A series of experiments robustly display our method's consistent superiority over existing state-of-the-art pre-training approaches.
arXiv Detail & Related papers (2024-06-17T07:43:53Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - A Review and Roadmap of Deep Causal Model from Different Causal
Structures and Representations [23.87336875544181]
We redefinition causal data into three categories: definite data, semi-definite data, and indefinite data.
Definite data pertains to statistical data used in conventional causal scenarios, while semi-definite data refers to a spectrum of data formats germane to deep learning.
Indefinite data is an emergent research sphere inferred from the progression of data forms by us.
arXiv Detail & Related papers (2023-11-02T01:31:42Z) - A Survey of Label-Efficient Deep Learning for 3D Point Clouds [109.07889215814589]
This paper presents the first comprehensive survey of label-efficient learning of point clouds.
We propose a taxonomy that organizes label-efficient learning methods based on the data prerequisites provided by different types of labels.
For each approach, we outline the problem setup and provide an extensive literature review that showcases relevant progress and challenges.
arXiv Detail & Related papers (2023-05-31T12:54:51Z) - Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions [57.30984060215482]
We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
arXiv Detail & Related papers (2023-01-07T11:37:32Z) - Learning 3D Human Pose Estimation from Dozens of Datasets using a
Geometry-Aware Autoencoder to Bridge Between Skeleton Formats [80.12253291709673]
We propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks.
Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model.
arXiv Detail & Related papers (2022-12-29T22:22:49Z) - Deep Depth Completion: A Survey [26.09557446012222]
We provide a comprehensive literature review that helps readers better grasp the research trends and clearly understand the current advances.
We investigate the related studies from the design aspects of network architectures, loss functions, benchmark datasets, and learning strategies.
We present a quantitative comparison of model performance on two widely used benchmark datasets, including an indoor and an outdoor dataset.
arXiv Detail & Related papers (2022-05-11T08:24:00Z) - Deep Learning Based 3D Segmentation: A Survey [42.44509605101214]
3D segmentation is a fundamental and challenging problem in computer vision with applications in autonomous driving and robotics.
Deep learning techniques have recently become the tool of choice for 3D segmentation tasks.
This paper comprehensively surveys the recent progress in deep learning-based 3D segmentation techniques.
arXiv Detail & Related papers (2021-03-09T13:58:35Z) - PointContrast: Unsupervised Pre-training for 3D Point Cloud
Understanding [107.02479689909164]
In this work, we aim at facilitating research on 3D representation learning.
We measure the effect of unsupervised pre-training on a large source set of 3D scenes.
arXiv Detail & Related papers (2020-07-21T17:59:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.