Survey and Systematization of 3D Object Detection Models and Methods
- URL: http://arxiv.org/abs/2201.09354v2
- Date: Fri, 5 May 2023 09:19:03 GMT
- Title: Survey and Systematization of 3D Object Detection Models and Methods
- Authors: Moritz Drobnitzky, Jonas Friederich, Bernhard Egger, Patrick Zschech
- Abstract summary: We provide a comprehensive survey of recent developments from 2012-2021 in 3D object detection.
We introduce fundamental concepts, focus on a broad range of different approaches that have emerged over the past decade.
We propose a systematization that provides a practical framework for comparing these approaches with the goal of guiding future development, evaluation and application activities.
- Score: 3.472931603805115
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Strong demand for autonomous vehicles and the wide availability of 3D sensors
are continuously fueling the proposal of novel methods for 3D object detection.
In this paper, we provide a comprehensive survey of recent developments from
2012-2021 in 3D object detection covering the full pipeline from input data,
over data representation and feature extraction to the actual detection
modules. We introduce fundamental concepts, focus on a broad range of different
approaches that have emerged over the past decade, and propose a
systematization that provides a practical framework for comparing these
approaches with the goal of guiding future development, evaluation and
application activities. Specifically, our survey and systematization of 3D
object detection models and methods can help researchers and practitioners to
get a quick overview of the field by decomposing 3DOD solutions into more
manageable pieces.
Related papers
- Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation
for Autonomous Driving [5.347428263669927]
This dissertation is a multifaceted contribution to the advancement of vision-based 3D perception technologies.
In the first segment, the thesis introduces structural enhancements to both monocular and stereo 3D object detection algorithms.
The second segment is devoted to data-driven strategies and their real-world applications in 3D vision detection.
arXiv Detail & Related papers (2024-03-04T13:42:54Z) - Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and
Outlook [20.380856953006592]
This study emphasizes the importance of robustness, alongside accuracy and latency, in evaluating perception systems under practical scenarios.
Our work presents an extensive survey of camera-based, LiDAR-based, and multimodal 3D object detection algorithms, thoroughly evaluating their trade-off between accuracy, latency, and robustness.
Among these,multimodal 3D detection approaches exhibit superior robustness and a novel taxonomy is introduced to reorganize its literature for enhanced clarity.
arXiv Detail & Related papers (2024-01-12T12:35:45Z) - Joint object detection and re-identification for 3D obstacle
multi-camera systems [47.87501281561605]
This research paper introduces a novel modification to an object detection network that uses camera and lidar information.
It incorporates an additional branch designed for the task of re-identifying objects across adjacent cameras within the same vehicle.
The results underscore the superiority of this method over traditional Non-Maximum Suppression (NMS) techniques.
arXiv Detail & Related papers (2023-10-09T15:16:35Z) - Multi-Modal Dataset Acquisition for Photometrically Challenging Object [56.30027922063559]
This paper addresses the limitations of current datasets for 3D vision tasks in terms of accuracy, size, realism, and suitable imaging modalities for photometrically challenging objects.
We propose a novel annotation and acquisition pipeline that enhances existing 3D perception and 6D object pose datasets.
arXiv Detail & Related papers (2023-08-21T10:38:32Z) - HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for
Autonomous Driving [95.42203932627102]
3D human pose estimation is an emerging technology, which can enable the autonomous vehicle to perceive and understand the subtle and complex behaviors of pedestrians.
Our method efficiently makes use of these complementary signals, in a semi-supervised fashion and outperforms existing methods with a large margin.
Specifically, we embed LiDAR points into pixel-aligned multi-modal features, which we pass through a sequence of Transformer refinement stages.
arXiv Detail & Related papers (2022-12-15T11:15:14Z) - 3D Object Detection for Autonomous Driving: A Comprehensive Survey [48.30753402458884]
3D object detection, which intelligently predicts the locations, sizes, and categories of the critical 3D objects near an autonomous vehicle, is an important part of a perception system.
This paper reviews the advances in 3D object detection for autonomous driving.
arXiv Detail & Related papers (2022-06-19T19:43:11Z) - A Survey of Robust 3D Object Detection Methods in Point Clouds [2.1655448059430222]
We describe novel data augmentation methods, sampling strategies, activation functions, attention mechanisms, and regularization methods.
We evaluate novel 3D object detectors on the KITTI, nuScenes, and dataset.
We mention the current challenges in 3D object detection in LiDAR point clouds and list some open issues.
arXiv Detail & Related papers (2022-03-31T21:41:32Z) - 3D Object Detection from Images for Autonomous Driving: A Survey [68.33502122185813]
3D object detection from images is one of the fundamental and challenging problems in autonomous driving.
More than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications.
We provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection.
arXiv Detail & Related papers (2022-02-07T07:12:24Z) - DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes [54.239416488865565]
We propose a fast single-stage 3D object detection method for LIDAR data.
The core novelty of our method is a fast, single-pass architecture that both detects objects in 3D and estimates their shapes.
We find that our proposed method achieves state-of-the-art results by 5% on object detection in ScanNet scenes, and it gets top results by 3.4% in the Open dataset.
arXiv Detail & Related papers (2020-04-02T17:48:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.