Cross-Cluster Shifting for Efficient and Effective 3D Object Detection
in Autonomous Driving
- URL: http://arxiv.org/abs/2403.06166v1
- Date: Sun, 10 Mar 2024 10:36:32 GMT
- Title: Cross-Cluster Shifting for Efficient and Effective 3D Object Detection
in Autonomous Driving
- Authors: Zhili Chen, Kien T. Pham, Maosheng Ye, Zhiqiang Shen, and Qifeng Chen
- Abstract summary: We present a new 3D point-based detector model, named Shift-SSD, for precise 3D object detection in autonomous driving.
We introduce an intriguing Cross-Cluster Shifting operation to unleash the representation capacity of the point-based detector.
We conduct extensive experiments on the KITTI, runtime, and nuScenes datasets, and the results demonstrate the state-of-the-art performance of Shift-SSD.
- Score: 69.20604395205248
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a new 3D point-based detector model, named Shift-SSD, for precise
3D object detection in autonomous driving. Traditional point-based 3D object
detectors often employ architectures that rely on a progressive downsampling of
points. While this method effectively reduces computational demands and
increases receptive fields, it will compromise the preservation of crucial
non-local information for accurate 3D object detection, especially in the
complex driving scenarios. To address this, we introduce an intriguing
Cross-Cluster Shifting operation to unleash the representation capacity of the
point-based detector by efficiently modeling longer-range inter-dependency
while including only a negligible overhead. Concretely, the Cross-Cluster
Shifting operation enhances the conventional design by shifting partial
channels from neighboring clusters, which enables richer interaction with
non-local regions and thus enlarges the receptive field of clusters. We conduct
extensive experiments on the KITTI, Waymo, and nuScenes datasets, and the
results demonstrate the state-of-the-art performance of Shift-SSD in both
detection accuracy and runtime efficiency.
Related papers
- Domain Generalization of 3D Object Detection by Density-Resampling [14.510085711178217]
Point-cloud-based 3D object detection suffers from performance degradation when encountering data with novel domain gaps.
We propose an SDG method to improve the generalizability of 3D object detection to unseen target domains.
Our work introduces a novel data augmentation method and contributes a new multi-task learning strategy in the methodology.
arXiv Detail & Related papers (2023-11-17T20:01:29Z) - LS-VOS: Identifying Outliers in 3D Object Detections Using Latent Space
Virtual Outlier Synthesis [10.920640666237833]
LiDAR-based 3D object detectors have achieved unprecedented speed and accuracy in autonomous driving applications.
They are often biased toward high-confidence predictions or return detections where no real object is present.
We propose LS-VOS, a framework for identifying outliers in 3D object detections.
arXiv Detail & Related papers (2023-10-02T07:44:26Z) - SSC3OD: Sparsely Supervised Collaborative 3D Object Detection from LiDAR
Point Clouds [16.612824810651897]
We propose a sparsely supervised collaborative 3D object detection framework SSC3OD.
It only requires each agent to randomly label one object in the scene.
It can effectively improve the performance of sparsely supervised collaborative 3D object detectors.
arXiv Detail & Related papers (2023-07-03T02:42:14Z) - DETR4D: Direct Multi-View 3D Object Detection with Sparse Attention [50.11672196146829]
3D object detection with surround-view images is an essential task for autonomous driving.
We propose DETR4D, a Transformer-based framework that explores sparse attention and direct feature query for 3D object detection in multi-view images.
arXiv Detail & Related papers (2022-12-15T14:18:47Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in
Driving Scenes [82.4186966781934]
We introduce a simple, efficient, and effective two-stage detector, termed as Ret3D.
At the core of Ret3D is the utilization of novel intra-frame and inter-frame relation modules.
With negligible extra overhead, Ret3D achieves the state-of-the-art performance.
arXiv Detail & Related papers (2022-08-18T03:48:58Z) - High-level camera-LiDAR fusion for 3D object detection with machine
learning [0.0]
This paper tackles the 3D object detection problem, which is of vital importance for applications such as autonomous driving.
It uses a Machine Learning pipeline on a combination of monocular camera and LiDAR data to detect vehicles in the surrounding 3D space of a moving platform.
Our results demonstrate an efficient and accurate inference on a validation set, achieving an overall accuracy of 87.1%.
arXiv Detail & Related papers (2021-05-24T01:57:34Z) - Improving Point Cloud Semantic Segmentation by Learning 3D Object
Detection [102.62963605429508]
Point cloud semantic segmentation plays an essential role in autonomous driving.
Current 3D semantic segmentation networks focus on convolutional architectures that perform great for well represented classes.
We propose a novel Aware 3D Semantic Detection (DASS) framework that explicitly leverages localization features from an auxiliary 3D object detection task.
arXiv Detail & Related papers (2020-09-22T14:17:40Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.