LiDAR R-CNN: An Efficient and Universal 3D Object Detector
- URL: http://arxiv.org/abs/2103.15297v1
- Date: Mon, 29 Mar 2021 03:01:21 GMT
- Title: LiDAR R-CNN: An Efficient and Universal 3D Object Detector
- Authors: Zhichao Li, Feng Wang, Naiyan Wang
- Abstract summary: LiDAR-based 3D detection in point cloud is essential in the perception system of autonomous driving.
We present LiDAR R-CNN, a second stage detector that can generally improve any existing 3D detector.
In particular, based on one variant of PointPillars, our method could achieve new state-of-the-art results with minor cost.
- Score: 20.17906188581305
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: LiDAR-based 3D detection in point cloud is essential in the perception system
of autonomous driving. In this paper, we present LiDAR R-CNN, a second stage
detector that can generally improve any existing 3D detector. To fulfill the
real-time and high precision requirement in practice, we resort to point-based
approach other than the popular voxel-based approach. However, we find an
overlooked issue in previous work: Naively applying point-based methods like
PointNet could make the learned features ignore the size of proposals. To this
end, we analyze this problem in detail and propose several methods to remedy
it, which bring significant performance improvement. Comprehensive experimental
results on real-world datasets like Waymo Open Dataset (WOD) and KITTI dataset
with various popular detectors demonstrate the universality and superiority of
our LiDAR R-CNN. In particular, based on one variant of PointPillars, our
method could achieve new state-of-the-art results with minor cost. Codes will
be released at https://github.com/tusimple/LiDAR_RCNN .
Related papers
- Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted
Sample Selection [40.005411891186874]
This paper introduces a new training sample selection method that utilizes point cloud distribution for anchor sample quality measurement.
Experimental results demonstrate that the application of PASS elevates the average precision of anchor-based LiDAR 3D object detectors to a novel state-of-the-art.
arXiv Detail & Related papers (2024-03-04T12:20:40Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - Graph R-CNN: Towards Accurate 3D Object Detection with
Semantic-Decorated Local Graph [26.226885108862735]
Two-stage detectors have gained much popularity in 3D object detection.
Most two-stage 3D detectors utilize grid points, voxel grids, or sampled keypoints for RoI feature extraction in the second stage.
This paper solves this problem in three aspects.
arXiv Detail & Related papers (2022-08-07T02:56:56Z) - Boosting 3D Object Detection by Simulating Multimodality on Point Clouds [51.87740119160152]
This paper presents a new approach to boost a single-modality (LiDAR) 3D object detector by teaching it to simulate features and responses that follow a multi-modality (LiDAR-image) detector.
The approach needs LiDAR-image data only when training the single-modality detector, and once well-trained, it only needs LiDAR data at inference.
Experimental results on the nuScenes dataset show that our approach outperforms all SOTA LiDAR-only 3D detectors.
arXiv Detail & Related papers (2022-06-30T01:44:30Z) - Cost-Aware Evaluation and Model Scaling for LiDAR-Based 3D Object
Detection [34.34668878632354]
This work is to conduct a cost-aware evaluation of LiDAR-based 3D object detectors.
Specifically, we focus on SECOND, a simple grid-based one-stage detector.
We compare the family of scaled SECOND with recent 3D detection methods, such as Voxel R-CNN and PV-RCNN++.
arXiv Detail & Related papers (2022-05-02T18:16:17Z) - MonoDistill: Learning Spatial Features for Monocular 3D Object Detection [80.74622486604886]
We propose a simple and effective scheme to introduce the spatial information from LiDAR signals to the monocular 3D detectors.
We use the resulting data to train a 3D detector with the same architecture as the baseline model.
Experimental results show that the proposed method can significantly boost the performance of the baseline model.
arXiv Detail & Related papers (2022-01-26T09:21:41Z) - Lite-FPN for Keypoint-based Monocular 3D Object Detection [18.03406686769539]
Keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off.
We propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion.
Our proposed method achieves significantly higher accuracy and frame rate at the same time.
arXiv Detail & Related papers (2021-05-01T14:44:31Z) - PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector
Representation for 3D Object Detection [100.60209139039472]
We propose the PointVoxel Region based Convolution Neural Networks (PVRCNNs) for accurate 3D detection from point clouds.
Our proposed PV-RCNNs significantly outperform previous state-of-the-art 3D detection methods on both the Open dataset and the highly-competitive KITTI benchmark.
arXiv Detail & Related papers (2021-01-31T14:51:49Z) - Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection [99.16162624992424]
We devise a simple but effective voxel-based framework, named Voxel R-CNN.
By taking full advantage of voxel features in a two stage approach, our method achieves comparable detection accuracy with state-of-the-art point-based models.
Our results show that Voxel R-CNN delivers a higher detection accuracy while maintaining a realtime frame processing rate, emphi.e, at a speed of 25 FPS on an NVIDIA 2080 Ti GPU.
arXiv Detail & Related papers (2020-12-31T17:02:46Z) - Rethinking Pseudo-LiDAR Representation [70.29791705160203]
We propose an image based CNN detector named Patch-Net, which is more generalized and can be instantiated as pseudo-LiDAR based 3D detectors.
We conduct extensive experiments on the challenging KITTI dataset, where the proposed PatchNet outperforms all existing pseudo-LiDAR based counterparts.
arXiv Detail & Related papers (2020-08-11T08:44:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.