3D Cascade RCNN: High Quality Object Detection in Point Clouds
- URL: http://arxiv.org/abs/2211.08248v1
- Date: Tue, 15 Nov 2022 15:58:36 GMT
- Title: 3D Cascade RCNN: High Quality Object Detection in Point Clouds
- Authors: Qi Cai and Yingwei Pan and Ting Yao and Tao Mei
- Abstract summary: We present 3D Cascade RCNN, which allocates multiple detectors based on the voxelized point clouds in a cascade paradigm.
We validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques.
- Score: 122.42455210196262
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent progress on 2D object detection has featured Cascade RCNN, which
capitalizes on a sequence of cascade detectors to progressively improve
proposal quality, towards high-quality object detection. However, there has not
been evidence in support of building such cascade structures for 3D object
detection, a challenging detection scenario with highly sparse LiDAR point
clouds. In this work, we present a simple yet effective cascade architecture,
named 3D Cascade RCNN, that allocates multiple detectors based on the voxelized
point clouds in a cascade paradigm, pursuing higher quality 3D object detector
progressively. Furthermore, we quantitatively define the sparsity level of the
points within 3D bounding box of each object as the point completeness score,
which is exploited as the task weight for each proposal to guide the learning
of each stage detector. The spirit behind is to assign higher weights for
high-quality proposals with relatively complete point distribution, while
down-weight the proposals with extremely sparse points that often incur noise
during training. This design of completeness-aware re-weighting elegantly
upgrades the cascade paradigm to be better applicable for the sparse input
data, without increasing any FLOP budgets. Through extensive experiments on
both the KITTI dataset and Waymo Open Dataset, we validate the superiority of
our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object
detection techniques. The source code is publicly available at
\url{https://github.com/caiqi/Cascasde-3D}.
Related papers
- CascadeV-Det: Cascade Point Voting for 3D Object Detection [10.714006902287904]
Anchor-free object detectors are highly efficient in performing point-based prediction without the need for extra post-processing of anchors.
Different from the 2D grids, the 3D points used in these detectors are often far from the ground truth center.
We propose a Cascade Voting ( CascadeV) strategy that provides high-quality 3D object detection with point-based prediction.
arXiv Detail & Related papers (2024-01-15T05:10:27Z) - CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds [55.44204039410225]
We present a novel two-stage fully sparse convolutional 3D object detection framework, named CAGroup3D.
Our proposed method first generates some high-quality 3D proposals by leveraging the class-aware local group strategy on the object surface voxels.
To recover the features of missed voxels due to incorrect voxel-wise segmentation, we build a fully sparse convolutional RoI pooling module.
arXiv Detail & Related papers (2022-10-09T13:38:48Z) - 3D Object Detection Combining Semantic and Geometric Features from Point
Clouds [19.127930862527666]
We propose a novel end-to-end two-stage 3D object detector named SGNet for point clouds scenes.
The VTPM is a Voxel-Point-Based Module that finally implements 3D object detection in point space.
As of September 19, 2021, for KITTI dataset, SGNet ranked 1st in 3D and BEV detection on cyclists with easy difficulty level, and 2nd in the 3D detection of moderate cyclists.
arXiv Detail & Related papers (2021-10-10T04:43:27Z) - Improving 3D Object Detection with Channel-wise Transformer [58.668922561622466]
We propose a two-stage 3D object detection framework (CT3D) with minimal hand-crafted design.
CT3D simultaneously performs proposal-aware embedding and channel-wise context aggregation.
It achieves the AP of 81.77% in the moderate car category on the KITTI test 3D detection benchmark.
arXiv Detail & Related papers (2021-08-23T02:03:40Z) - Anchor-free 3D Single Stage Detector with Mask-Guided Attention for
Point Cloud [79.39041453836793]
We develop a novel single-stage 3D detector for point clouds in an anchor-free manner.
We overcome this by converting the voxel-based sparse 3D feature volumes into the sparse 2D feature maps.
We propose an IoU-based detection confidence re-calibration scheme to improve the correlation between the detection confidence score and the accuracy of the bounding box regression.
arXiv Detail & Related papers (2021-08-08T13:42:13Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - D3Feat: Joint Learning of Dense Detection and Description of 3D Local
Features [51.04841465193678]
We leverage a 3D fully convolutional network for 3D point clouds.
We propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.
Our method achieves state-of-the-art results in both indoor and outdoor scenarios.
arXiv Detail & Related papers (2020-03-06T12:51:09Z) - SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint
Estimation [3.1542695050861544]
Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous navigation and driving.
We propose a novel 3D object detection method, named SMOKE, that combines a single keypoint estimate with regressed 3D variables.
Despite of its structural simplicity, our proposed SMOKE network outperforms all existing monocular 3D detection methods on the KITTI dataset.
arXiv Detail & Related papers (2020-02-24T08:15:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.