CascadeV-Det: Cascade Point Voting for 3D Object Detection
- URL: http://arxiv.org/abs/2401.07477v1
- Date: Mon, 15 Jan 2024 05:10:27 GMT
- Title: CascadeV-Det: Cascade Point Voting for 3D Object Detection
- Authors: Yingping Liang, Ying Fu
- Abstract summary: Anchor-free object detectors are highly efficient in performing point-based prediction without the need for extra post-processing of anchors.
Different from the 2D grids, the 3D points used in these detectors are often far from the ground truth center.
We propose a Cascade Voting ( CascadeV) strategy that provides high-quality 3D object detection with point-based prediction.
- Score: 10.714006902287904
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anchor-free object detectors are highly efficient in performing point-based
prediction without the need for extra post-processing of anchors. However,
different from the 2D grids, the 3D points used in these detectors are often
far from the ground truth center, making it challenging to accurately regress
the bounding boxes. To address this issue, we propose a Cascade Voting
(CascadeV) strategy that provides high-quality 3D object detection with
point-based prediction. Specifically, CascadeV performs cascade detection using
a novel Cascade Voting decoder that combines two new components: Instance Aware
Voting (IA-Voting) and a Cascade Point Assignment (CPA) module. The IA-Voting
module updates the object features of updated proposal points within the
bounding box using conditional inverse distance weighting. This approach
prevents features from being aggregated outside the instance and helps improve
the accuracy of object detection. Additionally, since model training can suffer
from a lack of proposal points with high centerness, we have developed the CPA
module to narrow down the positive assignment threshold with cascade stages.
This approach relaxes the dependence on proposal centerness in the early stages
while ensuring an ample quantity of positives with high centerness in the later
stages. Experiments show that FCAF3D with our CascadeV achieves
state-of-the-art 3D object detection results with 70.4\% mAP@0.25 and 51.6\%
mAP@0.5 on SUN RGB-D and competitive results on ScanNet. Code will be released
at https://github.com/Sharpiless/CascadeV-Det
Related papers
- 3D Object Detection from Point Cloud via Voting Step Diffusion [52.9966883689137]
existing voting-based methods often receive votes from the partial surfaces of individual objects together with severe noises, leading to sub-optimal detection performance.
We propose a new method to move random 3D points toward the high-density region of the distribution by estimating the score function of the distribution with a noise conditioned score network.
Experiments on two large scale indoor 3D scene datasets, SUN RGB-D and ScanNet V2, demonstrate the superiority of our proposed method.
arXiv Detail & Related papers (2024-03-21T05:04:52Z) - SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection [23.208654655032955]
Current 3D object detection methods for indoor scenes mainly follow the voting-and-grouping strategy to generate proposals.
We propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detection.
Experimental results demonstrate our method achieves state-of-the-art performance on ScanNet V2, SUN RGB-D, and S3DIS datasets.
arXiv Detail & Related papers (2023-12-21T08:08:02Z) - Semi-Supervised and Long-Tailed Object Detection with CascadeMatch [91.86787064083012]
We propose a novel pseudo-labeling-based detector called CascadeMatch.
Our detector features a cascade network architecture, which has multi-stage detection heads with progressive confidence thresholds.
We show that CascadeMatch surpasses existing state-of-the-art semi-supervised approaches in handling long-tailed object detection.
arXiv Detail & Related papers (2023-05-24T07:09:25Z) - 3D Cascade RCNN: High Quality Object Detection in Point Clouds [122.42455210196262]
We present 3D Cascade RCNN, which allocates multiple detectors based on the voxelized point clouds in a cascade paradigm.
We validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques.
arXiv Detail & Related papers (2022-11-15T15:58:36Z) - CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds [55.44204039410225]
We present a novel two-stage fully sparse convolutional 3D object detection framework, named CAGroup3D.
Our proposed method first generates some high-quality 3D proposals by leveraging the class-aware local group strategy on the object surface voxels.
To recover the features of missed voxels due to incorrect voxel-wise segmentation, we build a fully sparse convolutional RoI pooling module.
arXiv Detail & Related papers (2022-10-09T13:38:48Z) - RBGNet: Ray-based Grouping for 3D Object Detection [104.98776095895641]
We propose the RBGNet framework, a voting-based 3D detector for accurate 3D object detection from point clouds.
We propose a ray-based feature grouping module, which aggregates the point-wise features on object surfaces using a group of determined rays.
Our model achieves state-of-the-art 3D detection performance on ScanNet V2 and SUN RGB-D with remarkable performance gains.
arXiv Detail & Related papers (2022-04-05T14:42:57Z) - 3D Object Detection Combining Semantic and Geometric Features from Point
Clouds [19.127930862527666]
We propose a novel end-to-end two-stage 3D object detector named SGNet for point clouds scenes.
The VTPM is a Voxel-Point-Based Module that finally implements 3D object detection in point space.
As of September 19, 2021, for KITTI dataset, SGNet ranked 1st in 3D and BEV detection on cyclists with easy difficulty level, and 2nd in the 3D detection of moderate cyclists.
arXiv Detail & Related papers (2021-10-10T04:43:27Z) - Anchor-free 3D Single Stage Detector with Mask-Guided Attention for
Point Cloud [79.39041453836793]
We develop a novel single-stage 3D detector for point clouds in an anchor-free manner.
We overcome this by converting the voxel-based sparse 3D feature volumes into the sparse 2D feature maps.
We propose an IoU-based detection confidence re-calibration scheme to improve the correlation between the detection confidence score and the accuracy of the bounding box regression.
arXiv Detail & Related papers (2021-08-08T13:42:13Z) - CenterNet3D: An Anchor Free Object Detector for Point Cloud [14.506796247331584]
We propose an anchor-free CenterNet3D network that performs 3D object detection without anchors.
Based on the center point, we propose an anchor-free CenterNet3D network that performs 3D object detection without anchors.
Our method outperforms all state-of-the-art anchor-based one-stage methods and has comparable performance to two-stage methods as well.
arXiv Detail & Related papers (2020-07-13T13:53:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.