3DSSD: Point-based 3D Single Stage Object Detector
- URL: http://arxiv.org/abs/2002.10187v1
- Date: Mon, 24 Feb 2020 12:01:58 GMT
- Title: 3DSSD: Point-based 3D Single Stage Object Detector
- Authors: Zetong Yang, Yanan Sun, Shu Liu, Jiaya Jia
- Abstract summary: We present a point-based 3D single stage object detector, named 3DSSD, achieving a good balance between accuracy and efficiency.
Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well.
- Score: 61.67928229961813
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Currently, there have been many kinds of voxel-based 3D single stage
detectors, while point-based single stage methods are still underexplored. In
this paper, we first present a lightweight and effective point-based 3D single
stage object detector, named 3DSSD, achieving a good balance between accuracy
and efficiency. In this paradigm, all upsampling layers and refinement stage,
which are indispensable in all existing point-based methods, are abandoned to
reduce the large computation cost. We novelly propose a fusion sampling
strategy in downsampling process to make detection on less representative
points feasible. A delicate box prediction network including a candidate
generation layer, an anchor-free regression head with a 3D center-ness
assignment strategy is designed to meet with our demand of accuracy and speed.
Our paradigm is an elegant single stage anchor-free framework, showing great
superiority to other existing methods. We evaluate 3DSSD on widely used KITTI
dataset and more challenging nuScenes dataset. Our method outperforms all
state-of-the-art voxel-based single stage methods by a large margin, and has
comparable performance to two stage point-based methods as well, with inference
speed more than 25 FPS, 2x faster than former state-of-the-art point-based
methods.
Related papers
- IC-FPS: Instance-Centroid Faster Point Sampling Module for 3D Point-base
Object Detection [0.5249805590164902]
Our research focuses on tackling low efficiency issue of point-based methods on large-scale point clouds.
Existing point-based methods adopt farthest point sampling (FPS) strategy for downsampling, which is computationally expensive in terms of inference time and memory consumption when the number of point cloud increases.
We propose a novel Instance-Centroid Faster Point Sampling Module (IC-FPS) which effectively replaces the first Set Abstraction (SA) layer that is extremely tedious.
arXiv Detail & Related papers (2023-03-31T09:31:29Z) - PointInst3D: Segmenting 3D Instances by Points [136.7261709896713]
We propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion.
We find the key to its success is assigning a suitable target to each sampled point.
Our approach achieves promising results on both ScanNet and S3DIS benchmarks.
arXiv Detail & Related papers (2022-04-25T02:41:46Z) - SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object
Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA)
Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling.
In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z) - AFDetV2: Rethinking the Necessity of the Second Stage for Object
Detection from Point Clouds [15.72821609622122]
We develop a single-stage anchor-free network for 3D detection from point clouds.
We use a self-calibrated convolution block in the backbone, a keypoint auxiliary supervision, and an IoU prediction branch in the multi-task head.
We win the 1st place in the Real-Time 3D Challenge 2021.
arXiv Detail & Related papers (2021-12-16T21:22:17Z) - Learning Semantic Segmentation of Large-Scale Point Clouds with Random
Sampling [52.464516118826765]
We introduce RandLA-Net, an efficient and lightweight neural architecture to infer per-point semantics for large-scale point clouds.
The key to our approach is to use random point sampling instead of more complex point selection approaches.
Our RandLA-Net can process 1 million points in a single pass up to 200x faster than existing approaches.
arXiv Detail & Related papers (2021-07-06T05:08:34Z) - Lite-FPN for Keypoint-based Monocular 3D Object Detection [18.03406686769539]
Keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off.
We propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion.
Our proposed method achieves significantly higher accuracy and frame rate at the same time.
arXiv Detail & Related papers (2021-05-01T14:44:31Z) - Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer
Learning [67.40866334083941]
We propose an end-to-end 3-D lightweight convolutional neural network (CNN) for limited samples-based HSI classification.
Compared with conventional 3-D-CNN models, the proposed 3-D-LWNet has a deeper network structure, less parameters, and lower computation cost.
Our model achieves competitive performance for HSI classification compared to several state-of-the-art methods.
arXiv Detail & Related papers (2020-12-07T03:44:35Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.