3D IoU-Net: IoU Guided 3D Object Detector for Point Clouds
- URL: http://arxiv.org/abs/2004.04962v1
- Date: Fri, 10 Apr 2020 09:24:29 GMT
- Title: 3D IoU-Net: IoU Guided 3D Object Detector for Point Clouds
- Authors: Jiale Li, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding,
and Ling Shao
- Abstract summary: We add a 3D IoU prediction branch to the regular classification and regression branches.
We propose a 3D IoU-Net with IoU sensitive feature learning and an IoU alignment operation.
The experimental results on the KITTI car detection benchmark show that 3D IoU-Net with IoU perception achieves state-of-the-art performance.
- Score: 68.44740333471792
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most existing point cloud based 3D object detectors focus on the tasks of
classification and box regression. However, another bottleneck in this area is
achieving an accurate detection confidence for the Non-Maximum Suppression
(NMS) post-processing. In this paper, we add a 3D IoU prediction branch to the
regular classification and regression branches. The predicted IoU is used as
the detection confidence for NMS. In order to obtain a more accurate IoU
prediction, we propose a 3D IoU-Net with IoU sensitive feature learning and an
IoU alignment operation. To obtain a perspective-invariant prediction head, we
propose an Attentive Corner Aggregation (ACA) module by aggregating a local
point cloud feature from each perspective of eight corners and adaptively
weighting the contribution of each perspective with different attentions. We
propose a Corner Geometry Encoding (CGE) module for geometry information
embedding. To the best of our knowledge, this is the first time geometric
embedding information has been introduced in proposal feature learning. These
two feature parts are then adaptively fused by a multi-layer perceptron (MLP)
network as our IoU sensitive feature. The IoU alignment operation is introduced
to resolve the mismatching between the bounding box regression head and IoU
prediction, thereby further enhancing the accuracy of IoU prediction. The
experimental results on the KITTI car detection benchmark show that 3D IoU-Net
with IoU perception achieves state-of-the-art performance.
Related papers
- Unified-IoU: For High-Quality Object Detection [1.62877896907106]
We propose a new IoU loss function, called Unified-IoU (UIoU), which is more concerned with the weight assignment between different quality prediction boxes.
Our proposed method achieves better performance on multiple datasets, especially at a high IoU threshold.
arXiv Detail & Related papers (2024-08-13T04:56:45Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Rethinking IoU-based Optimization for Single-stage 3D Object Detection [103.83141677242871]
We propose a Rotation-Decoupled IoU (RDIoU) method that can mitigate the rotation-sensitivity issue.
Our RDIoU simplifies the complex interactions of regression parameters by decoupling the rotation variable as an independent term.
arXiv Detail & Related papers (2022-07-19T15:35:23Z) - Structure Aware and Class Balanced 3D Object Detection on nuScenes
Dataset [0.0]
NuTonomy's nuScenes dataset greatly extends commonly used datasets such as KITTI.
The localization precision of this model is affected by the loss of spatial information in the downscaled feature maps.
We propose to enhance the performance of the CBGS model by designing an auxiliary network, that makes full use of the structure information of the 3D point cloud.
arXiv Detail & Related papers (2022-05-25T06:18:49Z) - SIENet: Spatial Information Enhancement Network for 3D Object Detection
from Point Cloud [20.84329063509459]
LiDAR-based 3D object detection pushes forward an immense influence on autonomous vehicles.
Due to the limitation of the intrinsic properties of LiDAR, fewer points are collected at the objects farther away from the sensor.
To address the challenge, we propose a novel two-stage 3D object detection framework, named SIENet.
arXiv Detail & Related papers (2021-03-29T07:45:09Z) - CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point
Cloud [40.871850077528485]
We present a new single-stage detector named the Confident IoU-Aware Single-Stage object Detector (CIA-SSD)
CIA-SSD adaptively fuses high-level abstract semantic features and low-level spatial features for accurate predictions of bounding boxes and classification confidence.
It attains top performance in terms of the official ranking metric and above 32 FPS inference speed.
arXiv Detail & Related papers (2020-12-05T12:00:08Z) - InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic
Information Modeling [65.47126868838836]
We propose a novel 3D object detection framework with dynamic information modeling.
Coarse predictions are generated in the first stage via a voxel-based region proposal network.
Experiments are conducted on the large-scale nuScenes 3D detection benchmark.
arXiv Detail & Related papers (2020-07-16T18:27:08Z) - D3Feat: Joint Learning of Dense Detection and Description of 3D Local
Features [51.04841465193678]
We leverage a 3D fully convolutional network for 3D point clouds.
We propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.
Our method achieves state-of-the-art results in both indoor and outdoor scenarios.
arXiv Detail & Related papers (2020-03-06T12:51:09Z) - SESS: Self-Ensembling Semi-Supervised 3D Object Detection [138.80825169240302]
We propose SESS, a self-ensembling semi-supervised 3D object detection framework. Specifically, we design a thorough perturbation scheme to enhance generalization of the network on unlabeled and new unseen data.
Our SESS achieves competitive performance compared to the state-of-the-art fully-supervised method by using only 50% labeled data.
arXiv Detail & Related papers (2019-12-26T08:48:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.