ImpDet: Exploring Implicit Fields for 3D Object Detection
- URL: http://arxiv.org/abs/2203.17240v1
- Date: Thu, 31 Mar 2022 17:52:12 GMT
- Title: ImpDet: Exploring Implicit Fields for 3D Object Detection
- Authors: Xuelin Qian and Li Wang and Yi Zhu and Li Zhang and Yanwei Fu and
Xiangyang Xue
- Abstract summary: We introduce a new perspective that views bounding box regression as an implicit function.
This leads to our proposed framework, termed Implicit Detection or ImpDet.
Our ImpDet assigns specific values to points in different local 3D spaces, thereby high-quality boundaries can be generated.
- Score: 74.63774221984725
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Conventional 3D object detection approaches concentrate on bounding boxes
representation learning with several parameters, i.e., localization, dimension,
and orientation. Despite its popularity and universality, such a
straightforward paradigm is sensitive to slight numerical deviations,
especially in localization. By exploiting the property that point clouds are
naturally captured on the surface of objects along with accurate location and
intensity information, we introduce a new perspective that views bounding box
regression as an implicit function. This leads to our proposed framework,
termed Implicit Detection or ImpDet, which leverages implicit field learning
for 3D object detection. Our ImpDet assigns specific values to points in
different local 3D spaces, thereby high-quality boundaries can be generated by
classifying points inside or outside the boundary. To solve the problem of
sparsity on the object surface, we further present a simple yet efficient
virtual sampling strategy to not only fill the empty region, but also learn
rich semantic features to help refine the boundaries. Extensive experimental
results on KITTI and Waymo benchmarks demonstrate the effectiveness and
robustness of unifying implicit fields into object detection.
Related papers
- Open Vocabulary Monocular 3D Object Detection [10.424711580213616]
We pioneer the study of open-vocabulary monocular 3D object detection, a novel task that aims to detect and localize objects in 3D space from a single RGB image.
We introduce a class-agnostic approach that leverages open-vocabulary 2D detectors and lifts 2D bounding boxes into 3D space.
Our approach decouples the recognition and localization of objects in 2D from the task of estimating 3D bounding boxes, enabling generalization across unseen categories.
arXiv Detail & Related papers (2024-11-25T18:59:17Z) - Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments [67.83787474506073]
We tackle the limitations of current LiDAR-based 3D object detection systems.
We introduce a universal textscFind n' Propagate approach for 3D OV tasks.
We achieve up to a 3.97-fold increase in Average Precision (AP) for novel object classes.
arXiv Detail & Related papers (2024-03-20T12:51:30Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object
Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA)
Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling.
In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z) - Oriented Bounding Boxes for Small and Freely Rotated Objects [7.6997148655751895]
A novel object detection method is presented that handles freely rotated objects of arbitrary sizes.
The method encodes the precise location and orientation of features of the target objects at grid cell locations.
Evaluations on the xView and DOTA datasets show that the proposed method uniformly improves performance over existing state-of-the-art methods.
arXiv Detail & Related papers (2021-04-24T02:04:49Z) - Objects are Different: Flexible Monocular 3D Object Detection [87.82253067302561]
We propose a flexible framework for monocular 3D object detection which explicitly decouples the truncated objects and adaptively combines multiple approaches for object depth estimation.
Experiments demonstrate that our method outperforms the state-of-the-art method by relatively 27% for the moderate level and 30% for the hard level in the test set of KITTI benchmark.
arXiv Detail & Related papers (2021-04-06T07:01:28Z) - Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud
Object Detection [64.2159881697615]
Object detection from 3D point clouds remains a challenging task, though recent studies pushed the envelope with the deep learning techniques.
We propose a domain adaptation like approach to enhance the robustness of the feature representation.
Our simple yet effective approach fundamentally boosts the performance of 3D point cloud object detection and achieves the state-of-the-art results.
arXiv Detail & Related papers (2020-06-08T05:15:06Z) - Object as Hotspots: An Anchor-Free 3D Object Detection Approach via
Firing of Hotspots [37.16690737208046]
We argue for an approach opposite to existing methods using object-level anchors.
Inspired by compositional models, we propose an object as composition of its interior non-empty voxels, termed hotspots.
Based on OHS, we propose an anchor-free detection head with a novel ground truth assignment strategy.
arXiv Detail & Related papers (2019-12-30T03:02:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.