Pillar-based Object Detection for Autonomous Driving
- URL: http://arxiv.org/abs/2007.10323v2
- Date: Sun, 26 Jul 2020 21:13:04 GMT
- Title: Pillar-based Object Detection for Autonomous Driving
- Authors: Yue Wang, Alireza Fathi, Abhijit Kundu, David Ross, Caroline
Pantofaru, Thomas Funkhouser, Justin Solomon
- Abstract summary: We present a simple and flexible object detection framework optimized for autonomous driving.
Building on the observation that point clouds in this application are extremely sparse, we propose a practical pillar-based approach to fix the issue caused by anchors.
Our algorithm incorporates a cylindrical projection into multi-view feature learning, predicts bounding box parameters per pillar rather than per point or per anchor, and includes an aligned pillar-to-point projection module to improve the final prediction.
- Score: 33.021347169775474
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a simple and flexible object detection framework optimized for
autonomous driving. Building on the observation that point clouds in this
application are extremely sparse, we propose a practical pillar-based approach
to fix the imbalance issue caused by anchors. In particular, our algorithm
incorporates a cylindrical projection into multi-view feature learning,
predicts bounding box parameters per pillar rather than per point or per
anchor, and includes an aligned pillar-to-point projection module to improve
the final prediction. Our anchor-free approach avoids hyperparameter search
associated with past methods, simplifying 3D object detection while
significantly improving upon state-of-the-art.
Related papers
- End-to-End 3D Object Detection using LiDAR Point Cloud [0.0]
We present an approach wherein, using a novel encoding of the LiDAR point cloud we infer the location of different classes near the autonomous vehicles.
The output is predictions about the location and orientation of objects in the scene in form of 3D bounding boxes and labels of scene objects.
arXiv Detail & Related papers (2023-12-24T00:52:14Z) - LEF: Late-to-Early Temporal Fusion for LiDAR 3D Object Detection [40.267769862404684]
We propose a late-to-early recurrent feature fusion scheme for 3D object detection using temporal LiDAR point clouds.
Our main motivation is fusing object-aware latent embeddings into the early stages of a 3D object detector.
arXiv Detail & Related papers (2023-09-28T21:58:25Z) - Dynamic Tiling: A Model-Agnostic, Adaptive, Scalable, and
Inference-Data-Centric Approach for Efficient and Accurate Small Object
Detection [3.8332251841430423]
Dynamic Tiling is a model-agnostic, adaptive, and scalable approach for small object detection.
Our method effectively resolves fragmented objects, improves detection accuracy, and minimizes computational overhead.
Overall, Dynamic Tiling outperforms existing model-agnostic uniform cropping methods.
arXiv Detail & Related papers (2023-09-20T05:25:12Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Improving Online Lane Graph Extraction by Object-Lane Clustering [106.71926896061686]
We propose an architecture and loss formulation to improve the accuracy of local lane graph estimates.
The proposed method learns to assign the objects to centerlines by considering the centerlines as cluster centers.
We show that our method can achieve significant performance improvements by using the outputs of existing 3D object detection methods.
arXiv Detail & Related papers (2023-07-20T15:21:28Z) - ConvPoseCNN2: Prediction and Refinement of Dense 6D Object Poses [23.348510362258402]
We propose a fully-convolutional extension of the PoseCNN method, which densely predicts object translations and orientations.
This has several advantages such as improving the spatial resolution of the orientation predictions.
We demonstrate that our method achieves the same accuracy as PoseCNN on the challenging YCB-Video dataset.
arXiv Detail & Related papers (2022-05-23T08:32:09Z) - DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection [16.21161769128316]
We present DAFNe: A one-stage Anchor-Free deep Network for oriented object detection.
As an anchor-free model, DAFNe reduces the prediction complexity by refraining from employing bounding box anchors.
We introduce an orientation-aware generalization of the center-ness function for arbitrarily oriented bounding boxes to down-weight low-quality predictions.
arXiv Detail & Related papers (2021-09-13T17:37:20Z) - InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic
Information Modeling [65.47126868838836]
We propose a novel 3D object detection framework with dynamic information modeling.
Coarse predictions are generated in the first stage via a voxel-based region proposal network.
Experiments are conducted on the large-scale nuScenes 3D detection benchmark.
arXiv Detail & Related papers (2020-07-16T18:27:08Z) - On the Arbitrary-Oriented Object Detection: Classification based
Approaches Revisited [94.5455251250471]
We first show that the boundary problem suffered in existing dominant regression-based rotation detectors, is caused by angular periodicity or corner ordering.
We transform the angular prediction task from a regression problem to a classification one.
For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.
arXiv Detail & Related papers (2020-03-12T03:23:54Z) - Robust 6D Object Pose Estimation by Learning RGB-D Features [59.580366107770764]
We propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem.
We uniformly sample rotation anchors in SO(3), and predict a constrained deviation from each anchor to the target, as well as uncertainty scores for selecting the best prediction.
Experiments on two benchmarks: LINEMOD and YCB-Video, show that the proposed method outperforms state-of-the-art approaches.
arXiv Detail & Related papers (2020-02-29T06:24:55Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.