Objects are Different: Flexible Monocular 3D Object Detection
- URL: http://arxiv.org/abs/2104.02323v1
- Date: Tue, 6 Apr 2021 07:01:28 GMT
- Title: Objects are Different: Flexible Monocular 3D Object Detection
- Authors: Yunpeng Zhang, Jiwen Lu, Jie Zhou
- Abstract summary: We propose a flexible framework for monocular 3D object detection which explicitly decouples the truncated objects and adaptively combines multiple approaches for object depth estimation.
Experiments demonstrate that our method outperforms the state-of-the-art method by relatively 27% for the moderate level and 30% for the hard level in the test set of KITTI benchmark.
- Score: 87.82253067302561
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The precise localization of 3D objects from a single image without depth
information is a highly challenging problem. Most existing methods adopt the
same approach for all objects regardless of their diverse distributions,
leading to limited performance for truncated objects. In this paper, we propose
a flexible framework for monocular 3D object detection which explicitly
decouples the truncated objects and adaptively combines multiple approaches for
object depth estimation. Specifically, we decouple the edge of the feature map
for predicting long-tail truncated objects so that the optimization of normal
objects is not influenced. Furthermore, we formulate the object depth
estimation as an uncertainty-guided ensemble of directly regressed object depth
and solved depths from different groups of keypoints. Experiments demonstrate
that our method outperforms the state-of-the-art method by relatively 27\% for
the moderate level and 30\% for the hard level in the test set of KITTI
benchmark while maintaining real-time efficiency. Code will be available at
\url{https://github.com/zhangyp15/MonoFlex}.
Related papers
- Depth-discriminative Metric Learning for Monocular 3D Object Detection [14.554132525651868]
We introduce a novel metric learning scheme that encourages the model to extract depth-discriminative features regardless of the visual attributes.
Our method consistently improves the performance of various baselines by 23.51% and 5.78% on average.
arXiv Detail & Related papers (2024-01-02T07:34:09Z) - What You See Is What You Detect: Towards better Object Densification in
3D detection [2.3436632098950456]
The widely-used full-shape completion approach actually leads to a higher error-upper bound especially for far away objects and small objects like pedestrians.
We introduce a visible part completion method that requires only 11.3% of the prediction points that previous methods generate.
To recover the dense representation, we propose a mesh-deformation-based method to augment the point set associated with visible foreground objects.
arXiv Detail & Related papers (2023-10-27T01:46:37Z) - DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object
Detection and Tracking [67.34803048690428]
We propose to model Dynamic Objects in RecurrenT (DORT) to tackle this problem.
DORT extracts object-wise local volumes for motion estimation that also alleviates the heavy computational burden.
It is flexible and practical that can be plugged into most camera-based 3D object detectors.
arXiv Detail & Related papers (2023-03-29T12:33:55Z) - 3D Video Object Detection with Learnable Object-Centric Global
Optimization [65.68977894460222]
Correspondence-based optimization is the cornerstone for 3D scene reconstruction but is less studied in 3D video object detection.
We propose BA-Det, an end-to-end optimizable object detector with object-centric temporal correspondence learning and featuremetric object bundle adjustment.
arXiv Detail & Related papers (2023-03-27T17:39:39Z) - LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation [69.70498875887611]
LocPoseNet is able to robustly learn location prior for unseen objects.
Our method outperforms existing works by a large margin on LINEMOD and GenMOP.
arXiv Detail & Related papers (2022-11-29T15:21:34Z) - Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular
3D Object Detection [37.37316176663782]
We propose a depth solving system that fully explores the visual clues from the subtasks in monocular 3D images.
Our method surpasses the current best method by more than 20% relatively on the Moderate level of test split in the KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2022-05-19T08:12:55Z) - ImpDet: Exploring Implicit Fields for 3D Object Detection [74.63774221984725]
We introduce a new perspective that views bounding box regression as an implicit function.
This leads to our proposed framework, termed Implicit Detection or ImpDet.
Our ImpDet assigns specific values to points in different local 3D spaces, thereby high-quality boundaries can be generated.
arXiv Detail & Related papers (2022-03-31T17:52:12Z) - ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose
Estimation [76.31125154523056]
We present a discrete descriptor, which can represent the object surface densely.
We also propose a coarse to fine training strategy, which enables fine-grained correspondence prediction.
arXiv Detail & Related papers (2022-03-17T16:16:24Z) - Iterative Corresponding Geometry: Fusing Region and Depth for Highly
Efficient 3D Tracking of Textureless Objects [25.448657318818764]
ICG is a novel probabilistic tracker that fuses region and depth information and only requires the object geometry.
Our method deploys correspondence lines and points to iteratively refine the pose.
Experiments on the YCB-Video, OPT, and Choi datasets demonstrate that, even for textured objects, our approach outperforms the current state of the art.
arXiv Detail & Related papers (2022-03-10T12:30:50Z) - Few-Shot Object Detection and Viewpoint Estimation for Objects in the
Wild [40.132988301147776]
We tackle the problems of few-shot object detection and few-shot viewpoint estimation.
We demonstrate on both tasks the benefits of guiding the network prediction with class-representative features extracted from data.
Our method outperforms state-of-the-art methods by a large margin on a range of datasets.
arXiv Detail & Related papers (2020-07-23T16:17:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.