An Efficient Wide-Range Pseudo-3D Vehicle Detection Using A Single
Camera
- URL: http://arxiv.org/abs/2309.08369v1
- Date: Fri, 15 Sep 2023 12:50:09 GMT
- Title: An Efficient Wide-Range Pseudo-3D Vehicle Detection Using A Single
Camera
- Authors: Zhupeng Ye, Yinqi Li, Zejian Yuan
- Abstract summary: This paper proposes a novel wide-range Pseudo-3D Vehicle Detection method based on images from a single camera.
To detect pseudo-3D objects, our model adopts specifically designed detection heads.
Joint constraint loss combining both the object box and SPL is designed during model training, improving the efficiency, stability, and prediction accuracy of the model.
- Score: 10.573423265001706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Wide-range and fine-grained vehicle detection plays a critical role in
enabling active safety features in intelligent driving systems. However,
existing vehicle detection methods based on rectangular bounding boxes (BBox)
often struggle with perceiving wide-range objects, especially small objects at
long distances. And BBox expression cannot provide detailed geometric shape and
pose information of vehicles. This paper proposes a novel wide-range Pseudo-3D
Vehicle Detection method based on images from a single camera and incorporates
efficient learning methods. This model takes a spliced image as input, which is
obtained by combining two sub-window images from a high-resolution image. This
image format maximizes the utilization of limited image resolution to retain
essential information about wide-range vehicle objects. To detect pseudo-3D
objects, our model adopts specifically designed detection heads. These heads
simultaneously output extended BBox and Side Projection Line (SPL)
representations, which capture vehicle shapes and poses, enabling
high-precision detection. To further enhance the performance of detection, a
joint constraint loss combining both the object box and SPL is designed during
model training, improving the efficiency, stability, and prediction accuracy of
the model. Experimental results on our self-built dataset demonstrate that our
model achieves favorable performance in wide-range pseudo-3D vehicle detection
across multiple evaluation metrics. Our demo video has been placed at
https://www.youtube.com/watch?v=1gk1PmsQ5Q8.
Related papers
- HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective [11.841338298700421]
We propose a novel 3D object detection framework integrating Spatial Former and Voxel Pooling Former to enhance 2D-to-3D projection based on height estimation.
Experiments were conducted using the Rope3D and DAIR-V2X-I dataset, and the results demonstrated the outperformance of the proposed algorithm in the detection of both vehicles and cyclists.
arXiv Detail & Related papers (2024-10-10T09:37:33Z) - 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features [70.50665869806188]
3DiffTection is a state-of-the-art method for 3D object detection from single images.
We fine-tune a diffusion model to perform novel view synthesis conditioned on a single image.
We further train the model on target data with detection supervision.
arXiv Detail & Related papers (2023-11-07T23:46:41Z) - AdaptiveShape: Solving Shape Variability for 3D Object Detection with
Geometry Aware Anchor Distributions [1.3807918535446089]
3D object detection with point clouds and images plays an important role in perception tasks such as autonomous driving.
Current methods show great performance on detection and pose estimation of standard-shaped vehicles but lack behind on more complex shapes.
This work introduces several new methods to improve and measure the performance for such classes.
arXiv Detail & Related papers (2023-02-28T12:31:31Z) - DETR4D: Direct Multi-View 3D Object Detection with Sparse Attention [50.11672196146829]
3D object detection with surround-view images is an essential task for autonomous driving.
We propose DETR4D, a Transformer-based framework that explores sparse attention and direct feature query for 3D object detection in multi-view images.
arXiv Detail & Related papers (2022-12-15T14:18:47Z) - A Simple Baseline for Multi-Camera 3D Object Detection [94.63944826540491]
3D object detection with surrounding cameras has been a promising direction for autonomous driving.
We present SimMOD, a Simple baseline for Multi-camera Object Detection.
We conduct extensive experiments on the 3D object detection benchmark of nuScenes to demonstrate the effectiveness of SimMOD.
arXiv Detail & Related papers (2022-08-22T03:38:01Z) - AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D
Object Detection [17.526914782562528]
We propose AutoAlignV2, a faster and stronger multi-modal 3D detection framework, built on top of AutoAlign.
Our best model reaches 72.4 NDS on nuScenes test leaderboard, achieving new state-of-the-art results.
arXiv Detail & Related papers (2022-07-21T06:17:23Z) - Weakly Supervised Training of Monocular 3D Object Detectors Using Wide
Baseline Multi-view Traffic Camera Data [19.63193201107591]
7DoF prediction of vehicles at an intersection is an important task for assessing potential conflicts between road users.
We develop an approach using a weakly supervised method of fine tuning 3D object detectors for traffic observation cameras.
Our method achieves vehicle 7DoF pose prediction accuracy on our dataset comparable to the top performing monocular 3D object detectors on autonomous vehicle datasets.
arXiv Detail & Related papers (2021-10-21T08:26:48Z) - High-level camera-LiDAR fusion for 3D object detection with machine
learning [0.0]
This paper tackles the 3D object detection problem, which is of vital importance for applications such as autonomous driving.
It uses a Machine Learning pipeline on a combination of monocular camera and LiDAR data to detect vehicles in the surrounding 3D space of a moving platform.
Our results demonstrate an efficient and accurate inference on a validation set, achieving an overall accuracy of 87.1%.
arXiv Detail & Related papers (2021-05-24T01:57:34Z) - PLUME: Efficient 3D Object Detection from Stereo Images [95.31278688164646]
Existing methods tackle the problem in two steps: first depth estimation is performed, a pseudo LiDAR point cloud representation is computed from the depth estimates, and then object detection is performed in 3D space.
We propose a model that unifies these two tasks in the same metric space.
Our approach achieves state-of-the-art performance on the challenging KITTI benchmark, with significantly reduced inference time compared with existing methods.
arXiv Detail & Related papers (2021-01-17T05:11:38Z) - PerMO: Perceiving More at Once from a Single Image for Autonomous
Driving [76.35684439949094]
We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image.
Our approach combines the strengths of deep learning and the elegance of traditional techniques.
We have integrated these algorithms with an autonomous driving system.
arXiv Detail & Related papers (2020-07-16T05:02:45Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.