Holistic Parking Slot Detection with Polygon-Shaped Representations
- URL: http://arxiv.org/abs/2310.11629v1
- Date: Tue, 17 Oct 2023 23:37:23 GMT
- Title: Holistic Parking Slot Detection with Polygon-Shaped Representations
- Authors: Lihao Wang, Antonyo Musabini, Christel Leonet, Rachid Benmokhtar,
Amaury Breheret, Chaima Yedes, Fabian Burger, Thomas Boulay, Xavier Perrotton
- Abstract summary: We propose one-step Holistic Parking Slot Network (HPS-Net), a tailor-made adaptation of the You Only Look Once (YOLO)v4 algorithm.
Experiments show that HPS-Net can detect various vacant parking slots with a F1-score of 0.92.
It achieves a real-time detection speed of 17 FPS on Nvidia Drive AGX Xavier.
- Score: 1.1649926489639983
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Current parking slot detection in advanced driver-assistance systems (ADAS)
primarily relies on ultrasonic sensors. This method has several limitations
such as the need to scan the entire parking slot before detecting it, the
incapacity of detecting multiple slots in a row, and the difficulty of
classifying them. Due to the complex visual environment, vehicles are equipped
with surround view camera systems to detect vacant parking slots. Previous
research works in this field mostly use image-domain models to solve the
problem. These two-stage approaches separate the 2D detection and 3D pose
estimation steps using camera calibration. In this paper, we propose one-step
Holistic Parking Slot Network (HPS-Net), a tailor-made adaptation of the You
Only Look Once (YOLO)v4 algorithm. This camera-based approach directly outputs
the four vertex coordinates of the parking slot in topview domain, instead of a
bounding box in raw camera images. Several visible points and shapes can be
proposed from different angles. A novel regression loss function named
polygon-corner Generalized Intersection over Union (GIoU) for polygon vertex
position optimization is also proposed to manage the slot orientation and to
distinguish the entrance line. Experiments show that HPS-Net can detect various
vacant parking slots with a F1-score of 0.92 on our internal Valeo Parking
Slots Dataset (VPSD) and 0.99 on the public dataset PS2.0. It provides a
satisfying generalization and robustness in various parking scenarios, such as
indoor (F1: 0.86) or paved ground (F1: 0.91). Moreover, it achieves a real-time
detection speed of 17 FPS on Nvidia Drive AGX Xavier. A demo video can be found
at https://streamable.com/75j7sj.
Related papers
- Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers [1.1227698959066101]
Current parking area perception algorithms primarily focus on detecting vacant slots within a limited range.
This paper introduces Multi-Task Fisheye Cross View Transformers (MT F-CVT)
MT F-CVT positions objects within a 25m x 25m real open-road scenes with an average error of only 20 cm.
arXiv Detail & Related papers (2024-08-22T17:42:16Z) - Automatic Vision-Based Parking Slot Detection and Occupancy
Classification [3.038642416291856]
Parking guidance information (PGI) systems are used to provide information to drivers about the nearest parking lots and the number of vacant parking slots.
Recently, vision-based solutions started to appear as a cost-effective alternative to standard PGI systems.
In this paper, the algorithm that performs Automatic Parking Slot Detection and Occupancy Classification (APSD-OC) solely on input images is proposed.
arXiv Detail & Related papers (2023-08-16T07:44:34Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - PersFormer: 3D Lane Detection via Perspective Transformer and the
OpenLane Benchmark [109.03773439461615]
PersFormer is an end-to-end monocular 3D lane detector with a novel Transformer-based spatial feature transformation module.
We release one of the first large-scale real-world 3D lane datasets, called OpenLane, with high-quality annotation and scenario diversity.
arXiv Detail & Related papers (2022-03-21T16:12:53Z) - Automatic Map Update Using Dashcam Videos [1.6911482053867475]
We propose an SfM-based solution for automatic map update, with a focus on real-time change detection and localization.
Our system can locate the objects detected from 2D images in a 3D space, utilizing sparse SfM point clouds.
arXiv Detail & Related papers (2021-09-24T18:00:57Z) - Robust 2D/3D Vehicle Parsing in CVIS [54.825777404511605]
We present a novel approach to robustly detect and perceive vehicles in different camera views as part of a cooperative vehicle-infrastructure system (CVIS)
Our formulation is designed for arbitrary camera views and makes no assumptions about intrinsic or extrinsic parameters.
In practice, our approach outperforms SOTA methods on 2D detection, instance segmentation, and 6-DoF pose estimation.
arXiv Detail & Related papers (2021-03-11T03:35:05Z) - Multiview Detection with Feature Perspective Transformation [59.34619548026885]
We propose a novel multiview detection system, MVDet.
We take an anchor-free approach to aggregate multiview information by projecting feature maps onto the ground plane.
Our entire model is end-to-end learnable and achieves 88.2% MODA on the standard Wildtrack dataset.
arXiv Detail & Related papers (2020-07-14T17:58:30Z) - MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous
Driving Using Multiple Views [60.538802124885414]
We present Multi-View LidarNet (MVLidarNet), a two-stage deep neural network for multi-class object detection and drivable space segmentation.
MVLidarNet is able to detect and classify objects while simultaneously determining the drivable space using a single LiDAR scan as input.
We show results on both KITTI and a much larger internal dataset, thus demonstrating the method's ability to scale by an order of magnitude.
arXiv Detail & Related papers (2020-06-09T21:28:17Z) - PSDet: Efficient and Universal Parking Slot Detection [14.085693334348827]
Real-time parking slot detection plays a critical role in valet parking systems.
Existing methods have limited success in real-world applications.
We argue two reasons accounting for the unsatisfactory performance:.
romannumeral1, The available datasets have limited diversity, which causes the low generalization ability.
arXiv Detail & Related papers (2020-05-12T03:06:25Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.