PSDet: Efficient and Universal Parking Slot Detection
- URL: http://arxiv.org/abs/2005.05528v1
- Date: Tue, 12 May 2020 03:06:25 GMT
- Title: PSDet: Efficient and Universal Parking Slot Detection
- Authors: Zizhang Wu, Weiwei Sun, Man Wang, Xiaoquan Wang, Lizhu Ding, Fan Wang
- Abstract summary: Real-time parking slot detection plays a critical role in valet parking systems.
Existing methods have limited success in real-world applications.
We argue two reasons accounting for the unsatisfactory performance:.
romannumeral1, The available datasets have limited diversity, which causes the low generalization ability.
- Score: 14.085693334348827
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While real-time parking slot detection plays a critical role in valet parking
systems, existing methods have limited success in real-world applications. We
argue two reasons accounting for the unsatisfactory performance:
\romannumeral1, The available datasets have limited diversity, which causes the
low generalization ability. \romannumeral2, Expert knowledge for parking slot
detection is under-estimated. Thus, we annotate a large-scale benchmark for
training the network and release it for the benefit of community. Driven by the
observation of various parking lots in our benchmark, we propose the circular
descriptor to regress the coordinates of parking slot vertexes and accordingly
localize slots accurately. To further boost the performance, we develop a
two-stage deep architecture to localize vertexes in the coarse-to-fine manner.
In our benchmark and other datasets, it achieves the state-of-the-art accuracy
while being real-time in practice. Benchmark is available at:
https://github.com/wuzzh/Parking-slot-dataset
Related papers
- LaB-CL: Localized and Balanced Contrastive Learning for improving parking slot detection [4.813333335683417]
We propose the first supervised contrastive learning framework for parking slot detection, Localized and Balanced Contrastive Learning (LaB-CL)
The proposed LaB-CL framework uses two main approaches. First, we propose to include class prototypes to consider representations from all classes in every mini batch, from the local perspective. Second, we propose a new hard negative sampling scheme that selects local representations with high prediction error. Experiments with the benchmark dataset demonstrate that the proposed LaB-CL framework can outperform existing parking slot detection methods.
arXiv Detail & Related papers (2024-10-10T11:50:26Z) - Holistic Parking Slot Detection with Polygon-Shaped Representations [1.1649926489639983]
We propose one-step Holistic Parking Slot Network (HPS-Net), a tailor-made adaptation of the You Only Look Once (YOLO)v4 algorithm.
Experiments show that HPS-Net can detect various vacant parking slots with a F1-score of 0.92.
It achieves a real-time detection speed of 17 FPS on Nvidia Drive AGX Xavier.
arXiv Detail & Related papers (2023-10-17T23:37:23Z) - Towards Real-World Visual Tracking with Temporal Contexts [64.7981374129495]
We propose a two-level framework (TCTrack) that can exploit temporal contexts efficiently.
Based on it, we propose a stronger version for real-world visual tracking, i.e., TCTrack++.
For feature extraction, we propose an attention-based temporally adaptive convolution to enhance the spatial features.
For similarity map refinement, we introduce an adaptive temporal transformer to encode the temporal knowledge efficiently.
arXiv Detail & Related papers (2023-08-20T17:59:40Z) - Beyond Prediction: On-street Parking Recommendation using Heterogeneous
Graph-based List-wise Ranking [18.08128929432942]
We first time propose an on-street parking recommendation (OPR) task to directly recommend a parking space for a driver.
We design a highly efficient heterogeneous graph called ESGraph to represent historical and real-time meters' turnover events.
A ranking model is further utilized to learn a score function that helps recommend a list of ranked parking spots.
arXiv Detail & Related papers (2023-04-29T03:59:35Z) - SFD2: Semantic-guided Feature Detection and Description [34.36397639248686]
We propose to extract globally reliable features by implicitly embedding high-level semantics into both the detection and description processes.
Specifically, our semantic-aware detector is able to detect keypoints from reliable regions and suppress unreliable areas.
This boosts the accuracy of keypoint matching by reducing the number of features sensitive to appearance changes.
arXiv Detail & Related papers (2023-04-28T13:34:26Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - CNN-based Two-Stage Parking Slot Detection Using Region-Specific
Multi-Scale Feature Extraction [7.652701739127332]
Parking slot detection performance has been dramatically improved by deep learning techniques.
Deep learning-based object detection methods can be categorized into one-stage and two-stage approaches.
This paper proposes a highly specialized two-stage parking slot detector that uses region-specific multi-scale feature extraction.
arXiv Detail & Related papers (2021-08-13T12:02:02Z) - Deep Multi-Task Learning for Joint Localization, Perception, and
Prediction [68.50217234419922]
This paper investigates the issues that arise in state-of-the-art autonomy stacks under localization error.
We design a system that jointly performs perception, prediction, and localization.
Our architecture is able to reuse computation between both tasks, and is thus able to correct localization errors efficiently.
arXiv Detail & Related papers (2021-01-17T17:20:31Z) - SOE-Net: A Self-Attention and Orientation Encoding Network for Point
Cloud based Place Recognition [50.9889997200743]
We tackle the problem of place recognition from point cloud data with a self-attention and orientation encoding network (SOE-Net)
SOE-Net fully explores the relationship between points and incorporates long-range context into point-wise local descriptors.
Experiments on various benchmark datasets demonstrate superior performance of the proposed network over the current state-of-the-art approaches.
arXiv Detail & Related papers (2020-11-24T22:28:25Z) - LiDAR-based Panoptic Segmentation via Dynamic Shifting Network [56.71765153629892]
LiDAR-based panoptic segmentation aims to parse both objects and scenes in a unified manner.
We propose the Dynamic Shifting Network (DS-Net), which serves as an effective panoptic segmentation framework in the point cloud realm.
Our proposed DS-Net achieves superior accuracies over current state-of-the-art methods.
arXiv Detail & Related papers (2020-11-24T08:44:46Z) - Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs [72.67604044776662]
We tackle the problem of of-temporal tagging of self-driving scenes from raw sensor data.
Our approach learns a universal embedding for all tags, enabling efficient tagging of many attributes and faster learning of new attributes with limited data.
arXiv Detail & Related papers (2020-11-12T02:18:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.