Advancing Real-World Parking Slot Detection with Large-Scale Dataset and Semi-Supervised Baseline
- URL: http://arxiv.org/abs/2509.13133v1
- Date: Tue, 16 Sep 2025 14:50:19 GMT
- Title: Advancing Real-World Parking Slot Detection with Large-Scale Dataset and Semi-Supervised Baseline
- Authors: Zhihao Zhang, Chunyu Lin, Lang Nie, Jiyuan Wang, Yao Zhao,
- Abstract summary: This study focuses on parking slot detection using surround-view cameras, which offer a comprehensive bird's-eye view of the parking environment.<n>We first construct a large-scale parking slot detection dataset (CRPS-D), which includes various lighting distributions, diverse weather conditions, and challenging parking slot variants.<n>We develop a semi-supervised baseline for parking slot detection, termed SS-PSD, to further improve performance by exploiting unlabeled data.
- Score: 65.25540269603553
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As automatic parking systems evolve, the accurate detection of parking slots has become increasingly critical. This study focuses on parking slot detection using surround-view cameras, which offer a comprehensive bird's-eye view of the parking environment. However, the current datasets are limited in scale, and the scenes they contain are seldom disrupted by real-world noise (e.g., light, occlusion, etc.). Moreover, manual data annotation is prone to errors and omissions due to the complexity of real-world conditions, significantly increasing the cost of annotating large-scale datasets. To address these issues, we first construct a large-scale parking slot detection dataset (named CRPS-D), which includes various lighting distributions, diverse weather conditions, and challenging parking slot variants. Compared with existing datasets, the proposed dataset boasts the largest data scale and consists of a higher density of parking slots, particularly featuring more slanted parking slots. Additionally, we develop a semi-supervised baseline for parking slot detection, termed SS-PSD, to further improve performance by exploiting unlabeled data. To our knowledge, this is the first semi-supervised approach in parking slot detection, which is built on the teacher-student model with confidence-guided mask consistency and adaptive feature perturbation. Experimental results demonstrate the superiority of SS-PSD over the existing state-of-the-art (SoTA) solutions on both the proposed dataset and the existing dataset. Particularly, the more unlabeled data there is, the more significant the gains brought by our semi-supervised scheme. The relevant source codes and the dataset have been made publicly available at https://github.com/zzh362/CRPS-D.
Related papers
- TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving [3.037642191465275]
We have collected a new dataset comprising over 2.85 million frames using the CARLA simulation environment for the diverse Leaderboard 2.0 challenge scenarios.<n>Our dataset is designed not only for planning tasks but also supports dynamic object detection, lane divider detection, centerline detection, traffic light recognition, prediction tasks and visual language action models.
arXiv Detail & Related papers (2026-02-26T21:16:20Z) - From Few-Shot to Zero-Shot: Towards Generalist Graph Anomaly Detection [89.52759572485276]
ARC is a few-shot generalist GAD method that leverages in-context learning and requires only a few labeled normal samples at inference time.<n> ARC and ARC_zero effectively detect anomalies, exhibit strong generalization ability, and perform efficiently under few-shot and zero-shot settings.
arXiv Detail & Related papers (2026-02-21T10:59:00Z) - PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments [73.80718037070773]
We present the multi-modal Pedestrian-Focused Scene dataset, rigorously annotated in semi-structured scenes with the format of nuScenes.<n>We also propose a novel Hybrid Multi-Scale Fusion Network (HMFN) to detect pedestrians in densely populated and occluded scenarios.
arXiv Detail & Related papers (2025-02-21T09:57:53Z) - Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction [69.29802752614677]
RouteFormer is a novel ego-trajectory prediction network combining GPS data, environmental context, and the driver's field-of-view.<n>To tackle data scarcity and enhance diversity, we introduce GEM, a dataset of urban driving scenarios enriched with synchronized driver field-of-view and gaze data.
arXiv Detail & Related papers (2023-12-13T23:06:30Z) - Deep Single Models vs. Ensembles: Insights for a Fast Deployment of
Parking Monitoring Systems [3.00363876980149]
Intelligent parking monitoring is still a challenge since most approaches involve collecting and labeling large amounts of data.
Our study aims to uncover the challenges in creating a global framework, trained using publicly available labeled parking lot images.
We found that models trained on diverse datasets can achieve 95% accuracy without the burden of data annotation and model training on the target parking lot.
arXiv Detail & Related papers (2023-09-28T14:59:53Z) - PPD: A New Valet Parking Pedestrian Fisheye Dataset for Autonomous
Driving [18.71208933251644]
Parking Pedestrian dataset consists of several distinctive types of pedestrians captured with fisheye cameras.
We present a pedestrian detection baseline on PPD dataset, and introduce two data augmentation techniques to improve the baseline.
arXiv Detail & Related papers (2023-09-20T01:55:19Z) - LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting [65.71129509623587]
Road traffic forecasting plays a critical role in smart city initiatives and has experienced significant advancements thanks to the power of deep learning.
However, the promising results achieved on current public datasets may not be applicable to practical scenarios.
We introduce the LargeST benchmark dataset, which includes a total of 8,600 sensors in California with a 5-year time coverage.
arXiv Detail & Related papers (2023-06-14T05:48:36Z) - Multimodal Dataset from Harsh Sub-Terranean Environment with Aerosol
Particles for Frontier Exploration [55.41644538483948]
This paper introduces a multimodal dataset from the harsh and unstructured underground environment with aerosol particles.
It contains synchronized raw data measurements from all onboard sensors in Robot Operating System (ROS) format.
The focus of this paper is not only to capture both temporal and spatial data diversities but also to present the impact of harsh conditions on captured data.
arXiv Detail & Related papers (2023-04-27T20:21:18Z) - SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous
Driving [41.221988979184665]
SUPS is a simulated dataset for underground automatic parking.
It supports multiple tasks with multiple sensors and multiple semantic labels aligned with successive images.
We also evaluate the state-of-the-art SLAM algorithms and perception models on our dataset.
arXiv Detail & Related papers (2023-02-25T02:59:12Z) - Smart Parking Space Detection under Hazy conditions using Convolutional
Neural Networks: A Novel Approach [0.0]
This paper investigates the use of dehazing networks that improves the performance of parking space occupancy under hazy conditions.
The proposed system is deployable as part of existing smart parking systems where limited number of cameras are used to monitor hundreds of parking spaces.
arXiv Detail & Related papers (2022-01-15T14:15:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.