ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking
- URL: http://arxiv.org/abs/2601.01386v1
- Date: Sun, 04 Jan 2026 05:54:13 GMT
- Title: ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking
- Authors: Xiaobao Wei, Zhangjie Ye, Yuxiang Gu, Zunjie Zhu, Yunfei Guo, Yingying Shen, Shan Zhao, Ming Lu, Haiyang Sun, Bing Wang, Guang Chen, Rongfeng Lu, Hangjun Ye,
- Abstract summary: Parking is a critical task for autonomous driving systems (ADS), with unique challenges in crowded parking slots and GPS-denied environments.<n>Existing works focus on 2D parking slot perception, mapping, and localization, 3D reconstruction remains underexplored.<n>We propose ParkGaussian, the first framework that integrates 3D Gaussian Splatting (3DGS) for parking scene reconstruction.
- Score: 27.87315646330573
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Parking is a critical task for autonomous driving systems (ADS), with unique challenges in crowded parking slots and GPS-denied environments. However, existing works focus on 2D parking slot perception, mapping, and localization, 3D reconstruction remains underexplored, which is crucial for capturing complex spatial geometry in parking scenarios. Naively improving the visual quality of reconstructed parking scenes does not directly benefit autonomous parking, as the key entry point for parking is the slots perception module. To address these limitations, we curate the first benchmark named ParkRecon3D, specifically designed for parking scene reconstruction. It includes sensor data from four surround-view fisheye cameras with calibrated extrinsics and dense parking slot annotations. We then propose ParkGaussian, the first framework that integrates 3D Gaussian Splatting (3DGS) for parking scene reconstruction. To further improve the alignment between reconstruction and downstream parking slot detection, we introduce a slot-aware reconstruction strategy that leverages existing parking perception methods to enhance the synthesis quality of slot regions. Experiments on ParkRecon3D demonstrate that ParkGaussian achieves state-of-the-art reconstruction quality and better preserves perception consistency for downstream tasks. The code and dataset will be released at: https://github.com/wm-research/ParkGaussian
Related papers
- Rectify and Align GPS Points to Parking Spots via Rank-1 Constraint [7.9112014094183145]
High-rise buildings tend to cause GPS points to drift from the actual locations of parking spots.<n>It is a non-trivial task to correct a few wrong GPS points from a large number of parking spots in an unsupervised approach.<n>We propose an unsupervised low-rank method to effectively rectify errors in GPS points and further align them to the parking spots in a unified framework.
arXiv Detail & Related papers (2025-10-15T11:36:27Z) - Advancing Real-World Parking Slot Detection with Large-Scale Dataset and Semi-Supervised Baseline [65.25540269603553]
This study focuses on parking slot detection using surround-view cameras, which offer a comprehensive bird's-eye view of the parking environment.<n>We first construct a large-scale parking slot detection dataset (CRPS-D), which includes various lighting distributions, diverse weather conditions, and challenging parking slot variants.<n>We develop a semi-supervised baseline for parking slot detection, termed SS-PSD, to further improve performance by exploiting unlabeled data.
arXiv Detail & Related papers (2025-09-16T14:50:19Z) - 3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline [66.90611944550392]
StreetUnveiler learns a 3D representation of an empty street from crowded observations.<n>We divide the empty street scene into observed, partial-observed, and unobserved regions.<n>Experiments conducted on the street scene dataset successfully reconstructed a 3D representation of the empty street.
arXiv Detail & Related papers (2024-05-28T17:57:12Z) - Holistic Parking Slot Detection with Polygon-Shaped Representations [1.1649926489639983]
We propose one-step Holistic Parking Slot Network (HPS-Net), a tailor-made adaptation of the You Only Look Once (YOLO)v4 algorithm.
Experiments show that HPS-Net can detect various vacant parking slots with a F1-score of 0.92.
It achieves a real-time detection speed of 17 FPS on Nvidia Drive AGX Xavier.
arXiv Detail & Related papers (2023-10-17T23:37:23Z) - Automatic Vision-Based Parking Slot Detection and Occupancy
Classification [3.038642416291856]
Parking guidance information (PGI) systems are used to provide information to drivers about the nearest parking lots and the number of vacant parking slots.
Recently, vision-based solutions started to appear as a cost-effective alternative to standard PGI systems.
In this paper, the algorithm that performs Automatic Parking Slot Detection and Occupancy Classification (APSD-OC) solely on input images is proposed.
arXiv Detail & Related papers (2023-08-16T07:44:34Z) - Satellite Image Based Cross-view Localization for Autonomous Vehicle [59.72040418584396]
This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization up to a satisfactory accuracy.
Our method is validated on KITTI and Ford Multi-AV Seasonal datasets as ground view and Google Maps as the satellite view.
arXiv Detail & Related papers (2022-07-27T13:16:39Z) - PersFormer: 3D Lane Detection via Perspective Transformer and the
OpenLane Benchmark [109.03773439461615]
PersFormer is an end-to-end monocular 3D lane detector with a novel Transformer-based spatial feature transformation module.
We release one of the first large-scale real-world 3D lane datasets, called OpenLane, with high-quality annotation and scenario diversity.
arXiv Detail & Related papers (2022-03-21T16:12:53Z) - Soft Expectation and Deep Maximization for Image Feature Detection [68.8204255655161]
We propose SEDM, an iterative semi-supervised learning process that flips the question and first looks for repeatable 3D points, then trains a detector to localize them in image space.
Our results show that this new model trained using SEDM is able to better localize the underlying 3D points in a scene.
arXiv Detail & Related papers (2021-04-21T00:35:32Z) - Recovering and Simulating Pedestrians in the Wild [81.38135735146015]
We propose to recover the shape and motion of pedestrians from sensor readings captured in the wild by a self-driving car driving around.
We incorporate the reconstructed pedestrian assets bank in a realistic 3D simulation system.
We show that the simulated LiDAR data can be used to significantly reduce the amount of real-world data required for visual perception tasks.
arXiv Detail & Related papers (2020-11-16T17:16:32Z) - PSDet: Efficient and Universal Parking Slot Detection [14.085693334348827]
Real-time parking slot detection plays a critical role in valet parking systems.
Existing methods have limited success in real-world applications.
We argue two reasons accounting for the unsatisfactory performance:.
romannumeral1, The available datasets have limited diversity, which causes the low generalization ability.
arXiv Detail & Related papers (2020-05-12T03:06:25Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z) - Trained Trajectory based Automated Parking System using Visual SLAM on
Surround View Cameras [0.0]
We discuss the use cases, design and implementation of a trained trajectory automated parking system.
The proposed system is deployed on commercial vehicles and the consumer application is illustrated.
arXiv Detail & Related papers (2020-01-07T16:43:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.