PPD: A New Valet Parking Pedestrian Fisheye Dataset for Autonomous
Driving
- URL: http://arxiv.org/abs/2309.11002v2
- Date: Mon, 25 Sep 2023 03:36:47 GMT
- Title: PPD: A New Valet Parking Pedestrian Fisheye Dataset for Autonomous
Driving
- Authors: Zizhang Wu, Xinyuan Chen, Fan Song, Yuanzhu Gan, Tianhao Xu, Jian Pu,
Rui Tang
- Abstract summary: Parking Pedestrian dataset consists of several distinctive types of pedestrians captured with fisheye cameras.
We present a pedestrian detection baseline on PPD dataset, and introduce two data augmentation techniques to improve the baseline.
- Score: 18.71208933251644
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pedestrian detection under valet parking scenarios is fundamental for
autonomous driving. However, the presence of pedestrians can be manifested in a
variety of ways and postures under imperfect ambient conditions, which can
adversely affect detection performance. Furthermore, models trained on
publicdatasets that include pedestrians generally provide suboptimal outcomes
for these valet parking scenarios. In this paper, wepresent the Parking
Pedestrian Dataset (PPD), a large-scale fisheye dataset to support research
dealing with real-world pedestrians, especially with occlusions and diverse
postures. PPD consists of several distinctive types of pedestrians captured
with fisheye cameras. Additionally, we present a pedestrian detection baseline
on PPD dataset, and introduce two data augmentation techniques to improve the
baseline by enhancing the diversity ofthe original dataset. Extensive
experiments validate the effectiveness of our novel data augmentation
approaches over baselinesand the dataset's exceptional generalizability.
Related papers
- TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving [3.037642191465275]
We have collected a new dataset comprising over 2.85 million frames using the CARLA simulation environment for the diverse Leaderboard 2.0 challenge scenarios.<n>Our dataset is designed not only for planning tasks but also supports dynamic object detection, lane divider detection, centerline detection, traffic light recognition, prediction tasks and visual language action models.
arXiv Detail & Related papers (2026-02-26T21:16:20Z) - Spatial Retrieval Augmented Autonomous Driving [81.39665750557526]
Existing autonomous driving systems rely on onboard sensors for environmental perception.<n>We propose the spatial retrieval paradigm, introducing offline retrieved geographic images as an additional input.<n>We will open-source dataset curation code, data, and benchmarks for further study of this new autonomous driving paradigm.
arXiv Detail & Related papers (2025-12-07T14:40:49Z) - Advancing Real-World Parking Slot Detection with Large-Scale Dataset and Semi-Supervised Baseline [65.25540269603553]
This study focuses on parking slot detection using surround-view cameras, which offer a comprehensive bird's-eye view of the parking environment.<n>We first construct a large-scale parking slot detection dataset (CRPS-D), which includes various lighting distributions, diverse weather conditions, and challenging parking slot variants.<n>We develop a semi-supervised baseline for parking slot detection, termed SS-PSD, to further improve performance by exploiting unlabeled data.
arXiv Detail & Related papers (2025-09-16T14:50:19Z) - SEPose: A Synthetic Event-based Human Pose Estimation Dataset for Pedestrian Monitoring [20.537672896807063]
SEPose is a synthetic multi-person pose estimation dataset for fixed pedestrian perception generated using dynamic vision sensors in the CARLA simulator.<n>We train existing state-of-the-art models such as RVT and YOLOv8 on our dataset and evaluate them on real event-based data to demonstrate the sim-to-real generalization capabilities of the proposed dataset.
arXiv Detail & Related papers (2025-07-16T04:54:11Z) - Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD [26.011293248078797]
We introduce an Indian driving pedestrian dataset designed to address the complexities of modeling pedestrian behavior in unstructured environments.<n>The dataset provides high-level and detailed low-level comprehensive annotations focused on pedestrians requiring the ego-vehicle's attention.
arXiv Detail & Related papers (2025-06-27T10:41:18Z) - PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments [73.80718037070773]
We present the multi-modal Pedestrian-Focused Scene dataset, rigorously annotated in semi-structured scenes with the format of nuScenes.
We also propose a novel Hybrid Multi-Scale Fusion Network (HMFN) to detect pedestrians in densely populated and occluded scenarios.
arXiv Detail & Related papers (2025-02-21T09:57:53Z) - Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene [56.73568220959019]
Collaborative autonomous driving (CAV) seems like a promising direction, but collecting data for development is non-trivial.
We introduce a novel surrogate to the rescue, which is to generate realistic perception from different viewpoints in a driving scene.
We present the very first solution, using a combination of simulated collaborative data and real ego-car data.
arXiv Detail & Related papers (2025-02-10T17:07:53Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from
Traffic Monitoring Cameras [26.457695296042903]
We propose SKoPe3D, a unique synthetic vehicle keypoint dataset from a roadside perspective.
SKoPe3D contains over 150k vehicle instances and 4.9 million keypoints.
Our experiments highlight the dataset's applicability and the potential for knowledge transfer between synthetic and real-world data.
arXiv Detail & Related papers (2023-09-04T02:57:30Z) - Surround-view Fisheye BEV-Perception for Valet Parking: Dataset,
Baseline and Distortion-insensitive Multi-task Framework [14.165082243555988]
Surround-view fisheye perception under valet parking scenes is fundamental and crucial in autonomous driving.
We introduce a new large-scale fisheye dataset called Fisheye Parking dataset.
We also propose our real-time distortion-insensitive multi-task framework Fisheye Perception Network (FPNet)
arXiv Detail & Related papers (2022-12-08T07:06:08Z) - DOLPHINS: Dataset for Collaborative Perception enabled Harmonious and
Interconnected Self-driving [19.66714697653504]
Vehicle-to-Everything (V2X) network has enabled collaborative perception in autonomous driving.
The lack of datasets has severely blocked the development of collaborative perception algorithms.
We release DOLPHINS: dataset for cOllaborative Perception enabled Harmonious and INterconnected Self-driving.
arXiv Detail & Related papers (2022-07-15T17:07:07Z) - SurroundDepth: Entangling Surrounding Views for Self-Supervised
Multi-Camera Depth Estimation [101.55622133406446]
We propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.
Specifically, we employ a joint network to process all the surrounding views and propose a cross-view transformer to effectively fuse the information from multiple views.
In experiments, our method achieves the state-of-the-art performance on the challenging multi-camera depth estimation datasets.
arXiv Detail & Related papers (2022-04-07T17:58:47Z) - Pedestrian Detection: Domain Generalization, CNNs, Transformers and
Beyond [82.37430109152383]
We show that, current pedestrian detectors poorly handle even small domain shifts in cross-dataset evaluation.
We attribute the limited generalization to two main factors, the method and the current sources of data.
We propose a progressive fine-tuning strategy which improves generalization.
arXiv Detail & Related papers (2022-01-10T06:00:26Z) - PSI: A Pedestrian Behavior Dataset for Socially Intelligent Autonomous
Car [47.01116716025731]
This paper proposes and shares another benchmark dataset called the IUPUI-CSRC Pedestrian Situated Intent (PSI) data.
The first novel label is the dynamic intent changes for the pedestrians to cross in front of the ego-vehicle, achieved from 24 drivers.
The second one is the text-based explanations of the driver reasoning process when estimating pedestrian intents and predicting their behaviors.
arXiv Detail & Related papers (2021-12-05T15:54:57Z) - Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data
Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images.
Our approach is fully automatic without any human interaction.
We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z) - Detecting 32 Pedestrian Attributes for Autonomous Vehicles [103.87351701138554]
In this paper, we address the problem of jointly detecting pedestrians and recognizing 32 pedestrian attributes.
We introduce a Multi-Task Learning (MTL) model relying on a composite field framework, which achieves both goals in an efficient way.
We show competitive detection and attribute recognition results, as well as a more stable MTL training.
arXiv Detail & Related papers (2020-12-04T15:10:12Z) - The P-DESTRE: A Fully Annotated Dataset for Pedestrian Detection,
Tracking, Re-Identification and Search from Aerial Devices [7.095987222706225]
This paper introduces the P-DESTRE dataset, which is the first of its kind to provide consistent ID annotations across multiple days.
We also compare the results attained by state-of-the-art pedestrian detection, tracking, reidentification and search techniques in well-known surveillance datasets.
The dataset and the full details of the empirical evaluation carried out are freely available at http://p-destre.di.ubi.pt/.
arXiv Detail & Related papers (2020-04-06T16:17:32Z) - Generalizable Pedestrian Detection: The Elephant In The Room [82.37430109152383]
We find that existing state-of-the-art pedestrian detectors, though perform quite well when trained and tested on the same dataset, generalize poorly in cross dataset evaluation.
We illustrate that diverse and dense datasets, collected by crawling the web, serve to be an efficient source of pre-training for pedestrian detection.
arXiv Detail & Related papers (2020-03-19T14:14:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.