City-scale synthetic individual-level vehicle trip data
- URL: http://arxiv.org/abs/2206.03273v3
- Date: Wed, 1 Feb 2023 09:11:57 GMT
- Title: City-scale synthetic individual-level vehicle trip data
- Authors: Guilong Li, Yixian Chen, Yimin Wang, Zhi Yu, Peilin Nie, Zhaocheng He
- Abstract summary: We produce a city-scale synthetic individual-level vehicle trip dataset by generating for each individual based on the historical trip data.
The result shows that the synthetic data is consistent with the real data on the aggregated level and reasonable from the individual perspective.
- Score: 5.191606231133951
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Trip data that records each vehicle's trip activity on the road network
describes the operation of urban traffic from the individual perspective, and
it is extremely valuable for transportation research. However, restricted by
data privacy, the trip data of individual-level cannot be opened for all
researchers, while the need for it is very urgent. In this paper, we produce a
city-scale synthetic individual-level vehicle trip dataset by generating for
each individual based on the historical trip data, where the availability and
trip data privacy protection are balanced. Privacy protection inevitably
affects the availability of data. Therefore, we have conducted numerous
experiments to demonstrate the performance and reliability of the synthetic
data in different dimensions and at different granularities to help users
properly judge the tasks it can perform. The result shows that the synthetic
data is consistent with the real data (i.e., historical data) on the aggregated
level and reasonable from the individual perspective.
Related papers
- A Systematic Evaluation of Generative Models on Tabular Transportation Data [14.566059618333155]
We use New York City taxi data as a case study to evaluate the performance of widely used data generative models.
We introduce an improved privacy metric to address the limitations of the commonly-used one.
This work underscores the potential need to develop generative models specifically tailored to take advantage of the unique characteristics of emerging domains, such as transportation.
arXiv Detail & Related papers (2025-02-13T00:14:55Z) - Investigating Vulnerabilities of GPS Trip Data to Trajectory-User Linking Attacks [49.1574468325115]
We propose a novel attack to reconstruct user identifiers in GPS trip datasets consisting of single trips.
We show that the risk of re-identification is significant even when personal identifiers have been removed.
Further investigations indicate that users who frequently visit locations that are only visited by a small number of others tend to be more vulnerable to re-identification.
arXiv Detail & Related papers (2025-02-12T08:54:49Z) - Traffic and Safety Rule Compliance of Humans in Diverse Driving Situations [48.924085579865334]
Analyzing human data is crucial for developing autonomous systems that replicate safe driving practices.
This paper presents a comparative evaluation of human compliance with traffic and safety rules across multiple trajectory prediction datasets.
arXiv Detail & Related papers (2024-11-04T09:21:00Z) - Urban Mobility Assessment Using LLMs [19.591156495742922]
This work proposes an innovative AI-based approach for synthesizing travel surveys by prompting large language models (LLMs)
Our study evaluates the effectiveness of this approach across various U.S. metropolitan areas by comparing the results against existing survey data at different levels.
arXiv Detail & Related papers (2024-08-22T19:17:33Z) - Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios [49.1574468325115]
We evaluate the utility of five state-of-the-art synthesis approaches in terms of real-world applicability.
We focus on so-called trip data that encode fine granular urban movements such as GPS-tracked taxi rides.
One model fails to produce data within reasonable time and another generates too many jumps to meet the requirements for map matching.
arXiv Detail & Related papers (2024-07-03T16:08:05Z) - Best Practices and Lessons Learned on Synthetic Data [83.63271573197026]
The success of AI models relies on the availability of large, diverse, and high-quality datasets.
Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns.
arXiv Detail & Related papers (2024-04-11T06:34:17Z) - Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data
Generation and Evaluation in Learning Analytics [0.412484724941528]
Privacy poses a significant obstacle to the progress of learning analytics (LA), presenting challenges like inadequate anonymization and data misuse.
Synthetic data emerges as a potential remedy, offering robust privacy protection.
Prior LA research on synthetic data lacks thorough evaluation, essential for assessing the delicate balance between privacy and data utility.
arXiv Detail & Related papers (2024-01-12T20:27:55Z) - Synthetic Datasets for Autonomous Driving: A Survey [13.287734271923565]
It is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs.
More and more researchers are turning to synthetic datasets to easily generate rich and changeable data.
arXiv Detail & Related papers (2023-04-24T15:46:10Z) - Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic
Data [91.52783572568214]
Synthetic data may become a dominant force in the machine learning world, promising a future where datasets can be tailored to individual needs.
We discuss which fundamental challenges the community needs to overcome for wider relevance and application of synthetic data.
arXiv Detail & Related papers (2023-04-07T16:38:40Z) - Urban Vehicle Mobility Characteristic Mining and Trip Generation Based
on Knowledge Graph [14.032480351768395]
We introduce the knowledge graph for the study to handle individual-level urban vehicle trip big data better.
For organization of individual level trip data, we designed and constructed an individual-level trip knowledge graph.
We propose an individual-level trip synthesis method based on knowledge graph generation to address the privacy issue of individual-level traffic data.
arXiv Detail & Related papers (2022-02-19T04:13:17Z) - An Experimental Urban Case Study with Various Data Sources and a Model
for Traffic Estimation [65.28133251370055]
We organize an experimental campaign with video measurement in an area within the urban network of Zurich, Switzerland.
We focus on capturing the traffic state in terms of traffic flow and travel times by ensuring measurements from established thermal cameras.
We propose a simple yet efficient Multiple Linear Regression (MLR) model to estimate travel times with fusion of various data sources.
arXiv Detail & Related papers (2021-08-02T08:13:57Z) - Measuring Utility and Privacy of Synthetic Genomic Data [3.635321290763711]
We provide the first evaluation of the utility and the privacy protection of five state-of-the-art models for generating synthetic genomic data.
Overall, there is no single approach for generating synthetic genomic data that performs well across the board.
arXiv Detail & Related papers (2021-02-05T17:41:01Z) - End-to-end Interpretable Neural Motion Planner [78.69295676456085]
We propose a neural motion planner (NMP) for learning to drive autonomously in complex urban scenarios.
We design a holistic model that takes as input raw LIDAR data and a HD map and produces interpretable intermediate representations.
We demonstrate the effectiveness of our approach in real-world driving data captured in several cities in North America.
arXiv Detail & Related papers (2021-01-17T14:16:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.