On Designing Day Ahead and Same Day Ridership Level Prediction Models
for City-Scale Transit Networks Using Noisy APC Data
- URL: http://arxiv.org/abs/2210.04989v1
- Date: Mon, 10 Oct 2022 19:50:59 GMT
- Title: On Designing Day Ahead and Same Day Ridership Level Prediction Models
for City-Scale Transit Networks Using Noisy APC Data
- Authors: Jose Paolo Talusan (1), Ayan Mukhopadhyay (1), Dan Freudberg (2),
Abhishek Dubey (1) ((1) Vanderbilt University, (2) Nashville Metropolitan
Transit Authority)
- Abstract summary: We propose the use and fusion of data from multiple sources, cleaned, processed, and merged together, for use in training machine learning models to predict transit ridership.
We evaluate our approach on real-world transit data provided by the public transit agency of Nashville, TN.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The ability to accurately predict public transit ridership demand benefits
passengers and transit agencies. Agencies will be able to reallocate buses to
handle under or over-utilized bus routes, improving resource utilization, and
passengers will be able to adjust and plan their schedules to avoid overcrowded
buses and maintain a certain level of comfort. However, accurately predicting
occupancy is a non-trivial task. Various reasons such as heterogeneity,
evolving ridership patterns, exogenous events like weather, and other
stochastic variables, make the task much more challenging. With the progress of
big data, transit authorities now have access to real-time passenger occupancy
information for their vehicles. The amount of data generated is staggering.
While there is no shortage in data, it must still be cleaned, processed,
augmented, and merged before any useful information can be generated. In this
paper, we propose the use and fusion of data from multiple sources, cleaned,
processed, and merged together, for use in training machine learning models to
predict transit ridership. We use data that spans a 2-year period (2020-2022)
incorporating transit, weather, traffic, and calendar data. The resulting data,
which equates to 17 million observations, is used to train separate models for
the trip and stop level prediction. We evaluate our approach on real-world
transit data provided by the public transit agency of Nashville, TN. We
demonstrate that the trip level model based on Xgboost and the stop level model
based on LSTM outperform the baseline statistical model across the entire
transit service day.
Related papers
- BjTT: A Large-scale Multimodal Dataset for Traffic Prediction [49.93028461584377]
Traditional traffic prediction methods rely on historical traffic data to predict traffic trends.
In this work, we explore how generative models combined with text describing the traffic system can be applied for traffic generation.
We propose ChatTraffic, the first diffusion model for text-to-traffic generation.
arXiv Detail & Related papers (2024-03-08T04:19:56Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - A Dynamic Model for Bus Arrival Time Estimation based on Spatial
Patterns using Machine Learning [1.2891210250935146]
Bus arrival prediction model is proposed for forecasting the arrival time using limited data sets.
One of the routes of Tumakuru city service, Tumakuru, India, is selected and divided into two spatial patterns.
A model to dynamically predict bus arrival time is developed using the preceding trip information and the machine learning model to estimate the arrival time at a downstream bus stop.
arXiv Detail & Related papers (2022-10-03T06:35:03Z) - Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion [87.77727495366702]
We introduce the new task of pedestrian stop and go forecasting.
Considering the lack of suitable existing datasets for it, we release TRANS, a benchmark for explicitly studying the stop and go behaviors of pedestrians in urban traffic.
We build it from several existing datasets annotated with pedestrians' walking motions, in order to have various scenarios and behaviors.
arXiv Detail & Related papers (2022-03-04T18:39:31Z) - Identifying the Factors that Influence Urban Public Transit Demand [0.0]
The rise in urbanization throughout the United States (US) in recent years has required urban planners and transportation engineers to have greater consideration for the transportation services available to residents of a metropolitan region.
These improvements can be achieved by identifying and understanding the factors that influence urban public transit demand.
Common factors that can influence urban public transit demand can be internal and/or external factors.
arXiv Detail & Related papers (2021-11-16T05:37:51Z) - Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers [126.81938540470847]
We propose Euro-PVI, a dataset of pedestrian and bicyclist trajectories.
In this work, we develop a joint inference model that learns an expressive multi-modal shared latent space across agents in the urban scene.
We achieve state of the art results on the nuScenes and Euro-PVI datasets demonstrating the importance of capturing interactions between ego-vehicle and pedestrians (bicyclists) for accurate predictions.
arXiv Detail & Related papers (2021-06-22T15:40:21Z) - Public Transit for Special Events: Ridership Prediction and Train
Optimization [10.531110013870792]
It is important for transit providers to understand their impact on disruptions, delays, and fare revenues.
This paper proposes a suite of data-driven techniques for evaluating, anticipating, and managing the performance of transit systems during recurring congestion peaks due to special events.
arXiv Detail & Related papers (2021-06-09T19:52:18Z) - Injecting Knowledge in Data-driven Vehicle Trajectory Predictors [82.91398970736391]
Vehicle trajectory prediction tasks have been commonly tackled from two perspectives: knowledge-driven or data-driven.
In this paper, we propose to learn a "Realistic Residual Block" (RRB) which effectively connects these two perspectives.
Our proposed method outputs realistic predictions by confining the residual range and taking into account its uncertainty.
arXiv Detail & Related papers (2021-03-08T16:03:09Z) - Crowding Prediction of In-Situ Metro Passengers Using Smart Card Data [11.781685156308475]
We propose a statistical model to predict in-situ passenger density inside a closed metro system.
Based on the prediction results, we are able to provide accurate prediction of in-situ passenger density for a future time point.
arXiv Detail & Related papers (2020-09-07T04:07:37Z) - Study on Key Technologies of Transit Passengers Travel Pattern Mining
and Applications based on Multiple Sources of Data [1.370633147306388]
We propose a series of methodologies to mine transit riders travel pattern and behavioral preferences.
We use these knowledges to adjust and optimize the transit systems.
arXiv Detail & Related papers (2020-05-26T22:35:28Z) - Physical-Virtual Collaboration Modeling for Intra-and Inter-Station
Metro Ridership Prediction [116.66657468425645]
We propose a unified Physical-Virtual Collaboration Graph Network (PVCGN), which can effectively learn the complex ridership patterns from the tailor-designed graphs.
Specifically, a physical graph is directly built based on the realistic topology of the studied metro system.
A similarity graph and a correlation graph are built with virtual topologies under the guidance of the inter-station passenger flow similarity and correlation.
arXiv Detail & Related papers (2020-01-14T16:47:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.