Evaluating the effects of Data Sparsity on the Link-level Bicycling Volume Estimation: A Graph Convolutional Neural Network Approach
- URL: http://arxiv.org/abs/2410.08522v2
- Date: Thu, 27 Mar 2025 08:18:23 GMT
- Title: Evaluating the effects of Data Sparsity on the Link-level Bicycling Volume Estimation: A Graph Convolutional Neural Network Approach
- Authors: Mohit Gupta, Debjit Bhowmick, Meead Saberi, Shirui Pan, Ben Beck,
- Abstract summary: We present the first study to utilize a Graph Convolutional Network (GCN) architecture to model link-level bicycling volumes.<n>We benchmark it against traditional machine learning models, such as linear regression, support vector machines, and random forest.<n>Our results show that the GCN model outperforms these traditional models in predicting Annual Average Daily Bicycle (AADB) counts.
- Score: 54.84957282120537
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate bicycling volume estimation is crucial for making informed decisions and planning about future investments in bicycling infrastructure. However, traditional link-level volume estimation models are effective for motorized traffic but face significant challenges when applied to the bicycling context because of sparse data and the intricate nature of bicycling mobility patterns. To the best of our knowledge, we present the first study to utilize a Graph Convolutional Network (GCN) architecture to model link-level bicycling volumes and systematically investigate the impact of varying levels of data sparsity (0%--99%) on model performance, simulating real-world scenarios. We have leveraged Strava Metro data as the primary source of bicycling counts across 15,933 road segments/links in the City of Melbourne, Australia. To evaluate the effectiveness of the GCN model, we benchmark it against traditional machine learning models, such as linear regression, support vector machines, and random forest. Our results show that the GCN model outperforms these traditional models in predicting Annual Average Daily Bicycle (AADB) counts, demonstrating its ability to capture the spatial dependencies inherent in bicycle traffic networks. While GCN remains robust up to 80% sparsity, its performance declines sharply beyond this threshold, highlighting the challenges of extreme data sparsity. These findings underscore the potential of GCNs in enhancing bicycling volume estimation, while also emphasizing the need for further research on methods to improve model resilience under high-sparsity conditions. Our findings offer valuable insights for city planners aiming to improve bicycling infrastructure and promote sustainable transportation.
Related papers
- Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation [67.23953699167274]
Self-supervised learning (SSL) has enabled the development of vision foundation models for Earth Observation (EO)
In EO, this challenge is amplified by the redundancy and heavy-tailed distributions common in satellite imagery.
We propose a dynamic dataset pruning strategy designed to improve SSL pre-training by maximizing dataset diversity and balance.
arXiv Detail & Related papers (2025-04-09T15:13:26Z) - Improving Traffic Flow Predictions with SGCN-LSTM: A Hybrid Model for Spatial and Temporal Dependencies [55.2480439325792]
This paper introduces the Signal-Enhanced Graph Convolutional Network Long Short Term Memory (SGCN-LSTM) model for predicting traffic speeds across road networks.
Experiments on the PEMS-BAY road network traffic dataset demonstrate the SGCN-LSTM model's effectiveness.
arXiv Detail & Related papers (2024-11-01T00:37:00Z) - Modeling Large-Scale Walking and Cycling Networks: A Machine Learning Approach Using Mobile Phone and Crowdsourced Data [0.0]
We develop and apply a machine learning based modeling approach for estimating daily walking and cycling volumes across a large-scale regional network in New South Wales, Australia.
The study discusses the unique challenges and limitations related to all three aspects of model training, testing, and inference.
arXiv Detail & Related papers (2024-03-29T21:37:23Z) - Data-driven Energy Consumption Modelling for Electric Micromobility using an Open Dataset [6.000804135802873]
This paper presents an open dataset for energy modelling research related to E-Scooters and E-Bikes.
We provide a comprehensive analysis of energy consumption modelling based on the dataset using a set of representative machine learning algorithms.
Our results demonstrate a notable advantage for data-driven models in comparison to the corresponding mathematical models for estimating energy consumption.
arXiv Detail & Related papers (2024-03-26T12:08:05Z) - Bridging the Sim-to-Real Gap with Bayesian Inference [53.61496586090384]
We present SIM-FSVGD for learning robot dynamics from data.
We use low-fidelity physical priors to regularize the training of neural network models.
We demonstrate the effectiveness of SIM-FSVGD in bridging the sim-to-real gap on a high-performance RC racecar system.
arXiv Detail & Related papers (2024-03-25T11:29:32Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - Revisiting Random Forests in a Comparative Evaluation of Graph
Convolutional Neural Network Variants for Traffic Prediction [15.248412426672694]
Graph convolutional neural networks (GCNNs) have become the prevailing models in the traffic prediction literature.
In this work, we classify the components of successful GCNN prediction models and analyze the effects of factorization, attention mechanism, and weight sharing on their performance.
arXiv Detail & Related papers (2023-05-30T00:50:51Z) - BikeDNA: A Tool for Bicycle Infrastructure Data & Network Assessment [0.0]
BikeDNA is an open-source tool for reproducible quality assessment of bicycle infrastructure data.
BikeDNA supports quality assessments of bicycle infrastructure data for a wide range of applications.
arXiv Detail & Related papers (2023-03-02T13:06:59Z) - Predicting Citi Bike Demand Evolution Using Dynamic Graphs [81.12174591442479]
We apply a graph neural network model to predict bike demand in the New York City, Citi Bike dataset.
In this paper, we attempt to apply a graph neural network model to predict bike demand in the New York City, Citi Bike dataset.
arXiv Detail & Related papers (2022-12-18T21:43:27Z) - Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge
Transfer [58.6106391721944]
Cross-city knowledge has shown its promise, where the model learned from data-sufficient cities is leveraged to benefit the learning process of data-scarce cities.
We propose a model-agnostic few-shot learning framework for S-temporal graph called ST-GFSL.
We conduct comprehensive experiments on four traffic speed prediction benchmarks and the results demonstrate the effectiveness of ST-GFSL compared with state-of-the-art methods.
arXiv Detail & Related papers (2022-05-27T12:46:52Z) - LHNN: Lattice Hypergraph Neural Network for VLSI Congestion Prediction [70.31656245793302]
lattice hypergraph (LH-graph) is a novel graph formulation for circuits.
LHNN constantly achieves more than 35% improvements compared with U-nets and Pix2Pix on the F1 score.
arXiv Detail & Related papers (2022-03-24T03:31:18Z) - Automated Detection of Missing Links in Bicycle Networks [0.15293427903448023]
We develop the IPDC procedure (Identify, Prioritize, Decluster, Classify) for finding the most important missing links in urban bicycle networks.
We first identify all possible gaps following a multiplex network approach, prioritize them according to a flow-based metric, decluster emerging gap clusters, and manually classify the types of gaps.
Our results show how network analysis with minimal data requirements can serve as a cost-efficient support tool for bicycle network planning.
arXiv Detail & Related papers (2022-01-10T15:35:14Z) - A Cluster-Based Trip Prediction Graph Neural Network Model for Bike
Sharing Systems [2.1423963702744597]
Bike Sharing Systems (BSSs) are emerging as an innovative transportation service.
Ensuring the proper functioning of a BSS is crucial given that these systems are committed to eradicating many of the current global concerns.
Good knowledge of users' transition patterns is a decisive contribution to the quality and operability of the service.
arXiv Detail & Related papers (2022-01-03T15:47:40Z) - Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers [126.81938540470847]
We propose Euro-PVI, a dataset of pedestrian and bicyclist trajectories.
In this work, we develop a joint inference model that learns an expressive multi-modal shared latent space across agents in the urban scene.
We achieve state of the art results on the nuScenes and Euro-PVI datasets demonstrating the importance of capturing interactions between ego-vehicle and pedestrians (bicyclists) for accurate predictions.
arXiv Detail & Related papers (2021-06-22T15:40:21Z) - A Comparative Study of Using Spatial-Temporal Graph Convolutional
Networks for Predicting Availability in Bike Sharing Schemes [13.819341724635319]
We present an Attention-based ST-GCN (AST-GCN) for predicting the number of available bikes in bike-sharing systems in cities.
Our experimental results are presented using two real-world datasets, Dublinbikes and NYC-Citi Bike.
arXiv Detail & Related papers (2021-04-21T17:13:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.