Pre-Tactical Flight-Delay and Turnaround Forecasting with Synthetic Aviation Data
- URL: http://arxiv.org/abs/2508.02294v1
- Date: Mon, 04 Aug 2025 11:04:15 GMT
- Title: Pre-Tactical Flight-Delay and Turnaround Forecasting with Synthetic Aviation Data
- Authors: Abdulmajid Murad, Massimiliano Ruocco,
- Abstract summary: This paper investigates whether synthetic data can effectively replace real operational data for training machine learning models.<n>We evaluate four state-of-the-art synthetic data generators on three prediction tasks.<n>Our results show that advanced neural network architectures, specifically transformer-based generators, can retain 94-97% of real-data predictive performance.
- Score: 0.18416014644193066
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Access to comprehensive flight operations data remains severely restricted in aviation due to commercial sensitivity and competitive considerations, hindering the development of predictive models for operational planning. This paper investigates whether synthetic data can effectively replace real operational data for training machine learning models in pre-tactical aviation scenarios-predictions made hours to days before operations using only scheduled flight information. We evaluate four state-of-the-art synthetic data generators on three prediction tasks: aircraft turnaround time, departure delays, and arrival delays. Using a Train on Synthetic, Test on Real (TSTR) methodology on over 1.7 million European flight records, we first validate synthetic data quality through fidelity assessments, then assess both predictive performance and the preservation of operational relationships. Our results show that advanced neural network architectures, specifically transformer-based generators, can retain 94-97% of real-data predictive performance while maintaining feature importance patterns informative for operational decision-making. Our analysis reveals that even with real data, prediction accuracy is inherently limited when only scheduled information is available-establishing realistic baselines for pre-tactical forecasting. These findings suggest that high-quality synthetic data can enable broader access to aviation analytics capabilities while preserving commercial confidentiality, though stakeholders must maintain realistic expectations about pre-tactical prediction accuracy given the stochastic nature of flight operations.
Related papers
- Deep learning for predicting hauling fleet production capacity under uncertainties in open pit mines using real and simulated data [0.0]
We propose a deep-learning framework that blends real-world operational records with synthetically generated mechanical-breakdown scenarios.<n>We evaluate two architectures: an XGBoost regressor achieving a median absolute error (MedAE) of 14.3 per cent and a Long Short-Term Memory network with a MedAE of 15.1 per cent.
arXiv Detail & Related papers (2025-06-04T12:12:56Z) - Synthetic Aircraft Trajectory Generation Using Time-Based VQ-VAE [0.18416014644193066]
We propose a novel method for trajectory synthesis by adapting the Time-Based Vector Quantized Variational Autoencoder (TimeVQVAE)<n>We evaluate the adapted TimeVQVAE using an extensive suite of quality, statistical, and distributional metrics, as well as a flyability assessment conducted in an open-source air traffic simulator.<n>Results indicate that TimeVQVAE outperforms a temporal convolution VAalE baseline, generating synthetic trajectories that mirror real flight data in terms of accuracy, temporal consistency, and statistical properties.
arXiv Detail & Related papers (2025-04-12T06:46:51Z) - Graph machine learning for flight delay prediction due to holding manouver [0.0]
This study models the prediction of flight delays due to holding maneuvers as a graph problem.<n>We leverage advanced Graph Machine Learning (Graph ML) techniques to capture complex interdependencies in air traffic networks.<n>We discuss the model's potential operational impact through a web-based tool that allows users to simulate real-time delay predictions.
arXiv Detail & Related papers (2025-02-06T17:18:53Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - Towards an end-to-end artificial intelligence driven global weather forecasting system [57.5191940978886]
We present an AI-based data assimilation model, i.e., Adas, for global weather variables.
We demonstrate that Adas can assimilate global observations to produce high-quality analysis, enabling the system operate stably for long term.
We are the first to apply the methods to real-world scenarios, which is more challenging and has considerable practical application potential.
arXiv Detail & Related papers (2023-12-18T09:05:28Z) - FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation [67.20588721130623]
We develop an AI-based cyclic weather forecasting system, FengWu-4DVar.
FengWu-4DVar can incorporate observational data into the data-driven weather forecasting model.
Experiments on the simulated observational dataset demonstrate that FengWu-4DVar is capable of generating reasonable analysis fields.
arXiv Detail & Related papers (2023-12-16T02:07:56Z) - JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds [79.00975648564483]
Trajectory forecasting models, employed in fields such as robotics, autonomous vehicles, and navigation, face challenges in real-world scenarios.
This dataset provides comprehensive data, including the locations of all agents, scene images, and point clouds, all from the robot's perspective.
The objective is to predict the future positions of agents relative to the robot using raw sensory input data.
arXiv Detail & Related papers (2023-11-05T18:59:31Z) - Learning Generative Models for Climbing Aircraft from Radar Data [0.0]
This paper proposes a generative model for climbing aircraft in which the standard Base of Aircraft Data (BADA) model is enriched by a functional correction to the thrust that is learned from data.
The method offers three features: predictions of the arrival time with 66.3% less error when compared to BADA; generated trajectories that are realistic when compared to test data.
arXiv Detail & Related papers (2023-09-26T13:53:53Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions [53.37679435230207]
We propose DeepVol, a model based on Dilated Causal Convolutions that uses high-frequency data to forecast day-ahead volatility.
Our empirical results suggest that the proposed deep learning-based approach effectively learns global features from high-frequency data.
arXiv Detail & Related papers (2022-09-23T16:13:47Z) - Multi-Airport Delay Prediction with Transformers [0.0]
Temporal Fusion Transformer (TFT) was proposed to predict departure and arrival delays simultaneously for multiple airports.
This approach can capture complex temporal dynamics of the inputs known at the time of prediction and then forecast selected delay metrics up to four hours into the future.
arXiv Detail & Related papers (2021-11-04T21:58:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.