Aeolus: A Multi-structural Flight Delay Dataset
- URL: http://arxiv.org/abs/2510.26616v2
- Date: Fri, 31 Oct 2025 08:49:49 GMT
- Title: Aeolus: A Multi-structural Flight Delay Dataset
- Authors: Lin Xu, Xinyun Yuan, Yuxuan Liang, Suwan Yin, Yuankai Wu,
- Abstract summary: Aeolus is a large-scale Multi-modal Flight Delay dataset.<n>It is designed to advance research on flight delay prediction.<n>Aeolus supports a broad range of tasks, including regression, classification, temporal structure modeling, and graph learning.
- Score: 41.17078926798434
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce Aeolus, a large-scale Multi-modal Flight Delay Dataset designed to advance research on flight delay prediction and support the development of foundation models for tabular data. Existing datasets in this domain are typically limited to flat tabular structures and fail to capture the spatiotemporal dynamics inherent in delay propagation. Aeolus addresses this limitation by providing three aligned modalities: (i) a tabular dataset with rich operational, meteorological, and airportlevel features for over 50 million flights; (ii) a flight chain module that models delay propagation along sequential flight legs, capturing upstream and downstream dependencies; and (iii) a flight network graph that encodes shared aircraft, crew, and airport resource connections, enabling cross-flight relational reasoning. The dataset is carefully constructed with temporal splits, comprehensive features, and strict leakage prevention to support realistic and reproducible machine learning evaluation. Aeolus supports a broad range of tasks, including regression, classification, temporal structure modeling, and graph learning, serving as a unified benchmark across tabular, sequential, and graph modalities. We release baseline experiments and preprocessing tools to facilitate adoption. Aeolus fills a key gap for both domain-specific modeling and general-purpose structured data research.Our source code and data can be accessed at https://github.com/Flnny/Delay-data
Related papers
- Time-TK: A Multi-Offset Temporal Interaction Framework Combining Transformer and Kolmogorov-Arnold Networks for Time Series Forecasting [6.1337977581640075]
Existing methods typically employ a strategy of embedding each time step as an independent token.<n>Time-TK significantly outperforms all baseline models, achieving state-of-the-art forecasting accuracy.
arXiv Detail & Related papers (2026-01-30T10:11:51Z) - Estimating Time Series Foundation Model Transferability via In-Context Learning [74.65355820906355]
Time series foundation models (TSFMs) offer strong zero-shot forecasting via large-scale pre-training.<n>Fine-tuning remains critical for boosting performance in domains with limited public data.<n>We introduce TimeTic, a transferability estimation framework that recasts model selection as an in-context-learning problem.
arXiv Detail & Related papers (2025-09-28T07:07:13Z) - Adaptive Fuzzy Time Series Forecasting via Partially Asymmetric Convolution and Sub-Sliding Window Fusion [0.0]
We propose a novel convolutional architecture with partially asymmetric design based on the time of sliding window.<n>The proposed method achieves state-of-the-art results on most of popular time series datasets.
arXiv Detail & Related papers (2025-07-28T08:58:25Z) - Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z) - Deciphering Air Travel Disruptions: A Machine Learning Approach [0.0]
This research investigates flight delay trends by examining factors such as departure time, airline, and airport.
It employs regression machine learning methods to predict the contributions of various sources to delays.
arXiv Detail & Related papers (2024-08-05T19:45:07Z) - FlightPatchNet: Multi-Scale Patch Network with Differential Coding for Flight Trajectory Prediction [10.891498306893103]
We propose FlightPatchNet, a multi-scale patch network with differential coding for flight trajectory prediction.<n>To fully explore the diverse temporal patterns in flight trajectories, a multi-scale patch network is delicately designed to serve as the backbone.
arXiv Detail & Related papers (2024-05-25T12:15:56Z) - Pushing the Limits of Pre-training for Time Series Forecasting in the
CloudOps Domain [54.67888148566323]
We introduce three large-scale time series forecasting datasets from the cloud operations domain.
We show it is a strong zero-shot baseline and benefits from further scaling, both in model and dataset size.
Accompanying these datasets and results is a suite of comprehensive benchmark results comparing classical and deep learning baselines to our pre-trained method.
arXiv Detail & Related papers (2023-10-08T08:09:51Z) - Spatiotemporal Propagation Learning for Network-Wide Flight Delay
Prediction [17.632313431251383]
We propose Spatiotemporal Network (STP), a space-time separable convolutional network, which is novel capturing temporal dependency modeling.
From the aspect of relation of temporal dependency modeling, we propose a multi-head self-attentional that can be learned end-to-end and explicitly reason multiple kinds of temporal dependency delay time.
arXiv Detail & Related papers (2022-07-14T14:30:59Z) - ASAP-Net: Attention and Structure Aware Point Cloud Sequence
Segmentation [49.15948235059343]
We further improve point-temporal cloud feature with a flexible module called ASAP.
Our ASAP module contains an attentive temporal embedding layer to fuse the relatively informative local features across frames in a recurrent fashion.
We show the generalization ability of the proposed ASAP module with different computation backbone networks for point cloud sequence segmentation.
arXiv Detail & Related papers (2020-08-12T07:37:16Z) - Connecting the Dots: Multivariate Time Series Forecasting with Graph
Neural Networks [91.65637773358347]
We propose a general graph neural network framework designed specifically for multivariate time series data.
Our approach automatically extracts the uni-directed relations among variables through a graph learning module.
Our proposed model outperforms the state-of-the-art baseline methods on 3 of 4 benchmark datasets.
arXiv Detail & Related papers (2020-05-24T04:02:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.