Uncertainty-Aware Delivery Delay Duration Prediction via Multi-Task Deep Learning
- URL: http://arxiv.org/abs/2602.20271v1
- Date: Mon, 23 Feb 2026 19:01:03 GMT
- Title: Uncertainty-Aware Delivery Delay Duration Prediction via Multi-Task Deep Learning
- Authors: Stefan Faulkner, Reza Zandehshahvar, Vahid Eghbal Akhlaghi, Sebastien Ouellet, Carsten Jordan, Pascal Van Hentenryck,
- Abstract summary: This paper introduces a multi-task deep learning model for delivery delay duration prediction in the presence of significant imbalanced data.<n>The proposed model is evaluated on a large-scale real-world dataset from an industrial partner.<n> Experimental results show that the proposed method achieves a mean absolute error of 0.67-0.91 days for delayed-shipment predictions.
- Score: 11.2212153491325
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Accurate delivery delay prediction is critical for maintaining operational efficiency and customer satisfaction across modern supply chains. Yet the increasing complexity of logistics networks, spanning multimodal transportation, cross-country routing, and pronounced regional variability, makes this prediction task inherently challenging. This paper introduces a multi-task deep learning model for delivery delay duration prediction in the presence of significant imbalanced data, where delayed shipments are rare but operationally consequential. The model embeds high-dimensional shipment features with dedicated embedding layers for tabular data, and then uses a classification-then-regression strategy to predict the delivery delay duration for on-time and delayed shipments. Unlike sequential pipelines, this approach enables end-to-end training, improves the detection of delayed cases, and supports probabilistic forecasting for uncertainty-aware decision making. The proposed approach is evaluated on a large-scale real-world dataset from an industrial partner, comprising more than 10 million historical shipment records across four major source locations with distinct regional characteristics. The proposed model is compared with traditional machine learning methods. Experimental results show that the proposed method achieves a mean absolute error of 0.67-0.91 days for delayed-shipment predictions, outperforming single-step tree-based regression baselines by 41-64% and two-step classify-then-regress tree-based models by 15-35%. These gains demonstrate the effectiveness of the proposed model in operational delivery delay forecasting under highly imbalanced and heterogeneous conditions.
Related papers
- Streaming Real-Time Trajectory Prediction Using Endpoint-Aware Modeling [54.94692733670454]
Future trajectories of neighboring traffic agents have a significant influence on the path planning and decision-making of autonomous vehicles.<n>We propose a lightweight yet highly accurate streaming-based trajectory forecasting approach.<n>Our approach significantly reduces inference latency, making it well-suited for real-world deployment.
arXiv Detail & Related papers (2026-03-02T13:44:23Z) - Towards Anytime-Valid Statistical Watermarking [63.02116925616554]
We develop the first e-value-based watermarking framework, Anchored E-Watermarking, that unifies optimal sampling with anytime-valid inference.<n>Our framework can significantly enhance sample efficiency, reducing the average token budget required for detection by 13-15% relative to state-of-the-art baselines.
arXiv Detail & Related papers (2026-02-19T18:32:26Z) - Simulation-Driven Railway Delay Prediction: An Imitation Learning Approach [12.018920884898215]
We introduce Drift-Corrected Imitation Learning (DCIL), a novel self-supervised algorithm that extends DAgger by incorporating distance-based drift correction.<n>We evaluate DCIL using a comprehensive real-world dataset from textscInfrabel, the Belgian railway infrastructure manager.
arXiv Detail & Related papers (2025-12-17T14:06:26Z) - Deep Learning to Identify the Spatio-Temporal Cascading Effects of Train Delays in a High-Density Network [7.850219269698452]
This paper develops and evaluating a novel XGeoAI framework for live, explainable, multi-step train delay forecasting.<n>The core of this work is a two-stage, autoregressive Graph Attention Network (GAT) model, trained on a real-world dataset covering over 40% of the Dutch railway network.<n>To test its viability for live deployment, the model is rigorously evaluated using a sequential, k-step-ahead forecasting protocol.
arXiv Detail & Related papers (2025-10-10T13:03:00Z) - ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving [64.42138266293202]
ResAD is a Normalized Residual Trajectory Modeling framework.<n>It reframes the learning task to predict the residual deviation from an inertial reference.<n>On the NAVSIM benchmark, ResAD achieves a state-of-the-art PDMS of 88.6 using a vanilla diffusion policy.
arXiv Detail & Related papers (2025-10-09T17:59:36Z) - Conformal Predictive Distributions for Order Fulfillment Time Forecasting [15.378087950770684]
This paper introduces a novel framework for distributional forecasting of order fulfillment time.<n>The proposed methods generate competitive distributional forecasts, while machine learning-based point predictions significantly outperform the existing rule-based system.
arXiv Detail & Related papers (2025-05-22T23:23:52Z) - Causally-Aware Spatio-Temporal Multi-Graph Convolution Network for Accurate and Reliable Traffic Prediction [5.200012764049096]
This study focuses on an instance of--temporal problem--traffic prediction--to demonstrate an advanced deep learning model for making accurate and reliable forecast.
We propose an end-to-end traffic prediction framework that leverages three primary components to accurate and reliable traffic predictions.
Experimental results on two real-world traffic datasets demonstrate that the method outperforms several state-of-the-art models in prediction accuracy.
arXiv Detail & Related papers (2024-08-23T14:35:54Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - Certified Human Trajectory Prediction [66.1736456453465]
We propose a certification approach tailored for trajectory prediction that provides guaranteed robustness.<n>To mitigate the inherent performance drop through certification, we propose a diffusion-based trajectory denoiser and integrate it into our method.<n>We demonstrate the accuracy and robustness of the certified predictors and highlight their advantages over the non-certified ones.
arXiv Detail & Related papers (2024-03-20T17:41:35Z) - Streaming Motion Forecasting for Autonomous Driving [71.7468645504988]
We introduce a benchmark that queries future trajectories on streaming data and we refer to it as "streaming forecasting"
Our benchmark inherently captures the disappearance and re-appearance of agents, which is a safety-critical problem yet overlooked by snapshot-based benchmarks.
We propose a plug-and-play meta-algorithm called "Predictive Streamer" that can adapt any snapshot-based forecaster into a streaming forecaster.
arXiv Detail & Related papers (2023-10-02T17:13:16Z) - Multi-Airport Delay Prediction with Transformers [0.0]
Temporal Fusion Transformer (TFT) was proposed to predict departure and arrival delays simultaneously for multiple airports.
This approach can capture complex temporal dynamics of the inputs known at the time of prediction and then forecast selected delay metrics up to four hours into the future.
arXiv Detail & Related papers (2021-11-04T21:58:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.