Time-to-Injury Forecasting in Elite Female Football: A DeepHit Survival Approach
- URL: http://arxiv.org/abs/2601.19479v1
- Date: Tue, 27 Jan 2026 11:11:52 GMT
- Title: Time-to-Injury Forecasting in Elite Female Football: A DeepHit Survival Approach
- Authors: Victoria Catterall, Cise Midoglu, Stephen Lynch,
- Abstract summary: This study investigates the feasibility of using a DeepHit neural network to forecast time-to-injury from longitudinal athlete monitoring data.<n>The analysis utilised the publicly available SoccerMon dataset, containing two seasons of training, match, and wellness records from elite female footballers.
- Score: 0.5980822697955565
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Injury occurrence in football poses significant challenges for athletes and teams, carrying personal, competitive, and financial consequences. While machine learning has been applied to injury prediction before, existing approaches often rely on static pre-season data and binary outcomes, limiting their real-world utility. This study investigates the feasibility of using a DeepHit neural network to forecast time-to-injury from longitudinal athlete monitoring data, while providing interpretable predictions. The analysis utilised the publicly available SoccerMon dataset, containing two seasons of training, match, and wellness records from elite female footballers. Data was pre-processed through cleaning, feature engineering, and the application of three imputation strategies. Baseline models (Random Forest, XGBoost, Logistic Regression) were optimised via grid search for benchmarking, while the DeepHit model, implemented with a multilayer perceptron backbone, was evaluated using chronological and leave-one-player-out (LOPO) validation. DeepHit achieved a concordance index of 0.762, outperforming baseline models and delivering individualised, time-varying risk estimates. Shapley Additive Explanations (SHAP) identified clinically relevant predictors consistent with established risk factors, enhancing interpretability. Overall, this study provides a novel proof of concept: survival modelling with DeepHit shows strong potential to advance injury forecasting in football, offering accurate, explainable, and actionable insights for injury prevention across competitive levels.
Related papers
- Finding Pre-Injury Patterns in Triathletes from Lifestyle, Recovery and Load Dynamics Features [0.7448254811651419]
Triathlon training places athletes at substantial risk for overuse injuries due to repetitive physiological stress.<n>Current injury prediction approaches rely on training load metrics, neglecting critical factors such as sleep quality, stress, and individual lifestyle patterns.<n>We introduce a novel synthetic data generation framework tailored explicitly for triathlon training.
arXiv Detail & Related papers (2025-11-18T08:48:37Z) - Accuracy Law for the Future of Deep Time Series Forecasting [65.46625911002202]
Time series forecasting inherently faces a non-zero error lower bound due to its partially observable and uncertain nature.<n>This paper focuses on a fundamental question: how to estimate the performance upper bound of deep time series forecasting.<n>Based on rigorous statistical tests of over 2,800 newly trained deep forecasters, we discover a significant exponential relationship between the minimum forecasting error of deep models and the complexity of window-wise series patterns.
arXiv Detail & Related papers (2025-10-03T05:18:47Z) - Deep operator network models for predicting post-burn contraction [41.94295877935867]
Burn injuries present a significant global health challenge.
Understanding and predicting the evolution of post-burn wounds is essential for developing effective treatment strategies.
Recent advancements in machine learning, particularly in deep learning, offer promising alternatives for accelerating these predictions.
arXiv Detail & Related papers (2024-11-21T19:57:29Z) - RNN-BOF: A Multivariate Global Recurrent Neural Network for Binary
Outcome Forecasting of Inpatient Aggression [3.789219860006095]
We propose modelling a patient's future risk using a time series methodology that learns from longitudinal data.
We use a moving window training scheme on a real world dataset of 83 patients, where the main binary time series represents the presence of aggressive events.
arXiv Detail & Related papers (2023-12-02T04:42:22Z) - Performative Time-Series Forecasting [64.03865043422597]
We formalize performative time-series forecasting (PeTS) from a machine-learning perspective.<n>We propose a novel approach, Feature Performative-Shifting (FPS), which leverages the concept of delayed response to anticipate distribution shifts.<n>We conduct comprehensive experiments using multiple time-series models on COVID-19 and traffic forecasting tasks.
arXiv Detail & Related papers (2023-10-09T18:34:29Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - CenTime: Event-Conditional Modelling of Censoring in Survival Analysis [49.44664144472712]
We introduce CenTime, a novel approach to survival analysis that directly estimates the time to event.
Our method features an innovative event-conditional censoring mechanism that performs robustly even when uncensored data is scarce.
Our results indicate that CenTime offers state-of-the-art performance in predicting time-to-death while maintaining comparable ranking performance.
arXiv Detail & Related papers (2023-09-07T17:07:33Z) - Transformer-based Time-to-Event Prediction for Chronic Kidney Disease
Deterioration [0.0]
STRAFE is a generalizable survival analysis transformer-based architecture for electronic health records.
The performance of STRAFE was evaluated using a real-world claim dataset of over 130,000 individuals with stage 3 chronic kidney disease.
STRAFE predictions can improve the positive predictive value of high-risk patients by 3-fold, demonstrating possible usage to improve targeting for intervention programs.
arXiv Detail & Related papers (2023-06-09T09:46:38Z) - Predicting Seriousness of Injury in a Traffic Accident: A New Imbalanced
Dataset and Benchmark [62.997667081978825]
The paper introduces a new dataset to assess the performance of machine learning algorithms in the prediction of the seriousness of injury in a traffic accident.
The dataset is created by aggregating publicly available datasets from the UK Department for Transport.
arXiv Detail & Related papers (2022-05-20T21:15:26Z) - Evaluation of soccer team defense based on prediction models of ball
recovery and being attacked [0.8921166277011345]
We propose a method to evaluate team defense based on the prediction of ball recovery and being attacked.
Using data from 45 soccer matches, we examined the relationship between the proposed index and team performance.
arXiv Detail & Related papers (2021-03-17T13:15:41Z) - Adversarial Refinement Network for Human Motion Prediction [61.50462663314644]
Two popular methods, recurrent neural networks and feed-forward deep networks, are able to predict rough motion trend.
We propose an Adversarial Refinement Network (ARNet) following a simple yet effective coarse-to-fine mechanism with novel adversarial error augmentation.
arXiv Detail & Related papers (2020-11-23T05:42:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.