Evaluating Time-Dependent Methods and Seasonal Effects in Code Technical Debt Prediction
- URL: http://arxiv.org/abs/2408.08095v2
- Date: Thu, 19 Jun 2025 11:21:02 GMT
- Title: Evaluating Time-Dependent Methods and Seasonal Effects in Code Technical Debt Prediction
- Authors: Mikel Robredo, Nyyti Saarimaki, Matteo Esposito, Davide Taibi, Rafael Penaloza, Valentina Lenarduzzi,
- Abstract summary: Code Technical Debt (Code TD) prediction has gained significant attention in recent software engineering research.<n>No standardized approach to Code TD prediction fully captures the factors influencing its evolution.<n>Our study aims to assess the impact of time-dependent models and seasonal effects on Code TD prediction.
- Score: 6.244245899613162
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Background. Code Technical Debt (Code TD) prediction has gained significant attention in recent software engineering research. However, no standardized approach to Code TD prediction fully captures the factors influencing its evolution. Objective. Our study aims to assess the impact of time-dependent models and seasonal effects on Code TD prediction. It evaluates such models against widely used Machine Learning models, also considering the influence of seasonality on prediction performance. Methods. We trained 11 prediction models with 31 Java open-source projects. To assess their performance, we predicted future observations of the SQALE index. To evaluate the practical usability of our TD forecasting model and its impact on practitioners, we surveyed 23 software engineering professionals. Results. Our study confirms the benefits of time-dependent techniques, with the ARIMAX model outperforming the others. Seasonal effects improved predictive performance, though the impact remained modest. \ReviewerA{ARIMAX/SARIMAX models demonstrated to provide well-balanced long-term forecasts. The survey highlighted strong industry interest in short- to medium-term TD forecasts. Conclusions. Our findings support using techniques that capture time dependence in historical software metric data, particularly for Code TD. Effectively addressing this evidence requires adopting methods that account for temporal patterns.
Related papers
- A Distillation-based Future-aware Graph Neural Network for Stock Trend Prediction [4.655696097611871]
We propose a future-aware graph neural network framework (DishFT-GNN) for stock trend prediction.<n>DishFT-GNN trains a teacher model and a student model, iteratively.<n>We verify the state-of-the-art performance of DishFT-GNN on two real-world datasets.
arXiv Detail & Related papers (2025-02-15T11:44:15Z) - Time Series Feature Redundancy Paradox: An Empirical Study Based on Mortgage Default Prediction [0.5755004576310334]
Conventional wisdom suggests that longer training periods and more feature variables contribute to improved model performance.
This paper, focusing on mortgage default prediction, empirically discovers a phenomenon that contradicts traditional knowledge.
arXiv Detail & Related papers (2024-12-23T21:28:32Z) - Future-Guided Learning: A Predictive Approach To Enhance Time-Series Forecasting [4.866362841501992]
We introduce Future-Guided Learning, an approach that enhances time-series event forecasting through a dynamic feedback mechanism inspired by predictive coding.<n>Our method involves two models: a detection model that analyzes future data to identify critical events and a forecasting model that predicts these events based on current data.<n>We validate our approach on a variety of tasks, demonstrating a 44.8% increase in AUC-ROC for seizure prediction using EEG data, and a 48.7% reduction in MSE for forecasting in nonlinear dynamical systems.
arXiv Detail & Related papers (2024-10-19T21:22:55Z) - Benchmarking Graph Conformal Prediction: Empirical Analysis, Scalability, and Theoretical Insights [6.801587574420671]
Conformal prediction has become increasingly popular for quantifying the uncertainty associated with machine learning models.
Recent work in graph uncertainty quantification has built upon this approach for conformal graph prediction.
We analyze the design choices made in the literature and discuss the tradeoffs associated with existing methods.
arXiv Detail & Related papers (2024-09-26T23:13:51Z) - Systematic literature review on forecasting and prediction of technical debt evolution [0.0]
Technical debt (TD) refers to the additional costs incurred due to compromises in software quality.
This study aims to explore existing knowledge in software engineering to gain insights into approaches proposed in research and industry.
arXiv Detail & Related papers (2024-06-17T18:50:37Z) - Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting [4.5424061912112474]
This paper reviews recent progress in time series precipitation forecasting models using deep learning.
We categorize forecasting models into textitrecursive and textitmultiple strategies based on their approaches to predict future frames.
We evaluate current deep learning-based models for precipitation forecasting on a public benchmark, discuss their limitations and challenges, and present some promising research directions.
arXiv Detail & Related papers (2024-06-07T12:07:09Z) - Selective Temporal Knowledge Graph Reasoning [70.11788354442218]
Temporal Knowledge Graph (TKG) aims to predict future facts based on given historical ones.
Existing TKG reasoning models are unable to abstain from predictions they are uncertain.
We propose an abstention mechanism for TKG reasoning, which helps the existing models make selective, instead of indiscriminate, predictions.
arXiv Detail & Related papers (2024-04-02T06:56:21Z) - Learning-Augmented Algorithms with Explicit Predictors [67.02156211760415]
Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data.
Prior research in this context was focused on a paradigm where the predictor is pre-trained on past data and then used as a black box.
In this work, we unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge.
arXiv Detail & Related papers (2024-03-12T08:40:21Z) - ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z) - Performative Time-Series Forecasting [71.18553214204978]
We formalize performative time-series forecasting (PeTS) from a machine-learning perspective.
We propose a novel approach, Feature Performative-Shifting (FPS), which leverages the concept of delayed response to anticipate distribution shifts.
We conduct comprehensive experiments using multiple time-series models on COVID-19 and traffic forecasting tasks.
arXiv Detail & Related papers (2023-10-09T18:34:29Z) - DeepVol: Volatility Forecasting from High-Frequency Data with Dilated Causal Convolutions [53.37679435230207]
We propose DeepVol, a model based on Dilated Causal Convolutions that uses high-frequency data to forecast day-ahead volatility.
Our empirical results suggest that the proposed deep learning-based approach effectively learns global features from high-frequency data.
arXiv Detail & Related papers (2022-09-23T16:13:47Z) - Logic Constraints to Feature Importances [17.234442722611803]
"Black box" nature of AI models is often a limit for a reliable application in high-stakes fields like diagnostic techniques, autonomous guide, etc.
Recent works have shown that an adequate level of interpretability could enforce the more general concept of model trustworthiness.
The basic idea of this paper is to exploit the human prior knowledge of the features' importance for a specific task, in order to coherently aid the phase of the model's fitting.
arXiv Detail & Related papers (2021-10-13T09:28:38Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Learning Prediction Intervals for Model Performance [1.433758865948252]
We propose a method to compute prediction intervals for model performance.
We evaluate our approach across a wide range of drift conditions and show substantial improvement over competitive baselines.
arXiv Detail & Related papers (2020-12-15T21:32:03Z) - Counterfactual Predictions under Runtime Confounding [74.90756694584839]
We study the counterfactual prediction task in the setting where all relevant factors are captured in the historical data.
We propose a doubly-robust procedure for learning counterfactual prediction models in this setting.
arXiv Detail & Related papers (2020-06-30T15:49:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.