Improving Accuracy Without Losing Interpretability: A ML Approach for
Time Series Forecasting
- URL: http://arxiv.org/abs/2212.06620v1
- Date: Tue, 13 Dec 2022 14:51:10 GMT
- Title: Improving Accuracy Without Losing Interpretability: A ML Approach for
Time Series Forecasting
- Authors: Yiqi Sun, Zhengxin Shi, Jianshen Zhang, Yongzhi Qi, Hao Hu, Zuojun Max
Shen
- Abstract summary: In time series forecasting, decomposition-based algorithms break aggregate data into meaningful components.
Recent algorithms often combine machine learning (hereafter ML) methodology with decomposition to improve prediction accuracy.
We propose the W-R algorithm, a hybrid algorithm that combines decomposition and ML from a novel perspective.
- Score: 4.025941501724274
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In time series forecasting, decomposition-based algorithms break aggregate
data into meaningful components and are therefore appreciated for their
particular advantages in interpretability. Recent algorithms often combine
machine learning (hereafter ML) methodology with decomposition to improve
prediction accuracy. However, incorporating ML is generally considered to
sacrifice interpretability inevitably. In addition, existing hybrid algorithms
usually rely on theoretical models with statistical assumptions and focus only
on the accuracy of aggregate predictions, and thus suffer from accuracy
problems, especially in component estimates. In response to the above issues,
this research explores the possibility of improving accuracy without losing
interpretability in time series forecasting. We first quantitatively define
interpretability for data-driven forecasts and systematically review the
existing forecasting algorithms from the perspective of interpretability.
Accordingly, we propose the W-R algorithm, a hybrid algorithm that combines
decomposition and ML from a novel perspective. Specifically, the W-R algorithm
replaces the standard additive combination function with a weighted variant and
uses ML to modify the estimates of all components simultaneously. We
mathematically analyze the theoretical basis of the algorithm and validate its
performance through extensive numerical experiments. In general, the W-R
algorithm outperforms all decomposition-based and ML benchmarks. Based on
P50_QL, the algorithm relatively improves by 8.76% in accuracy on the practical
sales forecasts of JD.com and 77.99% on a public dataset of electricity loads.
This research offers an innovative perspective to combine the statistical and
ML algorithms, and JD.com has implemented the W-R algorithm to make accurate
sales predictions and guide its marketing activities.
Related papers
- Learning optimal objective values for MILP [3.2170228238416123]
We propose a methodology for predicting the optimal objective value, or, equivalently, predicting if the current incumbent is optimal.
For this task, we introduce a predictor based on a graph neural network (GNN) architecture, together with a set of dynamic features.
Experimental results on diverse benchmarks demonstrate the efficacy of our approach, achieving high accuracy in the prediction task and outperforming existing methods.
arXiv Detail & Related papers (2024-11-27T13:22:31Z) - Comparative Evaluation of Metaheuristic Algorithms for Hyperparameter
Selection in Short-Term Weather Forecasting [0.0]
This paper explores the application of metaheuristic algorithms, namely Genetic Algorithm (GA), Differential Evolution (DE) and Particle Swarm Optimization (PSO)
We evaluate their performance in weather forecasting based on metrics such as Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE)
arXiv Detail & Related papers (2023-09-05T22:13:35Z) - Algorithme EM r\'egularis\'e [0.0]
This paper presents a regularized version of the EM algorithm that efficiently uses prior knowledge to cope with a small sample size.
Experiments on real data highlight the good performance of the proposed algorithm for clustering purposes.
arXiv Detail & Related papers (2023-07-04T23:19:25Z) - Representation Learning with Multi-Step Inverse Kinematics: An Efficient
and Optimal Approach to Rich-Observation RL [106.82295532402335]
Existing reinforcement learning algorithms suffer from computational intractability, strong statistical assumptions, and suboptimal sample complexity.
We provide the first computationally efficient algorithm that attains rate-optimal sample complexity with respect to the desired accuracy level.
Our algorithm, MusIK, combines systematic exploration with representation learning based on multi-step inverse kinematics.
arXiv Detail & Related papers (2023-04-12T14:51:47Z) - Development and Evaluation of Conformal Prediction Methods for QSAR [0.5161531917413706]
The quantitative structure-activity relationship (QSAR) regression model is a commonly used technique for predicting biological activities of compounds.
Most machine learning (ML) algorithms that achieve superior predictive performance require some add-on methods for estimating uncertainty of their prediction.
Conformal prediction (CP) is a promising approach. It is agnostic to the prediction algorithm and can produce valid prediction intervals under some weak assumptions on the data distribution.
arXiv Detail & Related papers (2023-04-03T13:41:09Z) - Scalable computation of prediction intervals for neural networks via
matrix sketching [79.44177623781043]
Existing algorithms for uncertainty estimation require modifying the model architecture and training procedure.
This work proposes a new algorithm that can be applied to a given trained neural network and produces approximate prediction intervals.
arXiv Detail & Related papers (2022-05-06T13:18:31Z) - Efficient and Differentiable Conformal Prediction with General Function
Classes [96.74055810115456]
We propose a generalization of conformal prediction to multiple learnable parameters.
We show that it achieves approximate valid population coverage and near-optimal efficiency within class.
Experiments show that our algorithm is able to learn valid prediction sets and improve the efficiency significantly.
arXiv Detail & Related papers (2022-02-22T18:37:23Z) - Non-Clairvoyant Scheduling with Predictions Revisited [77.86290991564829]
In non-clairvoyant scheduling, the task is to find an online strategy for scheduling jobs with a priori unknown processing requirements.
We revisit this well-studied problem in a recently popular learning-augmented setting that integrates (untrusted) predictions in algorithm design.
We show that these predictions have desired properties, admit a natural error measure as well as algorithms with strong performance guarantees.
arXiv Detail & Related papers (2022-02-21T13:18:11Z) - Robustification of Online Graph Exploration Methods [59.50307752165016]
We study a learning-augmented variant of the classical, notoriously hard online graph exploration problem.
We propose an algorithm that naturally integrates predictions into the well-known Nearest Neighbor (NN) algorithm.
arXiv Detail & Related papers (2021-12-10T10:02:31Z) - Dual Optimization for Kolmogorov Model Learning Using Enhanced Gradient
Descent [8.714458129632158]
Kolmogorov model (KM) is an interpretable and predictable representation approach to learning the underlying probabilistic structure of a set of random variables.
We propose a computationally scalable KM learning algorithm, based on the regularized dual optimization combined with enhanced gradient descent (GD) method.
It is shown that the accuracy of logical relation mining for interpretability by using the proposed KM learning algorithm exceeds $80%$.
arXiv Detail & Related papers (2021-07-11T10:33:02Z) - Fractal Structure and Generalization Properties of Stochastic
Optimization Algorithms [71.62575565990502]
We prove that the generalization error of an optimization algorithm can be bounded on the complexity' of the fractal structure that underlies its generalization measure.
We further specialize our results to specific problems (e.g., linear/logistic regression, one hidden/layered neural networks) and algorithms.
arXiv Detail & Related papers (2021-06-09T08:05:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.