VEST: Automatic Feature Engineering for Forecasting
- URL: http://arxiv.org/abs/2010.07137v1
- Date: Wed, 14 Oct 2020 14:54:56 GMT
- Title: VEST: Automatic Feature Engineering for Forecasting
- Authors: Vitor Cerqueira, Nuno Moniz, Carlos Soares
- Abstract summary: We investigate the extension of auto-regressive processes using statistics which summarise the recent past dynamics of time series.
The proposed approach works in three main steps. First, recent observations are mapped onto different representations. Second, each representation is summarised by statistical functions. Finally, a filter is applied for feature selection.
- Score: 3.9747898273716697
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Time series forecasting is a challenging task with applications in a wide
range of domains. Auto-regression is one of the most common approaches to
address these problems. Accordingly, observations are modelled by multiple
regression using their past lags as predictor variables. We investigate the
extension of auto-regressive processes using statistics which summarise the
recent past dynamics of time series. The result of our research is a novel
framework called VEST, designed to perform feature engineering using univariate
and numeric time series automatically. The proposed approach works in three
main steps. First, recent observations are mapped onto different
representations. Second, each representation is summarised by statistical
functions. Finally, a filter is applied for feature selection. We discovered
that combining the features generated by VEST with auto-regression
significantly improves forecasting performance. We provide evidence using 90
time series with high sampling frequency. VEST is publicly available online.
Related papers
- Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent [53.637837706712794]
We propose a Unified Trajectory Generation model, UniTraj, that processes arbitrary trajectories as masked inputs.
Specifically, we introduce a Ghost Spatial Masking (GSM) module embedded within a Transformer encoder for spatial feature extraction.
We benchmark three practical sports game datasets, Basketball-U, Football-U, and Soccer-U, for evaluation.
arXiv Detail & Related papers (2024-05-27T22:15:23Z) - Lag Selection for Univariate Time Series Forecasting using Deep Learning: An Empirical Study [0.393259574660092]
We focus on deep learning methods trained in a global approach, i.e., on datasets comprising multiple univariate time series.
The results indicate that the lag size is a relevant parameter for accurate forecasts.
Cross-validation approaches show the best performance for lag selection.
arXiv Detail & Related papers (2024-05-18T09:31:54Z) - FreDF: Learning to Forecast in Frequency Domain [56.24773675942897]
Time series modeling is uniquely challenged by the presence of autocorrelation in both historical and label sequences.
We introduce the Frequency-enhanced Direct Forecast (FreDF) which bypasses the complexity of label autocorrelation by learning to forecast in the frequency domain.
arXiv Detail & Related papers (2024-02-04T08:23:41Z) - Unsupervised Feature Based Algorithms for Time Series Extrinsic
Regression [0.9659642285903419]
Time Series Extrinsic Regression (TSER) involves using a set of training time series to form a predictive model of a continuous response variable.
DrCIF and FreshPRINCE models are the only ones that significantly outperform the standard rotation forest regressor.
arXiv Detail & Related papers (2023-05-02T13:58:20Z) - Generative Time Series Forecasting with Diffusion, Denoise, and
Disentanglement [51.55157852647306]
Time series forecasting has been a widely explored task of great importance in many applications.
It is common that real-world time series data are recorded in a short time period, which results in a big gap between the deep model and the limited and noisy time series.
We propose to address the time series forecasting problem with generative modeling and propose a bidirectional variational auto-encoder equipped with diffusion, denoise, and disentanglement.
arXiv Detail & Related papers (2023-01-08T12:20:46Z) - Instance-wise Graph-based Framework for Multivariate Time Series
Forecasting [69.38716332931986]
We propose a simple yet efficient instance-wise graph-based framework to utilize the inter-dependencies of different variables at different time stamps.
The key idea of our framework is aggregating information from the historical time series of different variables to the current time series that we need to forecast.
arXiv Detail & Related papers (2021-09-14T07:38:35Z) - AutoML Meets Time Series Regression Design and Analysis of the
AutoSeries Challenge [21.49840594645196]
First Automated Time Series Regression challenge (AutoSeries) for the WSDM Cup 2020.
We present its design, analysis, and post-hoc experiments.
arXiv Detail & Related papers (2021-07-28T06:30:46Z) - ScoreGrad: Multivariate Probabilistic Time Series Forecasting with
Continuous Energy-based Generative Models [10.337742174633052]
We propose ScoreGrad, a probabilistic time series forecasting framework based on continuous energy-based generative models.
ScoreGrad is composed of time series feature extraction module and conditional differential equation based score matching module.
It achieves state-of-the-art results on six real-world datasets.
arXiv Detail & Related papers (2021-06-18T13:22:12Z) - Interpretable Feature Construction for Time Series Extrinsic Regression [0.028675177318965035]
In some application domains, it occurs that the target variable is numerical and the problem is known as time series extrinsic regression (TSER)
We suggest an extension of a Bayesian method for robust and interpretable feature construction and selection in the context of TSER.
Our approach exploits a relational way to tackle with TSER: (i), we build various and simple representations of the time series which are stored in a relational data scheme, then, (ii), a propositionalisation technique is applied to build interpretable features from secondary tables to "flatten" the data.
arXiv Detail & Related papers (2021-03-15T08:12:19Z) - Superiority of Simplicity: A Lightweight Model for Network Device
Workload Prediction [58.98112070128482]
We propose a lightweight solution for series prediction based on historic observations.
It consists of a heterogeneous ensemble method composed of two models - a neural network and a mean predictor.
It achieves an overall $R2$ score of 0.10 on the available FedCSIS 2020 challenge dataset.
arXiv Detail & Related papers (2020-07-07T15:44:16Z) - Ambiguity in Sequential Data: Predicting Uncertain Futures with
Recurrent Models [110.82452096672182]
We propose an extension of the Multiple Hypothesis Prediction (MHP) model to handle ambiguous predictions with sequential data.
We also introduce a novel metric for ambiguous problems, which is better suited to account for uncertainties.
arXiv Detail & Related papers (2020-03-10T09:15:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.