Enhancing Multivariate Time Series Forecasting with Mutual
Information-driven Cross-Variable and Temporal Modeling
- URL: http://arxiv.org/abs/2403.00869v1
- Date: Fri, 1 Mar 2024 04:42:47 GMT
- Title: Enhancing Multivariate Time Series Forecasting with Mutual
Information-driven Cross-Variable and Temporal Modeling
- Authors: Shiyi Qi, Liangjian Wen, Yiduo Li, Yuanhang Yang, Zhe Li, Zhongwen
Rao, Lujia Pan, Zenglin Xu
- Abstract summary: We introduce the Cross-variable Decorrelation Aware feature Modeling (CDAM) for Channel-mixing approaches.
We also introduce the Temporal correlation Aware Modeling (TAM) to exploit temporal correlations, a step beyond conventional single-step forecasting methods.
Our novel framework significantly surpasses existing models, including those previously considered state-of-the-art, in comprehensive tests.
- Score: 24.041263835195423
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advancements have underscored the impact of deep learning techniques
on multivariate time series forecasting (MTSF). Generally, these techniques are
bifurcated into two categories: Channel-independence and Channel-mixing
approaches. Although Channel-independence methods typically yield better
results, Channel-mixing could theoretically offer improvements by leveraging
inter-variable correlations. Nonetheless, we argue that the integration of
uncorrelated information in channel-mixing methods could curtail the potential
enhancement in MTSF model performance. To substantiate this claim, we introduce
the Cross-variable Decorrelation Aware feature Modeling (CDAM) for
Channel-mixing approaches, aiming to refine Channel-mixing by minimizing
redundant information between channels while enhancing relevant mutual
information. Furthermore, we introduce the Temporal correlation Aware Modeling
(TAM) to exploit temporal correlations, a step beyond conventional single-step
forecasting methods. This strategy maximizes the mutual information between
adjacent sub-sequences of both the forecasted and target series. Combining CDAM
and TAM, our novel framework significantly surpasses existing models, including
those previously considered state-of-the-art, in comprehensive tests.
Related papers
- DisenTS: Disentangled Channel Evolving Pattern Modeling for Multivariate Time Series Forecasting [43.071713191702486]
DisenTS is a tailored framework for modeling disentangled channel evolving patterns in general time series forecasting.
We introduce a novel Forecaster Aware Gate (FAG) module that generates the routing signals adaptively according to both the forecasters' states and input series' characteristics.
arXiv Detail & Related papers (2024-10-30T12:46:14Z) - SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion [59.96233305733875]
Time series forecasting plays a crucial role in various fields such as finance, traffic management, energy, and healthcare.
Several methods utilize mechanisms like attention or mixer to address this by capturing channel correlations.
This paper presents an efficient-based model, the Series-cOre Fused Time Series forecaster (SOFTS)
arXiv Detail & Related papers (2024-04-22T14:06:35Z) - From Similarity to Superiority: Channel Clustering for Time Series Forecasting [61.96777031937871]
We develop a novel and adaptable Channel Clustering Module ( CCM)
CCM dynamically groups channels characterized by intrinsic similarities and leverages cluster information instead of individual channel identities.
CCM can boost the performance of CI and CD models by an average margin of 2.4% and 7.2% on long-term and short-term forecasting, respectively.
arXiv Detail & Related papers (2024-03-31T02:46:27Z) - MCformer: Multivariate Time Series Forecasting with Mixed-Channels Transformer [8.329947472853029]
Channel Independence (CI) strategy treats all channels as a single channel, expanding the dataset.
Mixed Channels strategy combines the data expansion advantages of the CI strategy with the ability to counteract inter-channel correlation forgetting.
Model blends a specific number of channels, leveraging an attention mechanism to effectively capture inter-channel correlation information.
arXiv Detail & Related papers (2024-03-14T09:43:07Z) - Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling [14.170879566023098]
We introduce a learnable decomposition strategy to capture dynamic trend information more reasonably.
We also propose a dual attention module tailored to capture inter-series dependencies and intra-series variations simultaneously.
arXiv Detail & Related papers (2024-02-20T03:45:59Z) - Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts
in Underspecified Visual Tasks [92.32670915472099]
We propose an ensemble diversification framework exploiting the generation of synthetic counterfactuals using Diffusion Probabilistic Models (DPMs)
We show that diffusion-guided diversification can lead models to avert attention from shortcut cues, achieving ensemble diversity performance comparable to previous methods requiring additional data collection.
arXiv Detail & Related papers (2023-10-03T17:37:52Z) - The Capacity and Robustness Trade-off: Revisiting the Channel
Independent Strategy for Multivariate Time Series Forecasting [50.48888534815361]
We show that models trained with the Channel Independent (CI) strategy outperform those trained with the Channel Dependent (CD) strategy.
Our results conclude that the CD approach has higher capacity but often lacks robustness to accurately predict distributionally drifted time series.
We propose a modified CD method called Predict Residuals with Regularization (PRReg) that can surpass the CI strategy.
arXiv Detail & Related papers (2023-04-11T13:15:33Z) - Multi-scale Attention Flow for Probabilistic Time Series Forecasting [68.20798558048678]
We propose a novel non-autoregressive deep learning model, called Multi-scale Attention Normalizing Flow(MANF)
Our model avoids the influence of cumulative error and does not increase the time complexity.
Our model achieves state-of-the-art performance on many popular multivariate datasets.
arXiv Detail & Related papers (2022-05-16T07:53:42Z) - Learning to Perform Downlink Channel Estimation in Massive MIMO Systems [72.76968022465469]
We study downlink (DL) channel estimation in a Massive multiple-input multiple-output (MIMO) system.
A common approach is to use the mean value as the estimate, motivated by channel hardening.
We propose two novel estimation methods.
arXiv Detail & Related papers (2021-09-06T13:42:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.