Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series
- URL: http://arxiv.org/abs/2506.10412v2
- Date: Mon, 23 Jun 2025 21:10:15 GMT
- Title: Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series
- Authors: Ching Chang, Jeehyun Hwang, Yidan Shi, Haixin Wang, Wen-Chih Peng, Tien-Fu Chen, Wei Wang,
- Abstract summary: Time-IMM is a dataset designed to capture cause-driven irregularity in multimodal time series.<n>IMM-TSF is a benchmark library for forecasting on irregular multimodal time series.<n> Empirical results demonstrate that explicitly modeling multimodality on irregular time series data leads to substantial gains in forecasting performance.
- Score: 12.066711928647265
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Time series data in real-world applications such as healthcare, climate modeling, and finance are often irregular, multimodal, and messy, with varying sampling rates, asynchronous modalities, and pervasive missingness. However, existing benchmarks typically assume clean, regularly sampled, unimodal data, creating a significant gap between research and real-world deployment. We introduce Time-IMM, a dataset specifically designed to capture cause-driven irregularity in multimodal multivariate time series. Time-IMM represents nine distinct types of time series irregularity, categorized into trigger-based, constraint-based, and artifact-based mechanisms. Complementing the dataset, we introduce IMM-TSF, a benchmark library for forecasting on irregular multimodal time series, enabling asynchronous integration and realistic evaluation. IMM-TSF includes specialized fusion modules, including a timestamp-to-text fusion module and a multimodality fusion module, which support both recency-aware averaging and attention-based integration strategies. Empirical results demonstrate that explicitly modeling multimodality on irregular time series data leads to substantial gains in forecasting performance. Time-IMM and IMM-TSF provide a foundation for advancing time series analysis under real-world conditions. The dataset is publicly available at https://www.kaggle.com/datasets/blacksnail789521/time-imm/data, and the benchmark library can be accessed at https://anonymous.4open.science/r/IMMTSF_NeurIPS2025.
Related papers
- Robust Group Anomaly Detection for Quasi-Periodic Network Time Series [47.60720976101336]
We propose a framework to identify unusual and interesting time series within a network time series database.<n>We develop a surrogate-based optimization algorithm that can efficiently train the seq2GMM model.
arXiv Detail & Related papers (2025-06-20T08:11:04Z) - Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting [64.45587649141842]
Time-series forecasting plays a critical role in many real-world applications.<n>No single model consistently outperforms others across different test samples, but instead (ii) each model excels in specific cases.<n>We introduce TimeFuse, a framework for collective time-series forecasting with sample-level adaptive fusion of heterogeneous models.
arXiv Detail & Related papers (2025-05-24T00:45:07Z) - Multimodal Conditioned Diffusive Time Series Forecasting [16.72476672866356]
We propose a multimodal conditioned diffusion model for time series forecasting (TSF)<n>Timestamps and texts are combined to establish temporal and semantic correlations among different data points.<n>Experiments on real-world benchmark datasets demonstrate that the proposed MCD-TSF model achieves state-of-the-art performance.
arXiv Detail & Related papers (2025-04-28T10:56:23Z) - TimePFN: Effective Multivariate Time Series Forecasting with Synthetic Data [22.458320848520042]
TimePFN is based on the concept of Prior-data Fitted Networks (PFN), which aims to approximate Bayesian inference.<n>We evaluate TimePFN on several benchmark datasets and demonstrate that it outperforms the existing state-of-the-art models for MTS forecasting.
arXiv Detail & Related papers (2025-02-22T16:55:14Z) - Multi-Modal Forecaster: Jointly Predicting Time Series and Textual Data [23.10730301634422]
Current forecasting approaches are largely unimodal and ignore the rich textual data that often accompany the time series.
We develop TimeText Corpus (TTC), a carefully curated, time-aligned text and time dataset for multimodal forecasting.
Our dataset is composed of sequences of numbers and text aligned to timestamps, and includes data from two different domains: climate science and healthcare.
arXiv Detail & Related papers (2024-11-11T06:04:15Z) - Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts [103.725112190618]
This paper introduces Moirai-MoE, using a single input/output projection layer while delegating the modeling of diverse time series patterns to the sparse mixture of experts.
Extensive experiments on 39 datasets demonstrate the superiority of Moirai-MoE over existing foundation models in both in-distribution and zero-shot scenarios.
arXiv Detail & Related papers (2024-10-14T13:01:11Z) - Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis [40.44013652777716]
Time-MMD is the first multi-domain, multimodal time series dataset.<n> MM-TSFlib is the first-cut multimodal time-series forecasting library.
arXiv Detail & Related papers (2024-06-12T20:20:09Z) - UniTST: Effectively Modeling Inter-Series and Intra-Series Dependencies for Multivariate Time Series Forecasting [98.12558945781693]
We propose a transformer-based model UniTST containing a unified attention mechanism on the flattened patch tokens.
Although our proposed model employs a simple architecture, it offers compelling performance as shown in our experiments on several datasets for time series forecasting.
arXiv Detail & Related papers (2024-06-07T14:39:28Z) - Time-LLM: Time Series Forecasting by Reprogramming Large Language Models [110.20279343734548]
Time series forecasting holds significant importance in many real-world dynamic systems.
We present Time-LLM, a reprogramming framework to repurpose large language models for time series forecasting.
Time-LLM is a powerful time series learner that outperforms state-of-the-art, specialized forecasting models.
arXiv Detail & Related papers (2023-10-03T01:31:25Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - TFAD: A Decomposition Time Series Anomaly Detection Architecture with
Time-Frequency Analysis [12.867257563413972]
Time series anomaly detection is a challenging problem due to the complex temporal dependencies and the limited label data.
We propose a Time-Frequency analysis based time series Anomaly Detection model, or TFAD, to exploit both time and frequency domains for performance improvement.
arXiv Detail & Related papers (2022-10-18T09:08:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.