Fugu-MT Paper Translation (Abstract): Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression

Paper summary: Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression

arxiv url: http://arxiv.org/abs/2603.10410v1
Date: Wed, 11 Mar 2026 04:48:12 GMT
Status: Translation complete
Last updated in system: 2026-03-12 16:22:32.783996
Title: Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
Title (reference translation): An Effective Dataset Distillation Method for Spatio-Temporal Forecasting via Two-Dimensional Compression
Authors: Taehyung Kwon, Yeonje Choi, Yeongho Kim, Kijung Shin
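Abstract summary: This paper proposes STemDist, the first dataset distillation method for spatio-temporal time series forecasting. The key idea of the solution is to compress both the temporal and spatial dimensions in a balanced manner, reducing training time and memory.
Reference score (independently computed attention measure): 26.189594254326334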
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Spatio-temporal time series are widely used in real-world applications, including traffic prediction and weather forecasting. They are sequences of observations over extensive periods and multiple locations, naturally represented as multidimensional data. Forecasting is a central task in spatio-temporal analysis, and numerous deep learning methods have been developed to address it. However, as dataset sizes and model complexities continue to grow in practice, training deep learning models has become increasingly time- and resource-intensive. A promising solution to this challenge is dataset distillation, which synthesizes compact datasets that can effectively replace the original data for model training. Although successful in various domains, including time series analysis, existing dataset distillation methods compress only one dimension, making them less suitable for spatio-temporal datasets, where both spatial and temporal dimensions jointly contribute to the large data volume. To address this limitation, we propose STemDist, the first dataset distillation method specialized for spatio-temporal time series forecasting. A key idea of our solution is to compress both temporal and spatial dimensions in a balanced manner, reducing training time and memory. We further reduce the distillation cost by performing distillation at the cluster level rather than the individual location level, and we complement this coarse-grained approach with a subset-based granular distillation technique that enhances forecasting performance. On five real-world datasets, we show empirically that, compared to both general and time-series dataset distillation methods, datasets distilled by our STemDist method make model training (1) faster (up to 6X), (2) more memory-efficient (up to 8X), and (3) more effective (with up to 12% lower prediction error).
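Abstract (reference translation): Spatio-temporal time series are widely used in real-world applications such as traffic prediction and weather forecasting. They are sequences of observations spanning long periods and multiple locations, and are naturally represented as multidimensional data. Forecasting is a central task in spatio-temporal analysis, and many deep learning methods have been developed for it. However, as dataset sizes and model complexity keep growing in practice, training deep learning models is becoming increasingly time- and resource-intensive. A promising solution to this problem is dataset distillation, which synthesizes a compact dataset that can effectively replace the original data for model training. Although they have succeeded in various domains, including time series analysis, existing dataset distillation methods compress only one dimension and are therefore poorly suited to spatio-temporal datasets, where the spatial and temporal dimensions jointly contribute to the large data volume. To address this limitation, the authors propose STemDist, the first dataset distillation method specialized for spatio-temporal time series forecasting. The key idea of the solution is to compress both the temporal and spatial dimensions in a balanced manner, reducing training time and memory. Distillation cost is further reduced by performing distillation at the cluster level rather than at the level of individual locations, and this coarse-grained approach is complemented by a subset-based granular distillation technique that improves forecasting performance. On five real-world datasets, it is shown empirically that, compared with both general and time-series dataset distillation methods, datasets distilled by STemDist make model training (1) faster (up to 6X), (2) more memory-efficient (up to 8X), and (3) more effective (up to 12% lower prediction error).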
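
Illustrative sketch (not from the paper): the abstract states that both the temporal and the spatial dimension are compressed, with locations handled at the cluster level. The toy Python below only shows how a time-by-location table shrinks along both axes. It is not the STemDist algorithm: plain averaging stands in for the learned synthesis that distillation performs, and the function name `compress_both_axes`, the cluster assignment, and the `time_block` parameter are all hypothetical.

```python
# Toy illustration only: shrinks a (time x location) table along both axes.
# Averaging is a stand-in for learned synthesis; this is NOT STemDist itself.

def compress_both_axes(series, clusters, time_block):
    """series: list of T rows, each a list of N location values.
    clusters: list of lists of location indices (hypothetical grouping).
    time_block: number of consecutive time steps merged into one row."""
    # Spatial step: replace each cluster of locations with its mean value.
    spatial = [
        [sum(row[i] for i in group) / len(group) for group in clusters]
        for row in series
    ]
    # Temporal step: average each block of `time_block` consecutive rows.
    compressed = []
    for start in range(0, len(spatial), time_block):
        block = spatial[start:start + time_block]
        compressed.append([sum(col) / len(block) for col in zip(*block)])
    return compressed


# 4 time steps x 4 locations, grouped into 2 clusters, 2 steps per block.
series = [
    [1.0, 3.0, 10.0, 20.0],
    [3.0, 5.0, 30.0, 40.0],
    [5.0, 7.0, 50.0, 60.0],
    [7.0, 9.0, 70.0, 80.0],
]
clusters = [[0, 1], [2, 3]]
small = compress_both_axes(series, clusters, time_block=2)

original_cells = len(series) * len(series[0])  # 4 * 4
small_cells = len(small) * len(small[0])       # 2 * 2
print(small, original_cells, small_cells)
```

In this toy case each axis is halved, so the table holds a quarter of the original cells. Compressing only one axis to reach the same size would require shrinking that axis by a factor of four.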
