Fugu-MT 論文翻訳(概要): SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting

論文の概要: SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting

arxiv url: http://arxiv.org/abs/2605.12550v1
Date: Sun, 10 May 2026 07:17:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.569921
Title: SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting
Title（参考訳）: SSDA:ビジョンベース時系列予測のためのデュアル適応によるスペクトルと構造ギャップのブリッジ化
Authors: Mingrui Zhang, Hanchen Yang, Wengen Li, Xudong Jiang, Yichao Zhang, Jihong Guan, Shuigeng Zhou,
Abstract要約: レンダリングされた時系列画像は、LVMが認識するために事前訓練されている自然な画像よりも、非常に浅いパワースペクトルを示すことを示す。時系列予測のためのLVMのポテンシャルを解放するために、スペクトル的かつ構造的に適応するデュアルブランチネットワークであるSSDAを提案する。
参考スコア（独自算出の注目度）: 39.55585786455421
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large vision models (LVMs) have recently proven to be surprisingly effective time series forecasters, simply by rendering temporal data as images. This success, how ever, rests on a largely unexamined premise: the rendered time series images are sufficiently close to natural images for knowledge in pre-trained models to transfer effectively. We argue that two gaps still remain, i.e., spectral and structural gaps, fundamentally limiting the potential of LVMs for time series forecasting. Spectrally, we systematically reveal that rendered time series images exhibit a markedly shallower power spectrum than the natural images LVMs are pre-trained to recognize. Structurally, reshaping 1D temporal sequences into 2D grids fabricates spurious spatial adjacencies while severing genuine temporal continuities, misleading the spatial inductive biases of pre-trained LVMs. To bridge these gaps, we propose SSDA, a dual-branch network that spectrally and structurally adapts to unlock the full potential of LVMs for time series forecasting. At the data level, a Spectral Magnitude Aligner (SMA) applies 2D FFT to selectively enhance the magnitude spectrum toward natural-image statistics while preserving phase. At the model level, a Structural-Guided Low-Rank Adaptation (SG-LoRA) injects position-aware temporal encodings into patch embeddings and adapts at tention via low-rank updates. The two branches are further adaptively fused to produce the final forecast. Extensive experiments on seven real-world benchmarks demonstrate that SSDA consistently outperforms strong LVM- and LLM-based baselines under both full-shot and few-shot settings. Code is publicly available at https://anonymous.4open.science/r/SSDA-8C5B.
Abstract（参考訳）: 大規模ビジョンモデル(LVM)は、単に時間データを画像としてレンダリングすることで、驚くほど効果的な時系列予測器であることが最近証明された。レンダリングされた時系列画像は、訓練済みのモデルにおいて、効果的に転送するための知識を得るために、自然な画像に十分近い。我々は、2つのギャップ、すなわちスペクトルと構造的ギャップが残っており、時系列予測のためのLVMのポテンシャルを根本的に制限していると主張している。分光学的には、レンダリングされた時系列画像は、LVMが認識するために事前訓練されている自然な画像よりも、非常に浅いパワースペクトルを示す。構造的に、1次元時間列を2次元グリッドに変換することで、真の時間的連続性を保ちながら、空間的隣接性を生じさせ、事前学習されたLVMの空間的帰納バイアスを誤解させる。これらのギャップを埋めるために、時系列予測のためのLVMのポテンシャルを解放するためにスペクトル的かつ構造的に適応するデュアルブランチネットワークであるSSDAを提案する。データレベルでは、SMA(Spectral Magnitude Aligner)が2D FFTを適用して、位相を保ちながら自然画像統計に対する大きさスペクトルを選択的に拡張する。モデルレベルでは、Structure-Guided Low-Rank Adaptation (SG-LoRA)は、位置認識の時間エンコーディングをパッチ埋め込みに注入し、低ランク更新を通じて保持時に適応する。 2つの枝はさらに適応的に融合して最終予測を生成する。 7つの実世界のベンチマークに関する大規模な実験では、SSDAはフルショットと少数ショットの両方で強力なLVMとLLMベースのベースラインを一貫して上回っている。コードはhttps://anonymous.4open.science/r/SSDA-8C5Bで公開されている。

論文の概要: SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting

関連論文リスト