Fugu-MT 論文翻訳(概要): Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

論文の概要: Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

arxiv url: http://arxiv.org/abs/2604.26762v1
Date: Wed, 29 Apr 2026 14:57:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.447732
Title: Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework
Title（参考訳）: 時系列モデリングのための確率変換器の可能性を探る:ST-PTフレームワークの報告
Authors: Zhangzhi Xiong, Haoyi Wu, You Wu, Shuqi Gu, Kan Ren, Kewei Tu,
Abstract要約: Probabilistic Transformer (PT) は、トランスフォーマーの自己注意とフィードフォワードブロックが、条件付きランダムフィールド(CRF)上の平均場変分推論(MFVI)と数学的に等価であることを示す。この等価性の下で、Transformerはブラックボックスニューラルネットワークをやめ、プログラム可能な因子グラフとなる。グラフトポロジ、因子ポテンシャル、メッセージパススケジュールはすべて明示的で検査可能なプリミティブで、設計することができる。
参考スコア（独自算出の注目度）: 45.082922583467116
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The Probabilistic Transformer (PT) establishes that the Transformer's self-attention plus its feed-forward block is mathematically equivalent to Mean-Field Variational Inference (MFVI) on a Conditional Random Field (CRF). Under this equivalence the Transformer ceases to be a black-box neural network and becomes a programmable factor graph: graph topology, factor potentials, and the message-passing schedule are all explicit and inspectable primitives that can be engineered. PT was originally developed for natural language and in this report we investigate its potential for time series. We first lift PT into the Spatial-Temporal Probabilistic Transformer (ST-PT) to repair PT's missing channel axis and weak per-step semantics, and adopt ST-PT as a shared cornerstone backbone. We then identify three distinct properties that PT/ST-PT offers as a factor-graph model and derive three Research Questions, one per property, that probe how each property can be exploited in time series: RQ1. The graph topology and potentials are direct programmable primitives. Can this be used to inject symbolic time-series priors into ST-PT through structural graph modifications, especially under data scarcity and noise? RQ2. The CRF's factor matrices are the operator's potentials. Can an external condition program these factor matrices on a per-sample basis, so that conditional generation becomes structural rather than feature-level modulation of a fixed one? RQ3. Each MFVI iteration is a Bayesian posterior update on the factor graph. Can this turn the latent transition of latent-space AutoRegressive (AR) forecasting from an opaque MLP into a principled posterior update, and can a CRF teacher distill its latents into the AR student to counter cumulative error? We give one empirical study per question. Together, these three studies position ST-PT as a programmable framework for time-series modeling.
Abstract（参考訳）: Probabilistic Transformer (PT) は、トランスフォーマーの自己注意とフィードフォワードブロックが、条件付きランダムフィールド(CRF)上の平均場変分推論(MFVI)と数学的に等価であることを示す。この等価性の下で、Transformerはブラックボックスニューラルネットワークをやめ、プログラム可能な因子グラフとなる。グラフトポロジ、因子ポテンシャル、メッセージパススケジュールはすべて明示的で検査可能なプリミティブで、設計することができる。 PTはもともと自然言語用に開発されたもので,本報告では時系列化の可能性について検討する。まず,PTを時空間確率変換器(ST-PT)に持ち込み,PTのチャネル軸の欠如とステップごとのセマンティクスの弱さを修復し,ST-PTを共通グランドストーンバックボーンとして採用する。次に、PT/ST-PTが因子グラフモデルとして提供する3つの異なる特性を特定し、各特性をどのように時系列で活用できるかを調査する3つの調査質問を導出する: RQ1。グラフトポロジーとポテンシャルは直接プログラム可能なプリミティブである。これは、特にデータ不足とノイズの下で、構造グラフの修正によって、ST-PTに象徴的な時系列を注入することができるか? RQ2。 CRFの因子行列は作用素のポテンシャルである。外部条件は、これらの因子行列をサンプル単位でプログラムできるので、固定された要素の特徴レベル変調よりも条件生成が構造的になるか? RQ3。各MFVIイテレーションは、因子グラフ上のベイズ的後続更新である。これは、潜伏空間の自己回帰(AR)予測の潜伏遷移を、不透明なMLPから原則的な後続更新に変え、CRF教師が潜伏者をAR学生に蒸留して累積誤差に対処できるか? 私たちは1問に1つの実証的研究をする。これら3つの研究は、ST-PTを時系列モデリングのプログラム可能なフレームワークとして位置づけている。

論文の概要: Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

関連論文リスト