Fugu-MT 論文翻訳(概要): How Transparent is DiffusionGemma?

論文の概要: How Transparent is DiffusionGemma?

arxiv url: http://arxiv.org/abs/2606.20560v1
Date: Thu, 18 Jun 2026 17:59:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-19 18:23:40.055521
Title: How Transparent is DiffusionGemma?
Title（参考訳）: DiffusionGemmaはいかに透明か?
Authors: Joshua Engels, Callum McDougall, Bilal Chughtai, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue, João Gabriel Lopes de Oliveira, Rohin Shah, Neel Nanda,
Abstract要約: LLM推論の透明性は、モデル決定を理解する上で重要な能力である。また, DiffusionGemmaは, 対応する自己回帰的Gemma 4モデルよりも, 連続的な潜時空間で計算のかなりの部分を実行することを示した。中間状態を解釈可能なものとして扱うと、不透明な直列深さはGemma 4の1.1倍になる。
参考スコア（独自算出の注目度）: 23.924817257547502
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM reasoning transparency is a critical affordance for understanding model decisions, mitigating misuse and misalignment, and debugging surprising model behaviors. However, DiffusionGemma performs a larger fraction of its computation in a continuous latent space; does this make its reasoning less transparent? We study this question by decomposing transparency into two components: variable transparency, whether we understand intermediate snapshots of a model's computational state; and algorithmic transparency, whether we can use these snapshots to reconstruct the process by which the model arrived at its outputs. Naively, DiffusionGemma has poor variable transparency: its opaque serial depth, the amount of serial computation that occurs in between interpretable model states, seems at first 28.6X higher than the corresponding autoregressive Gemma 4 model. However, we show that we can map the information flowing between denoising steps through an interpretable token bottleneck with no decrease in downstream performance. Treating these intermediate states as interpretable reduces the opaque serial depth to just 1.1X that of Gemma 4. Algorithmic transparency is harder for diffusion models than for autoregressive models because all token predictions in the canvas can change at every denoising step, giving the model the power to implement complicated distributed algorithms during the denoising process. To begin bridging this gap, we conduct a suite of interpretability case studies, uncovering initial evidence of novel diffusion-specific phenomena such as non-chronological reasoning, token and sequence smearing, and intermediate-context reasoning. Finally, we test monitorability, a key application of transparency that measures whether model outputs are useful for downstream tasks. We find that DiffusionGemma is similarly monitorable to Gemma 4.
Abstract（参考訳）: LLM推論の透明性は、モデル決定を理解し、誤用と修正を緩和し、驚くべきモデルの振る舞いをデバッグする上で、重要な機会である。しかし、DiffusionGemmaはその計算の大部分を連続的な潜在空間で実行します。本稿では,この問題を,変数透過性,モデルの計算状態の中間スナップショットを理解するか否か,モデルが出力に到達したプロセスの再構築にこれらのスナップショットを使用できるか,という2つのコンポーネントに分解して検討する。 DiffusionGemmaは、不透明なシリアル深度、解釈可能なモデル状態間で発生するシリアル計算の量は、対応する自己回帰的Gemma 4モデルよりもまず28.6倍高いように思われる。しかし、ダウンストリーム性能を低下させることなく、解釈可能なトークンボトルネックを通じて、デノベーションステップ間で流れる情報をマッピングできることが示される。これらの中間状態を解釈可能なものとして扱うと、不透明な直列深さはGemma 4の1.1倍に減少する。アルゴリズムの透明性は、自動回帰モデルよりも拡散モデルにとって難しい。なぜなら、キャンバス内の全てのトークン予測は、デノナイジングステップ毎に変化し、デノナイジングプロセス中に複雑な分散アルゴリズムを実装する力を与えるからだ。このギャップを埋めるために、我々は、非時間的推論、トークンとシーケンススミアリング、中間コンテキスト推論などの新しい拡散特異的現象の初期の証拠を明らかにする、解釈可能性に関する一連のケーススタディを実施している。最後に、ダウンストリームタスクにモデル出力が有用かどうかを測定する透過性の重要な応用である監視可能性をテストする。 DiffusionGemma も Gemma 4 と同様に監視可能である。

論文の概要: How Transparent is DiffusionGemma?

関連論文リスト