Fugu-MT 論文翻訳(概要): Rethinking the Role of Positional Encoding: Sliding-Window Transformers without PE Remain Turing Complete

論文の概要: Rethinking the Role of Positional Encoding: Sliding-Window Transformers without PE Remain Turing Complete

arxiv url: http://arxiv.org/abs/2606.01532v1
Date: Mon, 01 Jun 2026 01:28:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.771981
Title: Rethinking the Role of Positional Encoding: Sliding-Window Transformers without PE Remain Turing Complete
Title（参考訳）: 位置符号化の役割を再考する: PE残調を伴わないスライディング・ウィンドウ変換器
Authors: Qian Li, Xinyu Mao, Shang-Hua Teng,
Abstract要約: 位置符号化(PE)は、整列処理に必要な変換器として広く見なされている。この直観は、任意の普遍性を実現することができることを証明するために位置情報に依存する全ての先行結果の根底にある。我々は、この信念を、有限なスライディングコンテキストウインドウを通して生成が進行する、長期的推論に最も関係のある体制に再考する。
参考スコア（独自算出の注目度）: 13.001718919406164
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Positional encoding (PE) is widely viewed as necessary for transformers to process ordered sequences: without them, the next-token map appears permutation-invariant in its context tokens. This intuition underlies all prior universality results, which rely on positional information to prove that transformers with chain-of-thought can perform arbitrary computation, i.e., they are Turing complete. We revisit this belief in the regime most relevant to long-form reasoning, where generation proceeds through a finite sliding context window. Our opening perception is that the window mechanism itself (mildly) breaks the permutation symmetry. To distill and precisely capture the degree of this added expressiveness, we introduce an abstract autoregressive model, the HIST model, in which each update depends only on constant-size internal state and the token-count histogram within the current window. We prove that this HIST model is Turing complete by showing that the evolution of the window can reveal the token that has just left the window, which suffices to simulate Turing-complete Post machines. We then construct a sliding-window transformer over a constant-size token alphabet, without PE, and show that it can simulate the HIST model. Our result demonstrates that positional encodings are not indispensable for transformers to perform universal computation: The window sliding itself already breaks permutation symmetry and captures sufficient positional information.
Abstract（参考訳）: 位置符号化(PE)は、トランスフォーマーが順序付けられたシーケンスを処理するために必要であると考えられており、それらなしでは、次のトーケン写像はそのコンテキストトークンに置換不変である。この直観は任意の計算、すなわちチューリング完全であることを示すために位置情報に依存する全ての先行普遍性の結果の基盤となる。我々は、この信念を、有限なスライディングコンテキストウインドウを通して生成が進行する、長期的推論に最も関係のある体制に再考する。我々のオープニング・インセプションは、ウィンドウ機構自体が(わずかに)置換対称性を破るということです。この付加表現性の度合いを抽出し,正確に把握するために,各更新は,現在のウィンドウ内の一定サイズの内部状態とトークン数ヒストグラムにのみ依存する抽象自己回帰モデル,HISTモデルを導入する。我々は、このHISTモデルがチューリング完全であることを証明し、ウィンドウの進化によってちょうどウィンドウを離れたトークンが明らかになり、それがチューリング完全Postマシンをシミュレートするのに十分であることを示す。次に,PEを使わずに,一定サイズのトークンアルファベット上にスライディングウィンドウ変換器を構築し,HISTモデルをシミュレート可能であることを示す。ウィンドウスライディング自体は、置換対称性を破り、十分な位置情報をキャプチャする。

論文の概要: Rethinking the Role of Positional Encoding: Sliding-Window Transformers without PE Remain Turing Complete

関連論文リスト