Fugu-MT Paper Translation (Summary): Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers

Paper summary: Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers

arxiv url: http://arxiv.org/abs/2605.22471v1
Date: Thu, 21 May 2026 13:32:20 GMT
Status: Translation complete
Last updated in system: 2026-05-22 16:35:42.275275
Title: Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers
Title (reference translation): Lost in Tokenization: Fundamental Trade-offs in Graph Tokenization for Transformers
Authors: Maya Bechler-Speicher, Gilad Yehudai, Gil Harari, Clayton Sanford, Amir Globerson, Joan Bruna
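Abstract summary: We show that the choice of graph-to-token map is a fundamental component of transformer expressivity. We examine three tokenizations that serve as building blocks for many existing graph tokenizations: spectral, random-walk, and adjacency tokenizations.
Reference score (attention score computed independently by this site): 50.98108117044413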
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Transformers have become a central architecture for graph learning, but their application to graphs requires first choosing a tokenization: a graph-to-token map that determines which structural information is exposed at the input. In this work, we show that this choice is a fundamental component of transformer expressivity. We examine three tokenizations that serve as building blocks for many existing graph tokenizations: spectral, random-walk, and adjacency tokenizations. We prove that different tokenizations induce distinct depth regimes: the same graph computation may be realizable by a shallow transformer under one tokenization, while requiring substantially larger depth under another. For example, we prove that random-walk tokenization is lossy for any walk length, making it impossible in general to recover the graph from it, and that while spectral tokenization is lossless, it is ill-conditioned for local tasks. We further show that although both random-walk and spectral tokenizations are derived from adjacency information, it is impossible for a limited-depth transformer to convert between tokenization families in general. In particular, we establish lower bounds and impossibility results showing that unfavorable tokenizations may preclude the efficient recovery of more suitable structural representations. Finally, we complement our theory with controlled experiments on synthetic and real-world tasks, validating the predicted separations and showing that different tasks favor different structural views, and combining complementary tokenizations allows the transformer to leverage distinct signals from each representation.
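Abstract (reference translation): Transformers have become a central architecture for graph learning, but applying them to graphs requires first choosing a tokenization. In this work, we show that this choice is a fundamental component of transformer expressivity. We examine three tokenizations that serve as building blocks for many existing graph tokenizations: spectral, random-walk, and adjacency tokenizations. The same graph computation may be realizable by a shallow transformer under one tokenization while requiring greater depth under another. For example, we show that random-walk tokenization is lossy for any walk length, making it impossible in general to recover the graph, and that spectral tokenization is lossless but ill-conditioned for local tasks. We further show that although both random-walk and spectral tokenizations are derived from adjacency information, a limited-depth transformer cannot in general convert between tokenization families. In particular, we show that unfavorable tokenizations may preclude the efficient recovery of more suitable structural representations. Finally, we complement our theory with controlled experiments on synthetic and real-world tasks, validating the predicted separations, showing that different tasks favor different structural views, and showing that combining complementary tokenizations lets the transformer leverage distinct signals from each representation.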
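
The contrast between a lossy random-walk view and a lossless spectral view can be illustrated with a small NumPy sketch. This is not the paper's construction: it assumes one common form of random-walk encoding (per-node return probabilities, the diagonal of P^k for P = D^{-1}A) and one common form of spectral encoding (the full eigendecomposition of the Laplacian L = D - A). The paper's exact definitions may differ. All function names are hypothetical. The two test graphs, the 4x4 rook's graph and the Shrikhande graph, are a standard pair of non-isomorphic strongly regular graphs with identical parameters.

```python
import itertools
import numpy as np

def cayley_graph(offsets, n=4):
    """Adjacency matrix of the Cayley graph on Z_n x Z_n for a symmetric offset set."""
    nodes = list(itertools.product(range(n), repeat=2))
    index = {v: i for i, v in enumerate(nodes)}
    A = np.zeros((n * n, n * n), dtype=int)
    for (x, y) in nodes:
        for (dx, dy) in offsets:
            A[index[(x, y)], index[((x + dx) % n, (y + dy) % n)]] = 1
    return A

def random_walk_tokens(A, walk_length):
    """Assumed encoding: token of node i is [P^k[i, i] for k = 1..walk_length]."""
    P = A / A.sum(axis=1, keepdims=True)  # row-stochastic transition matrix
    Pk = np.eye(len(A))
    columns = []
    for _ in range(walk_length):
        Pk = Pk @ P
        columns.append(np.diag(Pk))
    return np.stack(columns, axis=1)

def spectral_tokens(A):
    """Assumed encoding: eigenvalues and eigenvectors of the Laplacian L = D - A."""
    L = np.diag(A.sum(axis=1)) - A
    return np.linalg.eigh(L)

def adjacency_from_spectral(eigvals, eigvecs):
    """Rebuild A from the full eigendecomposition: L = V diag(w) V^T, A = D - L."""
    L = eigvecs @ np.diag(eigvals) @ eigvecs.T
    return np.diag(np.diag(L)) - L

def count_4_cliques(A):
    """Brute-force count of 4-cliques, used only to tell the two graphs apart."""
    return sum(
        all(A[u, v] for u, v in itertools.combinations(quad, 2))
        for quad in itertools.combinations(range(len(A)), 4)
    )

# Rook's graph: adjacent iff same row or same column.
rook = cayley_graph([(a, 0) for a in (1, 2, 3)] + [(0, b) for b in (1, 2, 3)])
# Shrikhande graph: adjacent iff difference is +-(1,0), +-(0,1), or +-(1,1).
shrikhande = cayley_graph([(1, 0), (3, 0), (0, 1), (0, 3), (1, 1), (3, 3)])

same_rw_tokens = np.allclose(random_walk_tokens(rook, 8),
                             random_walk_tokens(shrikhande, 8))
different_graphs = count_4_cliques(rook) != count_4_cliques(shrikhande)
spectral_recovers = np.allclose(adjacency_from_spectral(*spectral_tokens(rook)), rook)
print(same_rw_tokens, different_graphs, spectral_recovers)
```

Under these assumptions, the two graphs receive identical return-probability tokens at every walk length even though they are not isomorphic, while the full Laplacian eigendecomposition reconstructs the adjacency matrix exactly. The sketch says nothing about transformer depth or conditioning, which are the subject of the paper's theoretical results.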

Related papers