Fugu-MT 論文翻訳(概要): <SOG_k>: One LLM Token for Explicit Graph Structural Understanding

論文の概要: : One LLM Token for Explicit Graph Structural Understanding

arxiv url: http://arxiv.org/abs/2602.01771v1
Date: Mon, 02 Feb 2026 07:55:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-03 19:28:33.993965
Title: <SOG_k>: One LLM Token for Explicit Graph Structural Understanding
Title（参考訳）: <SOG_k>:グラフ構造を明示的に理解するための1つのLLMトークン
Authors: Jingyao Wu, Bin Lu, Zijun Di, Xiaoying Gan, Meng Jin, Luoyi Fu, Xinbing Wang, Chenghu Zhou,
Abstract要約: 我々は、グラフの構造を統一トークン空間内に完全に表現するために、1つの特別なトークン SOG_k> を組み込むことを提案する。 SOG_k>は、簡潔で正確な方法でLLMに理解、生成、理性を与える。
参考スコア（独自算出の注目度）: 57.017902343605364
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large language models show great potential in unstructured data understanding, but still face significant challenges with graphs due to their structural hallucination. Existing approaches mainly either verbalize graphs into natural language, which leads to excessive token consumption and scattered attention, or transform graphs into trainable continuous embeddings (i.e., soft prompt), but exhibit severe misalignment with original text tokens. To solve this problem, we propose to incorporate one special token <SOG_k> to fully represent the Structure Of Graph within a unified token space, facilitating explicit topology input and structural information sharing. Specifically, we propose a topology-aware structural tokenizer that maps each graph topology into a highly selective single token. Afterwards, we construct a set of hybrid structure Question-Answering corpora to align new structural tokens with existing text tokens. With this approach, <SOG_k> empowers LLMs to understand, generate, and reason in a concise and accurate manner. Extensive experiments on five graph-level benchmarks demonstrate the superiority of our method, achieving a performance improvement of 9.9% to 41.4% compared to the baselines while exhibiting interpretability and consistency. Furthermore, our method provides a flexible extension to node-level tasks, enabling both global and local structural understanding. The codebase is publicly available at https://github.com/Jingyao-Wu/SOG.
Abstract（参考訳）: 大規模言語モデルは、構造化されていないデータ理解において大きな可能性を秘めているが、その構造的幻覚のため、グラフでは依然として重大な課題に直面している。既存のアプローチは、グラフを自然言語に動詞化することで、過剰なトークン消費と散在する注意を惹きつけるか、あるいはグラフをトレーニング可能な連続的な埋め込み(ソフトプロンプト)に変換するかのいずれかであるが、元のテキストトークンと深刻なミスアライメントを示す。この問題を解決するために,一種類の特別なトークン<SOG_k>を導入して,グラフの構造を統一されたトークン空間内に完全に表現し,明示的なトポロジ入力と構造情報共有を容易にすることを提案する。具体的には、各グラフトポロジを高度に選択的な単一トークンにマッピングするトポロジ対応構造トークン化器を提案する。その後、新しい構造トークンを既存のテキストトークンと整合させるために、ハイブリッド構造問合せコーパスを構築した。このアプローチにより、<SOG_k>はLLMに簡潔で正確な方法で理解、生成、理性を与える。 5つのグラフレベルのベンチマークによる大規模な実験により,解析性と整合性を示しながら,ベースラインと比較して9.9%から41.4%の性能向上を実現した。さらに,本手法はノードレベルのタスクを柔軟に拡張し,大域的および局所的な構造的理解を可能にする。コードベースはhttps://github.com/Jingyao-Wu/SOGで公開されている。

論文の概要: : One LLM Token for Explicit Graph Structural Understanding

関連論文リスト