Fugu-MT 論文翻訳(概要): Visual Graph Scaffolds for Structural Reasoning in Large Language Models

論文の概要: Visual Graph Scaffolds for Structural Reasoning in Large Language Models

arxiv url: http://arxiv.org/abs/2606.02673v1
Date: Mon, 01 Jun 2026 12:17:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:04.508815
Title: Visual Graph Scaffolds for Structural Reasoning in Large Language Models
Title（参考訳）: 大規模言語モデルにおける構造推論のためのビジュアルグラフスカッホールド
Authors: Runlin Lei, Xiaokui Xiao, Zhewei Wei,
Abstract要約: グラフは、大きな言語モデル(LLM)を強化するために使われてきた。本稿では, LLM のグラフの値は情報提供だけでなく, 組織的推論にも当てはまる。人間はグラフ構造化マインドマップを使って、枝分かれや収束する思考を整理し、グラフが推論支援の内的形態として機能するかどうかを問う。
参考スコア（独自算出の注目度）: 49.507575509154385
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Graphs have been used to enhance large language models (LLMs) for structured reasoning, mostly as external knowledge sources are provided to models at test time. In this paper, we take a different view: the value of graphs for LLMs lie not only in supplying information, but also in organizing reasoning. Inspired by how humans use graph-structured mind maps to organize branching and converging thoughts, we ask whether graphs can serve as an internal form of reasoning assistance. We study this question on multi-hop question answering tasks, where teacher-provided reasoning traces are rewritten as graph mind maps and used to guide a student model. Our experiments reveal a clear modality gap. When graph structures are flattened into text, their benefits become limited once direct answer hints are removed. Under this abstract guidance setting, both reasoning efficiency and answer quality degrade substantially. In contrast, visual graph guidance remains effective without direct answer clues, and its advantage persists after supervised fine-tuning and KL-based distillation. The above findings support the claim that graphs should be studied not only as external knowledge structures for LLMs, but also as visual scaffolds for organizing reasoning.
Abstract（参考訳）: グラフは構造化推論のための大きな言語モデル(LLM)を強化するために使われてきた。本稿では, LLM のグラフの値は情報提供だけでなく, 組織的推論にも当てはまる。人間はグラフ構造化マインドマップを使って、枝分かれや収束する思考を整理し、グラフが推論支援の内的形態として機能するかどうかを問う。本研究では,教師が提供する推論トレースをグラフマインドマップとして書き直し,学生モデル案内に用いるマルチホップ質問応答タスクについて検討する。私たちの実験は明らかなモダリティギャップを明らかにします。グラフ構造がテキストにフラット化されると、直接回答ヒントが取り除かれると、そのメリットは制限される。この抽象的なガイダンス設定の下では、推論効率と回答品質の両方が大幅に低下する。対照的に、視覚グラフ誘導は直接答えの手がかりなしに有効であり、その優位性は教師付き微調整とKLベースの蒸留によって持続する。以上の結果は、グラフはLLMの外部知識構造としてだけでなく、推論を整理するための視覚的な足場として研究されるべきである、という主張を支持している。

論文の概要: Visual Graph Scaffolds for Structural Reasoning in Large Language Models

関連論文リスト