Fugu-MT 論文翻訳(概要): Integrated and Cross-Architecture Interpretation of LLM Reasoning

論文の概要: Integrated and Cross-Architecture Interpretation of LLM Reasoning

arxiv url: http://arxiv.org/abs/2605.28006v1
Date: Wed, 27 May 2026 05:56:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:55.786227
Title: Integrated and Cross-Architecture Interpretation of LLM Reasoning
Title（参考訳）: LLM推論の総合的・横断的解釈
Authors: Leonardo Matthew Yauw, Wei-Bin Kou, Yujiu Yang,
Abstract要約: 統合されたクロスアーキテクチャ推論(IAR)フレームワークは、LLM推論の解釈可能性に対する統一的なアプローチを提供するように設計されている。まず、チューキーIQRピーク検出と組み合わされた帯域幅校正MIPを用いて、出力層における推論・クラシカルトークンを分離することを提案する。次に、MIP-pickedトークンと計算深度トークンの重なり解析を行い、それらのトークンの層間軌跡をトレースする。
参考スコア（独自算出の注目度）: 48.58940522466915
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Understanding how LLMs reason is hindered by a practical asymmetry: while their generated outputs are observable, the underlying reasoning patterns remain opaque. Relying on single probes, such as Mutual Information Peak (MIP) or Deep-Thinking Ratio (DTR), risks underestimating the genuine inferential structure. To response this deficiency, we present an Integrated, cross-Architecture Reasoning (IAR) framework, designed to provide a unified approach to LLM reasoning interpretability. Specifically, we first propose to use bandwidth-calibrated MIP coupled with Tukey IQR peak-detection to isolate reasoning-crucial tokens at the output layer. Second, we performed an overlap analysis between MIP-picked tokens and DTR-deep tokens to trace the cross-layer trajectories of those tokens. This also discloses whether reasoning-crucial tokens are computation-intensive as well, further facilitating to understand how reasoning patterns evolve across model layers. Finally, we apply a Jaccard stability metric over multi-domain problems to verify if the MIP-identified tokens are reasoning quality-guaranteed. Extensive experiments on three models (Qwen-7B, Qwen-14B, and Llama-8B) across four domains (mathematics, code, logic, and common sense) demonstrate IAR's generalizable interpretation capabilities across architectures.
Abstract（参考訳）: LLMの推論が実際的な非対称性によってどのように妨げられるかを理解する: 生成された出力は観測可能であるが、基礎となる推論パターンは不透明である。 MIP(Mutual Information Peak)やDTR(Deep-Thinking Ratio)のような単一のプローブを利用すると、真の推論構造を過小評価するリスクがある。この欠陥に対応するために,我々は,LLM推論の解釈可能性に対する統一的なアプローチを提供するために,統合的・クロスアーキテクチャ推論(IAR)フレームワークを提案する。具体的には、まず帯域幅校正MIPとTukey IQRのピーク検出を併用して、出力層における推論・クラシカルトークンを分離することを提案する。次に,MIP-pickedトークンとDTR-deepトークンの重なり解析を行い,これらのトークンの層間軌跡の追跡を行った。これはまた、推論-クラシカルトークンが計算集約的であるかどうかを明らかにし、モデル層間での推論パターンの進化をより容易にする。最後に,マルチドメイン問題に対してJaccardの安定性基準を適用し,MIP識別トークンが品質保証の理由であるかどうかを検証する。 Qwen-7B、Qwen-14B、Llama-8Bの4つの領域(数学、コード、論理、常識)にわたる3つのモデルの大規模な実験は、IARのアーキテクチャ全体にわたる一般化可能な解釈能力を示している。

論文の概要: Integrated and Cross-Architecture Interpretation of LLM Reasoning

関連論文リスト