Fugu-MT paper translation (abstract): AgentSHAP: Interpreting LLM Agent Tool Importance with Monte Carlo Shapley Value Estimation

Paper overview: AgentSHAP: Interpreting LLM Agent Tool Importance with Monte Carlo Shapley Value Estimation

arxiv url: http://arxiv.org/abs/2512.12597v1
Date: Sun, 14 Dec 2025 08:31:43 GMT
Status: translation complete
Last updated in system: 2025-12-16 17:54:56.337737
Title: AgentSHAP: Interpreting LLM Agent Tool Importance with Monte Carlo Shapley Value Estimation
Title (reference translation): AgentSHAP: Interpreting the Importance of LLM Agent Tools via Monte Carlo Shapley Value Estimation
Authors: Miriam Horovicz,
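Abstract summary: This paper introduces AgentSHAP, the first framework for explaining tool importance in LLM agents. It treats the agent as a black box and works with any LLM, without needing access to internal weights or gradients. The contributions are: (1) the first explainability method for agent tool attribution, grounded in Shapley values from game theory; (2) Monte Carlo sampling that reduces cost from O(2^n) to practical levels; and (3) comprehensive experiments on API-Bank.
Reference score (independently computed attention score): 0.0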
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM agents that use external tools can solve complex tasks, but understanding which tools actually contributed to a response remains a blind spot. No existing XAI methods address tool-level explanations. We introduce AgentSHAP, the first framework for explaining tool importance in LLM agents. AgentSHAP is model-agnostic: it treats the agent as a black box and works with any LLM (GPT, Claude, Llama, etc.) without needing access to internal weights or gradients. Using Monte Carlo Shapley values, AgentSHAP tests how an agent responds with different tool subsets and computes fair importance scores based on game theory. Our contributions are: (1) the first explainability method for agent tool attribution, grounded in Shapley values from game theory; (2) Monte Carlo sampling that reduces cost from O(2^n) to practical levels; and (3) comprehensive experiments on API-Bank showing that AgentSHAP produces consistent scores across runs, correctly identifies which tools matter, and distinguishes relevant from irrelevant tools. AgentSHAP joins TokenSHAP (for tokens) and PixelSHAP (for image regions) to complete a family of Shapley-based XAI tools for modern generative AI. Code: https://github.com/GenAISHAP/TokenSHAP.
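Abstract (reference translation): LLM agents that use external tools can solve complex tasks, but understanding which tools actually contribute to a response is a blind spot. Existing XAI methods do not address tool-level explanations. This paper introduces AgentSHAP, the first framework for explaining tool importance in LLM agents. AgentSHAP is model-agnostic: it treats the agent as a black box and works with any LLM (GPT, Claude, Llama, etc.) without needing access to internal weights or gradients. AgentSHAP uses Monte Carlo Shapley values to test how an agent responds with different tool subsets and computes fair importance scores based on game theory. The contributions are: (1) the first explainability method for agent tool attribution, grounded in Shapley values from game theory; (2) Monte Carlo sampling that reduces cost from O(2^n) to practical levels; and (3) comprehensive experiments on API-Bank showing that AgentSHAP produces consistent scores across runs, correctly identifies which tools matter, and distinguishes relevant from irrelevant tools. AgentSHAP joins TokenSHAP (for tokens) and PixelSHAP (for image regions) to complete a family of Shapley-based XAI tools for modern generative AI. Code: https://github.com/GenAISHAP/TokenSHAP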
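
The abstract describes estimating Shapley values for tools by sampling rather than enumerating all 2^n tool subsets. The sketch below illustrates that general idea with a standard permutation-sampling estimator; it is not taken from the paper or from the TokenSHAP repository, and the sampling scheme AgentSHAP actually uses may differ. The names `monte_carlo_shapley`, `value_fn`, and `toy_value`, as well as the tool names, are hypothetical. In practice `value_fn` would run the agent with only the given tools enabled and score its response; here a deterministic toy function stands in for it.

```python
import random
from typing import Callable, Dict, FrozenSet, List, Optional


def monte_carlo_shapley(
    tools: List[str],
    value_fn: Callable[[FrozenSet[str]], float],
    num_permutations: int = 200,
    seed: Optional[int] = None,
) -> Dict[str, float]:
    """Estimate each tool's Shapley value by averaging its marginal
    contribution over randomly sampled tool orderings."""
    rng = random.Random(seed)
    totals = {tool: 0.0 for tool in tools}
    # Cache subset values; this assumes value_fn is deterministic, which a
    # real LLM agent generally is not (an assumption of this sketch).
    cache: Dict[FrozenSet[str], float] = {}

    def cached_value(subset: FrozenSet[str]) -> float:
        if subset not in cache:
            cache[subset] = value_fn(subset)
        return cache[subset]

    for _ in range(num_permutations):
        order = tools[:]
        rng.shuffle(order)
        coalition: FrozenSet[str] = frozenset()
        previous = cached_value(coalition)
        for tool in order:
            # Marginal contribution of `tool` given the tools added before it.
            coalition = coalition | {tool}
            current = cached_value(coalition)
            totals[tool] += current - previous
            previous = current

    return {tool: totals[tool] / num_permutations for tool in tools}


def toy_value(subset: FrozenSet[str]) -> float:
    """Hypothetical stand-in for 'run the agent with these tools and score
    the response'. The 'weather' tool is irrelevant by construction."""
    score = 0.0
    if "calculator" in subset:
        score += 0.6
    if "search" in subset:
        score += 0.3
    if "calculator" in subset and "search" in subset:
        score += 0.1  # small interaction bonus when both are available
    return score


scores = monte_carlo_shapley(
    ["calculator", "search", "weather"], toy_value, num_permutations=500, seed=0
)
for name, value in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {value:.3f}")
```

With this estimator the scores sum to the value of the full tool set minus the value of the empty set, and a tool that never changes the outcome receives a score of zero, which mirrors the abstract's claim about separating relevant from irrelevant tools. Each sampled ordering costs at most n evaluations of `value_fn`, so the total cost is bounded by the number of permutations times n instead of growing as 2^n.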

Related papers