Fugu-MT 論文翻訳(概要): Informative Post-Hoc Explanations Only Exist for Simple Functions

論文の概要: Informative Post-Hoc Explanations Only Exist for Simple Functions

arxiv url: http://arxiv.org/abs/2508.11441v1
Date: Fri, 15 Aug 2025 12:46:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-18 14:51:23.952094
Title: Informative Post-Hoc Explanations Only Exist for Simple Functions
Title（参考訳）: 単純な関数にのみ存在するインフォームティブなポストホック説明
Authors: Eric Günther, Balázs Szabados, Robi Bhattacharjee, Sebastian Bordt, Ulrike von Luxburg,
Abstract要約: 本稿では、意思決定機能に関する情報を提供するための説明のための、一般的な学習理論に基づくフレームワークを紹介する。複雑な決定関数に適用した場合,多くの一般的な説明アルゴリズムは有益ではないことを示す。我々は、これらのアルゴリズムの実用性、特に監査、規制、AIのリスクの高い応用に強く影響していると論じている。
参考スコア（独自算出の注目度）: 12.017822772474576
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Many researchers have suggested that local post-hoc explanation algorithms can be used to gain insights into the behavior of complex machine learning models. However, theoretical guarantees about such algorithms only exist for simple decision functions, and it is unclear whether and under which assumptions similar results might exist for complex models. In this paper, we introduce a general, learning-theory-based framework for what it means for an explanation to provide information about a decision function. We call an explanation informative if it serves to reduce the complexity of the space of plausible decision functions. With this approach, we show that many popular explanation algorithms are not informative when applied to complex decision functions, providing a rigorous mathematical rejection of the idea that it should be possible to explain any model. We then derive conditions under which different explanation algorithms become informative. These are often stronger than what one might expect. For example, gradient explanations and counterfactual explanations are non-informative with respect to the space of differentiable functions, and SHAP and anchor explanations are not informative with respect to the space of decision trees. Based on these results, we discuss how explanation algorithms can be modified to become informative. While the proposed analysis of explanation algorithms is mathematical, we argue that it holds strong implications for the practical applicability of these algorithms, particularly for auditing, regulation, and high-risk applications of AI.
Abstract（参考訳）: 多くの研究者は、複雑な機械学習モデルの振る舞いに関する洞察を得るために、局所的なポストホックな説明アルゴリズムを使うことができることを示唆している。しかし、そのようなアルゴリズムに関する理論的保証は単純な決定関数に対してのみ存在し、複雑なモデルに対して同様の仮定結果が存在するかどうかは不明である。本稿では、意思決定関数に関する情報を提供するための説明のための、一般的な学習理論に基づくフレームワークを紹介する。妥当な決定関数の空間の複雑さを減らすのに役立つと説明する。このアプローチにより、複雑な決定関数に適用した場合、多くの一般的な説明アルゴリズムは、どんなモデルでも説明できるという考え方を厳密な数学的に拒絶することを示す。次に、異なる説明アルゴリズムが情報となる条件を導出する。これらはしばしば予想よりも強い。例えば、勾配説明や反事実説明は微分可能関数の空間に関して非形式的であり、SHAPとアンカー説明は決定木の空間に関して情報的ではない。これらの結果に基づいて、説明アルゴリズムを情報化するためにどのように修正するかについて議論する。提案する説明アルゴリズムの解析は数学的であるが,特に監査,規制,リスクの高いAI応用において,これらのアルゴリズムの実践的適用性に強い意味があることを論じる。

論文の概要: Informative Post-Hoc Explanations Only Exist for Simple Functions

関連論文リスト