Fugu-MT 論文翻訳(概要): Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI

論文の概要: Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI

arxiv url: http://arxiv.org/abs/2605.09852v1
Date: Mon, 11 May 2026 01:09:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.452184
Title: Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI
Title（参考訳）: 人工知能(AI)における説明の公正性 : 統一フレームワーク,公理,そして責任あるAIへの今後の方向性
Authors: Gideon Popoola, John Sheppard,
Abstract要約: 機械学習アルゴリズムは、刑事司法、医療、信用、雇用など、高度な意思決定に使われている。研究コミュニティは2つの独立した研究分野に答えている: インペラルゴリトミックフェアネス(英語版)、そして解釈可能な推論をターゲットとする説明可能なAI(英語版)(XAI)である。この調査は、その交点における新しい盲点を特定し、マッピングする。これは、そのアウトプットにおける標準フェアネス基準をすべて満たし、そのエンフェレゾン化過程において極めて不公平であるモデルである。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Machine learning algorithms are being used in high-stakes decisions, including those in criminal justice, healthcare, credit, and employment. The research community has responded with two largely independent research fields: \emph{algorithmic fairness}, which targets equitable outcomes, and \emph{explainable AI} (XAI), which targets interpretable reasoning. This survey identifies and maps a novel blind spot at their intersection, which is a model that can satisfy every standard fairness criterion in its outputs while being profoundly unfair in its \emph{reasoning process}. We refer to this as the procedural bias, and mitigating it requires treating the fairness of explanations as a distinct object of scientific study. To our knowledge, we provide the first unified theoretical and literature review of this emerging field and elucidate the drawbacks of post-hoc explainers in certifying explanation fairness. Our central contribution is a \emph{conditional invariance framework} formalizing explanation fairness as the requirement that explanations should be indifferent regardless of the protected attributes $ P(E(X) \in \cdot \mid X_\text{rel} = x_\text{rel},\, A = a) = P(E(X) \in \cdot \mid X_\text{rel} = x_\text{rel},\, A = b)$ for all task-relevant $x$, a single principle from which all existing explanation fairness metrics emerge as partial operationalizations. We introduce a seven-dimensional taxonomy, identify three generative mechanisms of explanation inequity (representation-driven, explanation-model mismatch, actionability-driven), and propose a canonical six-step evaluation workflow for operationalizing explanation fairness audits in practice.
Abstract（参考訳）: 機械学習アルゴリズムは、刑事司法、医療、信用、雇用など、高度な意思決定に使われている。研究コミュニティは、公平な成果を目標とする「emph{algorithmic fairness}」と、解釈可能な推論をターゲットとする「emph{explainable AI}」(XAI)の2つの大きく独立した研究分野に回答している。このサーベイは、その交点における新しい盲点を特定し、マッピングする。これは、その出力における標準公正基準をすべて満たし、かつ、その 'emph{reasoning} プロセスにおいて極めて不公平であるモデルである。我々はこれを手続き的バイアスと呼び、それを緩和するには説明の公正性を科学研究の別の対象として扱う必要がある。本研究は,本分野の統一的理論的,文献的考察を行い,説明公正性を証明する上でのポストホック解説者の欠点を解明するものである。我々の中心的な貢献は、説明フェアネスを、保護された属性にかかわらず説明が無関心であるという要件として定式化する \emph{conditional invariance framework} である。 a) = P(E(X) \in \cdot \mid X_\text{rel} = x_\text{rel},\, A = b)すべてのタスク関連$x$に対して、既存の説明公正度指標が部分的運用化として出現する単一の原則。本研究では,7次元分類法を導入し,説明不等式(表現駆動型,説明モデルミスマッチ,行動可能性駆動型)の3つの生成機構を同定し,実際に説明公正監査を行うための6段階評価ワークフローを提案する。

論文の概要: Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI

関連論文リスト