Fugu-MT 論文翻訳(概要): What is Missing? Explaining Neurons Activated by Absent Concepts

論文の概要: What is Missing? Explaining Neurons Activated by Absent Concepts

arxiv url: http://arxiv.org/abs/2603.09787v1
Date: Tue, 10 Mar 2026 15:21:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-11 15:25:24.422608
Title: What is Missing? Explaining Neurons Activated by Absent Concepts
Title（参考訳）: 失恋とは何か? 絶対的概念によって活性化されるニューロンについて
Authors: Robin Hesse, Simone Schaub-Meyer, Janina Hesse, Bernt Schiele, Stefan Roth,
Abstract要約: XAIは、ディープニューラルネットワーク(DNN)の動作に関する人間解釈可能な洞察を提供することを目指している既存の研究において、この因果構造はしばしば、概念の存在がニューロンの強い活性化と関連している関係を含む。ほとんど見過ごされがちな因果関係は、概念の欠如が神経活動を増加させるエンコードされた欠如であることを示す。
参考スコア（独自算出の注目度）: 57.02903530185128
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Explainable artificial intelligence (XAI) aims to provide human-interpretable insights into the behavior of deep neural networks (DNNs), typically by estimating a simplified causal structure of the model. In existing work, this causal structure often includes relationships where the presence of a concept is associated with a strong activation of a neuron. For example, attribution methods primarily identify input pixels that contribute most to a prediction, and feature visualization methods reveal inputs that cause high activation of a target neuron - the former implicitly assuming that the relevant information resides in the input, and the latter that neurons encode the presence of concepts. However, a largely overlooked type of causal relationship is that of encoded absences, where the absence of a concept increases neural activation. In this work, we show that such missing but relevant concepts are common and that mainstream XAI methods struggle to reveal them when applied in their standard form. To address this, we propose two simple extensions to attribution and feature visualization techniques that uncover encoded absences. Across experiments, we show how mainstream XAI methods can be used to reveal and explain encoded absences, how ImageNet models exploit them, and that debiasing can be improved when considering them.
Abstract（参考訳）: 説明可能な人工知能(XAI)は、ディープニューラルネットワーク(DNN)の振る舞いに関する人間の解釈可能な洞察を提供することを目的としている。既存の研究において、この因果構造はしばしば、概念の存在がニューロンの強い活性化と関連している関係を含む。例えば、属性法は、主に予測に最も寄与する入力ピクセルを識別し、特徴可視化法は、ターゲットニューロンの高い活性化を引き起こす入力を明らかにする。しかし、ほとんど見過ごされがちな因果関係は、概念の欠如が神経活動を増加させるエンコードされた欠如である。本稿では,そのような欠落する概念が一般的であり,標準形式に適用した場合,主流のXAI手法がそれらを明らかにするのに苦慮していることを示す。そこで本研究では,帰属関係の簡易な拡張と,不在を符号化する特徴可視化手法を提案する。実験を通じて、エンコードされた欠如を明らかにし、説明するために、メインストリームのXAIメソッドをどのように使用できるか、ImageNetモデルがそれらをどのように活用するか、そして、それらを考える際にデバイアスを改善することができるかを示す。

論文の概要: What is Missing? Explaining Neurons Activated by Absent Concepts

関連論文リスト