Fugu-MT 論文翻訳(概要): Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples

論文の概要: Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples

arxiv url: http://arxiv.org/abs/2509.10929v1
Date: Sat, 13 Sep 2025 18:06:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:22.844168
Title: Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples
Title（参考訳）: モデル透明性の明確化:MNISTおよびIMDBを用いた深層学習における解釈可能性と説明可能性
Authors: Mitali Raj,
Abstract要約: ドキュメントは、ディープラーニングパラダイムにおける解釈可能性と説明可能性の比較調査を提供する。解釈可能性(interpretability)は、人間の操作機構を理解するためのモデル固有の能力に関係している。例えば、特徴属性法は、特定のMNISTイメージが「7」であると認識される理由を明らかにし、単語レベルの重要さはIMDBの感情結果を明らかにすることができる。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The impressive capabilities of deep learning models are often counterbalanced by their inherent opacity, commonly termed the "black box" problem, which impedes their widespread acceptance in high-trust domains. In response, the intersecting disciplines of interpretability and explainability, collectively falling under the Explainable AI (XAI) umbrella, have become focal points of research. Although these terms are frequently used as synonyms, they carry distinct conceptual weights. This document offers a comparative exploration of interpretability and explainability within the deep learning paradigm, carefully outlining their respective definitions, objectives, prevalent methodologies, and inherent difficulties. Through illustrative examinations of the MNIST digit classification task and IMDB sentiment analysis, we substantiate a key argument: interpretability generally pertains to a model's inherent capacity for human comprehension of its operational mechanisms (global understanding), whereas explainability is more commonly associated with post-hoc techniques designed to illuminate the basis for a model's individual predictions or behaviors (local explanations). For example, feature attribution methods can reveal why a specific MNIST image is recognized as a '7', and word-level importance can clarify an IMDB sentiment outcome. However, these local insights do not render the complex underlying model globally transparent. A clear grasp of this differentiation, as demonstrated by these standard datasets, is vital for fostering dependable and sound artificial intelligence.
Abstract（参考訳）: ディープラーニングモデルの印象的な能力は、しばしばその固有の不透明さによって相反するものであり、一般に「ブラックボックス」問題と呼ばれ、高信頼ドメインで広く受け入れられることを妨げる。これに対し、説明可能なAI(XAI)の傘の下に、解釈可能性と説明可能性の交差する分野が研究の焦点となっている。これらの用語はしばしば同義語として使われるが、異なる概念的な重みを持つ。この文書は、深層学習パラダイムにおける解釈可能性と説明可能性について比較検討し、それぞれの定義、目的、一般的な方法論、固有の困難について慎重に概説する。 MNISTの数字分類タスクとIMDBの感情分析の実証的な検証を通じて、重要な議論を裏付ける: 解釈可能性とは、一般に、その操作機構の人間の理解能力(言語的理解)に関連するモデル固有の能力であり、一方、説明可能性とは、モデルの個々の予測や行動の基礎を照らすために設計されたポストホック技術(局所的な説明)とより一般的に関連している。例えば、特徴属性法は、特定のMNISTイメージが「7」であると認識される理由を明らかにし、単語レベルの重要さはIMDBの感情結果を明らかにすることができる。しかし、これらの局所的な洞察は、複雑な基盤となるモデルをグローバルに透明にしない。これらの標準データセットが示すように、この区別を明確に把握することは、信頼性と健全な人工知能の育成に不可欠である。

論文の概要: Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples

関連論文リスト