Fugu-MT 論文翻訳(概要): Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

論文の概要: Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

arxiv url: http://arxiv.org/abs/2605.06640v1
Date: Thu, 07 May 2026 17:51:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:12.061533
Title: Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models
Title（参考訳）: 視覚モデルの行動に対する概念に基づく帰納的・コントラスト的説明
Authors: Ronaldo Canizales, Divya Gopinath, Corina Păsăreanu, Ravi Mangal,
Abstract要約: 因果関係を確立するために *concept erasure* プロシージャを使用しながら,すべての最小説明を列挙するアルゴリズム群を提案する。我々は、複数のモデル、データセット、行動に対するアプローチを評価し、そのコンピューティングにおける有効性を示し、ユーザフレンドリな説明を行う。
参考スコア（独自算出の注目度）: 0.9881300744154826
License: http://creativecommons.org/licenses/by/4.0/
Abstract: *Concept-based explanations* offer a promising approach for explaining the predictions of deep neural networks in terms of high-level, human-understandable concepts. However, existing methods either do not establish a causal connection between the concepts and model predictions or are limited in expressivity and only able to infer causal explanations involving single concepts. At the same time, the parallel line of work on *formal abductive and contrastive explanations* computes the minimal set of input features causally relevant for model outcomes but only considers low-level features such as pixels. Merging these two threads, in this work, we propose the notion of *concept-based abductive and contrastive explanations* that capture the minimal sets of high-level concepts causally relevant for model outcomes. We then present a family of algorithms that enumerate all minimal explanations while using *concept erasure* procedures to establish causal relationships. By appropriately aggregating such explanations, we are not only able to understand model predictions on individual images but also on collections of images where the model exhibits a user-specified, common *behavior*. We evaluate our approach on multiple models, datasets, and behaviors, and demonstrate its effectiveness in computing helpful, user-friendly explanations.
Abstract（参考訳）: *概念に基づく説明*は、深いニューラルネットワークの予測を、高レベルで人間に理解可能な概念の観点から説明するための有望なアプローチを提供する。しかし、既存の手法は、概念とモデル予測の間の因果関係を確立していないか、表現性に制限があるか、単一の概念を含む因果的説明しか推論できない。同時に、*形式的帰納的かつ対照的な説明に関する平行線の作業*は、モデル結果に因果的に関係する最小限の入力特徴集合を計算するが、ピクセルのような低レベルの特徴のみを考慮する。この2つのスレッドを組み合わせることで、モデルの結果に因果関係のある最小レベルの概念を捉える*概念ベースの導出的かつ対照的な説明*の概念が提案される。次に, 因果関係を確立するために *concept erasure* プロシージャを使用しながら, 最小限の説明を列挙するアルゴリズム群を提案する。このような説明を適切に集約することにより、個々の画像上のモデル予測だけでなく、モデルがユーザ指定の一般的な *behavior* を示す画像の集合についても理解することができる。我々は、複数のモデル、データセット、行動に対するアプローチを評価し、そのコンピューティングにおける有効性を示し、ユーザフレンドリな説明を行う。

論文の概要: Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

関連論文リスト