Fugu-MT 論文翻訳(概要): XtrAIn: Training-Guided Occlusion for Feature Attribution

論文の概要: XtrAIn: Training-Guided Occlusion for Feature Attribution

arxiv url: http://arxiv.org/abs/2606.10877v1
Date: Tue, 09 Jun 2026 13:52:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-10 15:40:58.53266
Title: XtrAIn: Training-Guided Occlusion for Feature Attribution
Title（参考訳）: XtrAIn: 特徴属性のためのトレーニングガイドオクルージョン
Authors: Thodoris Lymperopoulos, Ioannis Kakogeorgiou, Denia Kanellopoulou,
Abstract要約: Occlusion-based Attribution Methodは、入力特徴を摂動させ、結果のモデル出力の変化を測定することにより、特徴の重要度を推定することができる。本稿では,入力空間からパラメータ空間へのオクルージョン操作を伝達する学習誘導帰属法であるXtrAInを紹介する。 XtrAInは標準的な属性ベースラインよりもクリーンで解釈可能な属性パターンを生成する。
参考スコア（独自算出の注目度）: 3.3609128464006015
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Occlusion-based attribution methods provide an intuitive way to estimate feature importance by perturbing input features and measuring the resulting change in model output. However, their reliability is strongly affected by how feature removal is implemented: externally selected baselines can introduce bias, out-of-distribution samples, and unstable explanations, while in nonlinear models the occlusion of a set of features can also alter the contribution of non-occluded features. We refer to this effect as attribution shift, as the attribution scores of the non-occluded features drift from their initial values. To challenge these major issues that render explanations unstable, we introduce XtrAIn, a training-guided attribution method that transfers the occlusion operation from the input space to the parameter space. Instead of replacing input values with hand-crafted baselines, XtrAIn follows the model's training trajectory and measures how feature-associated parameter updates affect the output logits. We further introduce Xstep, a lightweight approximation for reducing computational cost, and XtrAIn+, a target-focused variant that emphasizes updates aligned with the target class. Experiments on controlled image datasets and PAM50 breast-cancer subtype classification show that the proposed methods produce cleaner and more interpretable attribution patterns than standard attribution baselines. Overall, XtrAIn provides a training-aware perspective on feature attribution and offers a useful diagnostic tool for studying how feature-level evidence is formed during training.
Abstract（参考訳）: Occlusion-based Attribution Methodは、入力特徴を摂動させ、結果のモデル出力の変化を測定することによって、特徴の重要度を推定する直感的な方法を提供する。外部選択されたベースラインはバイアス、分布外サンプル、不安定な説明を導入でき、非線形モデルでは、一連の特徴の閉塞は非閉塞的特徴の寄与を変える。この効果を帰属シフト(Atribution shift)と呼び、非排他的特徴の帰属スコアが初期値からドリフトする。説明が不安定なこれらの主要な問題に対処するために,入力空間からパラメータ空間へのオクルージョン操作を伝達するトレーニング誘導帰属法であるXtrAInを導入する。入力値を手作りのベースラインに置き換える代わりに、XtrAInはモデルのトレーニング軌道に従い、特徴関連パラメータ更新が出力ロジットにどのように影響するかを測定する。さらに、計算コストを削減する軽量な近似であるXstepと、対象クラスと整合した更新を強調するターゲット中心の変種であるXtrAIn+についても紹介する。制御された画像データセットとPAM50乳がんサブタイプ分類の実験により,提案手法は標準属性ベースラインよりもクリーンで解釈可能な属性パターンを生成することが示された。全体として、XtrAInは、機能属性に関するトレーニング対応の視点を提供し、トレーニング中に機能レベルのエビデンスがどのように形成されるかを研究するための有用な診断ツールを提供する。

論文の概要: XtrAIn: Training-Guided Occlusion for Feature Attribution

関連論文リスト