Fugu-MT 論文翻訳(概要): A Unified Taylor Framework for Revisiting Attribution Methods

論文の概要: A Unified Taylor Framework for Revisiting Attribution Methods

arxiv url: http://arxiv.org/abs/2008.09695v3
Date: Tue, 13 Apr 2021 09:00:51 GMT
ステータス: 翻訳完了
システム内更新日: 2022-10-26 20:44:55.965868
Title: A Unified Taylor Framework for Revisiting Attribution Methods
Title（参考訳）: 帰属法を再検討するための統一taylorフレームワーク
Authors: Huiqi Deng, Na Zou, Mengnan Du, Weifu Chen, Guocan Feng, and Xia Hu
Abstract要約: 我々はTaylor属性フレームワークを提案し、7つの主流属性メソッドをフレームワークに再構成する。我々はTaylor属性フレームワークにおいて、良い属性の3つの原則を確立する。
参考スコア（独自算出の注目度）: 49.03783992773811
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Attribution methods have been developed to understand the decision-making process of machine learning models, especially deep neural networks, by assigning importance scores to individual features. Existing attribution methods often built upon empirical intuitions and heuristics. There still lacks a general and theoretical framework that not only can unify these attribution methods, but also theoretically reveal their rationales, fidelity, and limitations. To bridge the gap, in this paper, we propose a Taylor attribution framework and reformulate seven mainstream attribution methods into the framework. Based on reformulations, we analyze the attribution methods in terms of rationale, fidelity, and limitation. Moreover, We establish three principles for a good attribution in the Taylor attribution framework, i.e., low approximation error, correct contribution assignment, and unbiased baseline selection. Finally, we empirically validate the Taylor reformulations and reveal a positive correlation between the attribution performance and the number of principles followed by the attribution method via benchmarking on real-world datasets.
Abstract（参考訳）: 個々の特徴に重要なスコアを割り当てることで、機械学習モデル、特にディープニューラルネットワークの決定過程を理解するために属性手法が開発された。既存の帰属法はしばしば経験的直観とヒューリスティックに基づいている。これらの帰属法を統一できるだけでなく、理論的にその合理性、忠実性、限界を明らかにするという一般的かつ理論的枠組みがまだ欠けている。本稿では,このギャップを埋めるために,Taylor属性フレームワークを提案し,7つの主流属性メソッドをフレームワークに再構成する。改定に基づき, 合理的, 忠実度, 限界度の観点から帰属法を解析する。さらに,taylorアトリビューションフレームワークにおける優れた帰属のための3つの原則,すなわち低近似誤差,正しい貢献割り当て,偏りのないベースライン選択を定式化する。最後に,taylor改革の有効性を実証的に検証し,実世界のデータセットのベンチマークによる帰属性能と原則数との正の相関を明らかにする。

関連論文リスト

Feature Attribution from First Principles [6.836945436656676]
あらゆる特徴帰属メソッドが満たすべき公理的フレームワークは、しばしば制限的すぎると我々は主張する。公理を課すのではなく、最も単純なモデルに対する属性を定義することから始める。深部ReLUネットワークの帰属を表すクローズドフォーム式を導出し,評価指標の最適化に向けて一歩踏み出した。
論文参考訳（メタデータ） (2025-05-30T15:53:11Z)
Discrete Markov Bridge [93.64996843697278]
離散マルコフブリッジと呼ばれる離散表現学習に特化して設計された新しいフレームワークを提案する。私たちのアプローチは、Matrix LearningとScore Learningの2つの重要なコンポーネントの上に構築されています。
論文参考訳（メタデータ） (2025-05-26T09:32:12Z)
Evaluating Human Alignment and Model Faithfulness of LLM Rationale [66.75309523854476]
大規模言語モデル(LLM)が,その世代を理論的にどのように説明するかを考察する。提案手法は帰属に基づく説明よりも「偽り」が少ないことを示す。
論文参考訳（メタデータ） (2024-06-28T20:06:30Z)
Backdoor-based Explainable AI Benchmark for High Fidelity Evaluation of Attribution Methods [49.62131719441252]
属性法は入力特徴の重要度を計算し、深層モデルの出力予測を説明する。本研究はまず,属性手法の信頼性ベンチマークが満たすであろう信頼度基準の集合を同定する。次に、望ましい忠実度基準に準拠したBackdoorベースのeXplainable AIベンチマーク(BackX)を紹介します。
論文参考訳（メタデータ） (2024-05-02T13:48:37Z)
The Weighted M\"obius Score: A Unified Framework for Feature Attribution [17.358276581599643]
特徴属性は、各特徴が予測に与える影響を特定することによって、ブラックボックスモデルの予測の背後にある理由を説明することを目的としている。統一されたフレームワークの欠如は、直接的に比較できないメソッドの急増につながった。本稿では,パラメータ化属性フレームワークである重み付きM"obius Scoreを提案する。
論文参考訳（メタデータ） (2023-05-16T06:27:27Z)
Learning Against Distributional Uncertainty: On the Trade-off Between Robustness and Specificity [29.672383320615218]
本稿では,3つのアプローチを統一し,上記の課題に対処する新たな枠組みについて検討する。新しいモデルは、目に見えないデータとトレーニングデータへの特異性の間のトレードオフを明らかにする。実世界の様々なタスクの実験は、提案した学習フレームワークの優位性を検証する。
論文参考訳（メタデータ） (2023-01-31T11:33:18Z)
The Open-World Lottery Ticket Hypothesis for OOD Intent Classification [68.93357975024773]
我々はOODに対するモデル過信の根本的な原因を明かした。 Lottery Ticket仮説も,オープンワールドシナリオに拡張しています。
論文参考訳（メタデータ） (2022-10-13T14:58:35Z)
A General Taylor Framework for Unifying and Revisiting Attribution Methods [36.34893316038053]
本稿では,その帰属問題を連立における個人報酬の決定方法としてモデル化したTaylor Attributionフレームワークを提案する。我々はTaylor属性フレームワークにおいて、良い属性の3つの原則を確立する。
論文参考訳（メタデータ） (2021-05-28T13:57:16Z)
Do Feature Attribution Methods Correctly Attribute Features? [5.58592454173439]
特徴帰属法は、解釈可能な機械学習で非常に人気がある。属性」の定義に関する合意はありません。塩分マップ,合理性,注意の3つの方法を評価した。
論文参考訳（メタデータ） (2021-04-27T20:35:30Z)
Learning Causal Semantic Representation for Out-of-Distribution Prediction [125.38836464226092]
因果推論に基づく因果意味生成モデル(CSG)を提案し,その2つの要因を別々にモデル化する。 CSGはトレーニングデータに適合させることで意味的因子を識別できることを示し、この意味的識別はOOD一般化誤差の有界性を保証する。
論文参考訳（メタデータ） (2020-11-03T13:16:05Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。