Fugu-MT Paper Translation (Abstract): CLIF: Concept-Level Influence Functions for Transparent Bottleneck Models

Paper summary: CLIF: Concept-Level Influence Functions for Transparent Bottleneck Models

arxiv url: http://arxiv.org/abs/2605.19848v2
Date: Sat, 23 May 2026 09:32:26 GMT
Status: Translation complete
Last updated in system: 2026-05-26 16:32:37.760184
Title: CLIF: Concept-Level Influence Functions for Transparent Bottleneck Models
Authors: Yike Sun, Mingkun Xu, Mu You, Zhongzhi He, Henghua Shen, Zehan Tan, Derek F. Wong, Tao Fang
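Abstract summary: We propose a novel approach that uses influence functions to improve the interpretability of NLP models at both the sample and concept levels. Experiments on the CEBaB and Yelp datasets show that influence functions effectively identify the most influential training samples.
Reference score (independently computed attention score): 31.932529831600558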
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: In recent years, the black-box nature of deep learning models has limited their application in high-stakes domains such as medical diagnosis and finance, where interpretability is essential. To address this, we propose a novel approach using influence functions to enhance interpretability in NLP models at both the sample and concept levels. Experiments on CEBaB and Yelp datasets show that influence functions effectively identify the most impactful training samples, both helpful and harmful, on model predictions. By adjusting the labels and weights of these samples, we demonstrate that model performance can be restored to baseline levels without retraining, confirming the value of influence functions for efficient data debugging. Furthermore, our concept-level analysis identifies key concepts within Concept Bottleneck Models (CBM) that significantly affect predictions. Modifying these concepts alters model behavior observably, providing clear insights into the decision process.
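The abstract does not spell out how influence scores are computed. As a rough illustration only, the sketch below applies the standard first-order influence approximation (in the style of Koh and Liang, 2017) to a toy L2-regularized logistic regression. It is not the paper's CBM method: the synthetic data, the regularization strength `l2`, and the function names `fit_logreg` and `influence_on_test_loss` are all assumptions made for this example.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def fit_logreg(X, y, l2=0.1, steps=50):
    """Fit L2-regularized logistic regression with Newton's method (toy model)."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(steps):
        p = sigmoid(X @ w)
        grad = X.T @ (p - y) / n + l2 * w
        hess = (X.T * (p * (1 - p))) @ X / n + l2 * np.eye(d)
        w -= np.linalg.solve(hess, grad)
    return w

def influence_on_test_loss(X, y, w, x_test, y_test, l2=0.1):
    """Approximate effect on the test loss of upweighting each training sample.

    Computes -grad_test^T H^{-1} grad_i for every training sample i.
    Negative score: upweighting the sample lowers the test loss (helpful).
    Positive score: upweighting the sample raises the test loss (harmful).
    """
    n, d = X.shape
    p = sigmoid(X @ w)
    # Hessian of the regularized training objective at w.
    hess = (X.T * (p * (1 - p))) @ X / n + l2 * np.eye(d)
    # Per-sample loss gradients (regularizer excluded), shape (n, d).
    grad_train = (p - y)[:, None] * X
    grad_test = (sigmoid(x_test @ w) - y_test) * x_test
    return -grad_train @ np.linalg.solve(hess, grad_test)

# Synthetic data, purely illustrative (assumption, not from the paper).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = (X @ np.array([1.5, -2.0, 0.5]) + 0.3 * rng.normal(size=200) > 0).astype(float)

w = fit_logreg(X, y)
# Use the first training point as a stand-in test point.
scores = influence_on_test_loss(X, y, w, X[0], y[0])
most_helpful = np.argsort(scores)[:5]   # most negative scores
most_harmful = np.argsort(scores)[-5:]  # most positive scores
```

Under these assumptions, the samples in `most_harmful` would be the natural candidates for the label or weight adjustments the abstract mentions, though the paper's actual procedure may differ.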


Related papers