Fugu-MT 論文翻訳(概要): Analyzing Error Sources in Global Feature Effect Estimation

論文の概要: Analyzing Error Sources in Global Feature Effect Estimation

arxiv url: http://arxiv.org/abs/2603.15057v2
Date: Tue, 17 Mar 2026 10:23:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 13:19:43.957162
Title: Analyzing Error Sources in Global Feature Effect Estimation
Title（参考訳）: 大域的特徴量効果推定における誤差源の解析
Authors: Timo Heiß, Coco Bögel, Bernd Bischl, Giuseppe Casalicchio,
Abstract要約: 部分依存(PD)や累積局所効果(ALE)プロットのようなグローバルな特徴効果はブラックボックスモデルの解釈に広く用いられている。グローバルな特徴効果の人気にもかかわらず、エラーソースはほとんど探索されていない。 PDとALLのバイアスと分散の源をアンタングルする系統的,推定器レベルの分析を行う。
参考スコア（独自算出の注目度）: 11.864330844302403
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Global feature effects such as partial dependence (PD) and accumulated local effects (ALE) plots are widely used to interpret black-box models. However, they are only estimates of true underlying effects, and their reliability depends on multiple sources of error. Despite the popularity of global feature effects, these error sources are largely unexplored. In particular, the practically relevant question of whether to use training or holdout data to estimate feature effects remains unanswered. We address this gap by providing a systematic, estimator-level analysis that disentangles sources of bias and variance for PD and ALE. To this end, we derive a mean-squared-error decomposition that separates model bias, estimation bias, model variance, and estimation variance, and analyze their dependence on model characteristics, data selection, and sample size. We validate our theoretical findings through an extensive simulation study across multiple data-generating processes, learners, estimation strategies (training data, validation data, and cross-validation), and sample sizes. Our results reveal that, while using holdout data is theoretically the cleanest, potential biases arising from the training data are empirically negligible and dominated by the impact of the usually higher sample size. The estimation variance depends on both the presence of interactions and the sample size, with ALE being particularly sensitive to the latter. Cross-validation-based estimation is a promising approach that reduces the model variance component, particularly for overfitting models. Our analysis provides a principled explanation of the sources of error in feature effect estimates and offers concrete guidance on choosing estimation strategies when interpreting machine learning models.
Abstract（参考訳）: 部分依存(PD)や累積局所効果(ALE)プロットのようなグローバルな特徴効果はブラックボックスモデルの解釈に広く用いられている。しかし、それらは真の基礎となる効果を見積もるだけであり、その信頼性は複数のエラー源に依存する。グローバルな特徴効果の人気にもかかわらず、これらのエラー源はほとんど探索されていない。特に、特徴効果を見積もるためにトレーニングやホールドアウトデータを使うべきかという現実的な疑問は未解決のままである。我々は,PDとALEのバイアスと分散の源をアンハングルする系統的,推定器レベルの分析を提供することにより,このギャップに対処する。この目的のために、モデルバイアス、推定バイアス、モデル分散、推定分散を分離する平均二乗誤差分解を導出し、モデル特性、データ選択、サンプルサイズへの依存性を分析する。我々は,複数のデータ生成プロセス,学習者,推定戦略(トレーニングデータ,検証データ,クロスバリデーション),サンプルサイズに関する広範なシミュレーション研究を通じて,理論的な知見を検証した。以上の結果から, ホルドアウトデータが最もクリーンである一方, トレーニングデータから生じる潜在的なバイアスは, 経験的に無視可能であり, 通常より高いサンプルサイズの影響が支配的であることが明らかとなった。推定分散は相互作用の存在とサンプルサイズの両方に依存し、ALEは後者に特に敏感である。クロスバリデーションに基づく推定は、特に過適合モデルにおいて、モデルの分散成分を減少させる有望なアプローチである。本分析では,特徴効果推定における誤差源の原理的説明と,機械学習モデルの解釈における推定方法の選択に関する具体的なガイダンスを提供する。

論文の概要: Analyzing Error Sources in Global Feature Effect Estimation

関連論文リスト