Fugu-MT 論文翻訳(概要): Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

論文の概要: Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

arxiv url: http://arxiv.org/abs/2512.09103v1
Date: Tue, 09 Dec 2025 20:40:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-11 15:14:53.313859
Title: Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks
Title（参考訳）: ロバストデータ属性の自然幾何学:凸モデルから深部ネットワークへ
Authors: Shihao Li, Jiachen Li, Dongmei Chen,
Abstract要約: コンベックスモデルからディープネットワークへ拡張するロバスト属性の統一フレームワークを提案する。凸設定では、検証可能なカバレッジ保証を備えたW-RIF(Wasserstein-Robust Influence Function)を導出する。ディープネットワークでは、ユークリッド認証がスペクトル増幅によって空白化されることを実証する。
参考スコア（独自算出の注目度）: 9.553350856191743
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Data attribution methods identify which training examples are responsible for a model's predictions, but their sensitivity to distributional perturbations undermines practical reliability. We present a unified framework for certified robust attribution that extends from convex models to deep networks. For convex settings, we derive Wasserstein-Robust Influence Functions (W-RIF) with provable coverage guarantees. For deep networks, we demonstrate that Euclidean certification is rendered vacuous by spectral amplification -- a mechanism where the inherent ill-conditioning of deep representations inflates Lipschitz bounds by over $10{,}000\times$. This explains why standard TRAK scores, while accurate point estimates, are geometrically fragile: naive Euclidean robustness analysis yields 0\% certification. Our key contribution is the Natural Wasserstein metric, which measures perturbations in the geometry induced by the model's own feature covariance. This eliminates spectral amplification, reducing worst-case sensitivity by $76\times$ and stabilizing attribution estimates. On CIFAR-10 with ResNet-18, Natural W-TRAK certifies 68.7\% of ranking pairs compared to 0\% for Euclidean baselines -- to our knowledge, the first non-vacuous certified bounds for neural network attribution. Furthermore, we prove that the Self-Influence term arising from our analysis equals the Lipschitz constant governing attribution stability, providing theoretical grounding for leverage-based anomaly detection. Empirically, Self-Influence achieves 0.970 AUROC for label noise detection, identifying 94.1\% of corrupted labels by examining just the top 20\% of training data.
Abstract（参考訳）: データ属性法は、モデルの予測にどのトレーニング例が原因であるかを識別するが、その分散摂動に対する感度は、実用的な信頼性を損なう。コンベックスモデルからディープネットワークへ拡張可能な,信頼性の高いロバスト属性の統一フレームワークを提案する。凸設定では、検証可能なカバレッジ保証を備えたW-RIF(Wasserstein-Robust Influence Function)を導出する。ディープネットワークでは、ユークリッド認証はスペクトル増幅により空白化されることを実証する -- ディープ表現の固有の悪条件がリプシッツ境界を10$,000\times$で膨らませるメカニズムである。このことは、標準のTRAKスコアが正確な点推定ではあるものの、幾何学的に脆弱である理由を説明している。我々の重要な貢献は、モデル自身の特徴共分散によって誘導される幾何学における摂動を測定するナチュラル・ワッサーシュタイン計量である。これによりスペクトル増幅が排除され、最悪の場合の感度が76\times$に低下し、属性推定が安定化される。 ResNet-18を使用したCIFAR-10では、Nature W-TRAKが、ユークリッドベースラインの0\%と比較して68.7%のランクペアを認定しています。さらに,本分析から生じる自己影響項は,リプシッツ定数が帰属安定性を制御し,レバレッジに基づく異常検出の理論的根拠となることを証明した。経験的に、Self-Influenceは、ラベルノイズ検出のための0.970 AUROCを達成し、トレーニングデータの上位20 %だけを調べることで、破損したラベルの94.1 %を識別する。

論文の概要: Natural Geometry of Robust Data Attribution: From Convex Models to Deep Networks

関連論文リスト