Fugu-MT 論文翻訳(概要): On Reliability of Efficient Membership Inference Vulnerability Evaluation

論文の概要: On Reliability of Efficient Membership Inference Vulnerability Evaluation

arxiv url: http://arxiv.org/abs/2605.25819v1
Date: Mon, 25 May 2026 13:16:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:20.129382
Title: On Reliability of Efficient Membership Inference Vulnerability Evaluation
Title（参考訳）: 効率的なメンバーシップ推論脆弱性評価の信頼性について
Authors: Joonas Jälkö, Gauri Pradhan, Ossi Räisä, Antti Honkela,
Abstract要約: メンバーシップ推論攻撃(MIA)は、トレーニングデータ中の機密情報の漏洩を経験的に評価するための一般的な手法である。 MIAスコアに基づくTPRの評価は、複数の個人と複数のターゲットモデルに対して平均化されることが多いことを示す。異なるサンプル間でFPRを効果的に校正するための後処理手法を提案する。
参考スコア（独自算出の注目度）: 7.327728640900649
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Membership inference attacks (MIAs) are popular methods for empirically assessing the leakage of sensitive information in the training data through models or statistics learned from the data. The MIA vulnerability is often evaluated through false positive rate (FPR) and true positive rate (TPR) of a binary classifier that tries to predict whether a particular sample was in the training data. However, in order to reliably estimate the TPR especially for low FPR values, a lot of observations are needed, which in case of MIA translates to many target models, leading to large computational cost. To avoid excessive compute requirements, the MIA scores are often averaged over multiple individuals and multiple targeted models. We demonstrate two key weaknesses in this efficient MIA evaluation pipeline. First, we show that evaluating the TPR based on MIA scores concatenated across multiple individuals, commonly used to study vulnerabilities in the very low FPR regime, is not calibrated across the per-sample FPRs. This makes it unreliable as a tool for auditing differential privacy. To solve this, we propose a post-processing method to effectively calibrate the FPR across different samples. Second, we identify a finite population bias in the commonly used efficient likelihood-ratio attack (LiRA) implementation proposed by Carlini et al. 2022, leading to a positive bias in the per-sample vulnerability.
Abstract（参考訳）: メンバーシップ推論攻撃(MIA)は、トレーニングデータから得られたモデルや統計を通じて、トレーニングデータの機密情報の漏洩を経験的に評価する一般的な方法である。 MIA脆弱性は、トレーニングデータに含まれる特定のサンプルを予測しようとするバイナリ分類器の偽陽性率(FPR)と真陽性率(TPR)によって評価されることが多い。しかし、特に低いFPR値に対してTPRを確実に推定するには、多くの観測が必要であり、MIAの場合、多くのターゲットモデルに変換され、計算コストが大きくなる。過剰な計算要求を避けるため、MIAスコアは複数の個人と複数のターゲットモデルで平均化されることが多い。この効率的なMIA評価パイプラインでは,2つの重要な弱点が示される。まず、非常に低いFPR体制の脆弱性を研究するために一般的に使用されるMIAスコアに基づいてTPRを評価することは、サンプルごとのFPRに対して校正されないことを示す。これにより、差分プライバシーを監査するツールとして信頼性が低い。そこで本研究では,異なるサンプル間でFPRを効果的に校正するための後処理手法を提案する。第2に、Carliiniらによって提案されたLiRA実装における有限集団バイアスを同定し、サンプル当たりの脆弱性に正のバイアスを与える。

論文の概要: On Reliability of Efficient Membership Inference Vulnerability Evaluation

関連論文リスト