Fugu-MT 論文翻訳(概要): Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning

論文の概要: Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning

arxiv url: http://arxiv.org/abs/2605.15648v1
Date: Fri, 15 May 2026 06:04:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-18 17:44:16.304291
Title: Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning
Title（参考訳）: DP-SGDのセキュリティを再考する: 微分プライベート機械学習の正解分析
Authors: Wenhao Wang, Shujie Cui, Hui Cui, Xingliang Yuan,
Abstract要約: Differentially Private Gradient Descent (DP-SGD)は機械学習のトレーニングデータを保護するために広く使われている。我々は,DP-SGD のプライバシー保証を,期待平均 SGM と Batch平均 SGM の定式化に基づいて再解析する。我々の理論的結果は、これらの保証が標準のSGMベースの保証よりも弱いことを示し、これは真のプライバシー漏洩が一部の政権で報告された保証を上回る可能性があることを示唆している。
参考スコア（独自算出の注目度）: 16.83879548734828
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Differentially Private Stochastic Gradient Descent (DP-SGD) is widely used to protect training data in machine learning. Its privacy guarantee is commonly analyzed through a security game in which an adversary infers whether a target record is included in the training dataset from the mechanism output. The resulting privacy leakage is characterized by a privacy curve, which reports the false negative rate as a function of the false positive rate. We identify a mismatch between this formal analysis and common DP-SGD implementations. Existing analyses often model DP-SGD and its variants as the Subsampled Gaussian Mechanism (SGM), where Gaussian noise is added to the sum of clipped gradients computed from a Poisson-sampled batch. In practice, however, many implementations apply an additional normalization step: the noisy gradient sum is divided either by the expected batch size or by the sampled batch size. These mechanisms are therefore better formalized as the Expected-Averaged SGM (EASGM) or the Batch-Averaged SGM (ASGM), respectively. We re-analyze the privacy guarantees of DP-SGD under the EASGM and ASGM formulations. Our theoretical results show that these guarantees can be weaker than the standard SGM-based guarantee, implying that the true privacy leakage may exceed the reported guarantee in some regimes. We further audit four state-of-the-art DP-SGD implementations, including Meta's Opacus library, and observe empirical leakage beyond the SGM-based guarantees. Finally, we audit Opacus versions v0.9.0 to v1.5.4 and derive a corrected privacy guarantee for the latest implementation.
Abstract（参考訳）: Differentially Private Stochastic Gradient Descent (DP-SGD)は機械学習のトレーニングデータを保護するために広く使われている。そのプライバシー保証は、その機構出力からトレーニングデータセットにターゲットレコードが含まれているかどうかを敵が推測するセキュリティゲームを通じて一般的に分析される。結果として生じるプライバシー漏洩は、偽陰性率を偽陽性率の関数として報告するプライバシー曲線によって特徴づけられる。この形式解析とDP-SGD実装のミスマッチを同定する。既存の分析は、しばしばDP-SGDとその変種をサブサンプリングガウス機構(SGM)としてモデル化し、ガウスノイズをポアソンサンプリングバッチから計算したクリッピング勾配の和に追加する。ノイズ勾配の和は、期待されるバッチサイズまたはサンプリングされたバッチサイズによって分割される。したがって、これらの機構は、それぞれ予測平均SGM(EASGM)またはバッチ平均SGM(ASGM)としてより形式化されている。 EASGMとASGMの定式化の下で,DP-SGDのプライバシー保証を再分析する。我々の理論的結果は、これらの保証が標準のSGMベースの保証よりも弱いことを示し、これは真のプライバシー漏洩が一部の政権で報告された保証を上回る可能性があることを示唆している。我々はさらに,MetaのOpacusライブラリを含む,最先端のDP-SGD実装4つを監査し,SGMベースの保証以上の経験的漏洩を観察する。最後に、Opacusのバージョンv0.9.0からv1.5.4を監査し、最新の実装に対する修正されたプライバシ保証を導出する。

論文の概要: Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning

関連論文リスト