Fugu-MT 論文翻訳(概要): Reconstructing Training Data with Informed Adversaries

論文の概要: Reconstructing Training Data with Informed Adversaries

arxiv url: http://arxiv.org/abs/2201.04845v1
Date: Thu, 13 Jan 2022 09:19:25 GMT
ステータス: 翻訳完了
システム内更新日: 2022-01-14 22:33:17.461063
Title: Reconstructing Training Data with Informed Adversaries
Title（参考訳）: インフォームド・アドバイザによるトレーニングデータの再構築
Authors: Borja Balle, Giovanni Cherubin, Jamie Hayes
Abstract要約: 機械学習モデルへのアクセスを考えると、敵はモデルのトレーニングデータを再構築できるだろうか? 本研究は、この疑問を、学習データポイントの全てを知っている強力な情報提供者のレンズから研究する。この厳密な脅威モデルにおいて、残りのデータポイントを再構築することは可能であることを示す。
参考スコア（独自算出の注目度）: 30.138217209991826
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Given access to a machine learning model, can an adversary reconstruct the model's training data? This work studies this question from the lens of a powerful informed adversary who knows all the training data points except one. By instantiating concrete attacks, we show it is feasible to reconstruct the remaining data point in this stringent threat model. For convex models (e.g. logistic regression), reconstruction attacks are simple and can be derived in closed-form. For more general models (e.g. neural networks), we propose an attack strategy based on training a reconstructor network that receives as input the weights of the model under attack and produces as output the target data point. We demonstrate the effectiveness of our attack on image classifiers trained on MNIST and CIFAR-10, and systematically investigate which factors of standard machine learning pipelines affect reconstruction success. Finally, we theoretically investigate what amount of differential privacy suffices to mitigate reconstruction attacks by informed adversaries. Our work provides an effective reconstruction attack that model developers can use to assess memorization of individual points in general settings beyond those considered in previous works (e.g. generative language models or access to training gradients); it shows that standard models have the capacity to store enough information to enable high-fidelity reconstruction of training data points; and it demonstrates that differential privacy can successfully mitigate such attacks in a parameter regime where utility degradation is minimal.
Abstract（参考訳）: 機械学習モデルへのアクセスが与えられると、敵はモデルのトレーニングデータを再構築できるか? この研究は、すべてのトレーニングデータポイントを知っている強力な知識のある敵のレンズからこの問題を研究する。具体的な攻撃をインスタンス化することにより、この厳密な脅威モデルにおける残りのデータポイントを再構築できることを示す。凸モデル(例えばロジスティック回帰)では、再構成攻撃は単純であり、閉形式で導出することができる。より一般的なモデル(例えばニューラルネットワーク)に対しては、攻撃対象のモデルの重みを入力として受け取り、ターゲットのデータポイントを出力する再構成器ネットワークのトレーニングに基づく攻撃戦略を提案する。我々は,MNIST と CIFAR-10 で訓練された画像分類器に対する攻撃の有効性を実証し,標準的な機械学習パイプラインのどの要素が再構築の成功に影響を与えるかを体系的に検討した。最後に,情報提供者によるリコンストラクション攻撃を緩和するためのプライバシーの差異について理論的に検討する。 Our work provides an effective reconstruction attack that model developers can use to assess memorization of individual points in general settings beyond those considered in previous works (e.g. generative language models or access to training gradients); it shows that standard models have the capacity to store enough information to enable high-fidelity reconstruction of training data points; and it demonstrates that differential privacy can successfully mitigate such attacks in a parameter regime where utility degradation is minimal.

関連論文リスト

Vulnerability of Transfer-Learned Neural Networks to Data Reconstruction Attacks in Small-Data Regime [0.0]
トレーニングデータ再構築攻撃は、敵がリリースしたモデルのトレーニングデータの一部を復元することを可能にする。再構成ニューラルネットワークがトレーニングデータとモデル重みの間の(ランダムな)マッピングを反転させるのを学習する攻撃について考察する。
論文参考訳（メタデータ） (2025-05-20T13:09:22Z)
Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable [30.22146634953896]
線形回帰モデルから削除したデータポイントに対して、ほぼ完璧な攻撃をマウントする方法を示す。我々の研究は、個人がモデルからデータの削除を要求できる非常に単純なモデルクラスであっても、プライバシリスクが重要であることを強調している。
論文参考訳（メタデータ） (2024-05-30T17:27:44Z)
Bounding Reconstruction Attack Success of Adversaries Without Data Priors [53.41619942066895]
機械学習(ML)モデルに対する再構成攻撃は、機密データの漏洩の強いリスクをもたらす。本研究では,現実的な対角的環境下での再建成功に関する公式な上限を提供する。
論文参考訳（メタデータ） (2024-02-20T09:52:30Z)
Membership Inference Attacks on Diffusion Models via Quantile Regression [30.30033625685376]
我々は,家族関係推論(MI)攻撃による拡散モデルのプライバシー上の脆弱性を実証する。提案したMI攻撃は、トレーニングに使用されていない例における再構成損失の分布を予測(定量化)する量子レグレッションモデルを学習する。我々の攻撃は従来の最先端攻撃よりも優れており、計算コストは著しく低い。
論文参考訳（メタデータ） (2023-12-08T16:21:24Z)
Boosting Model Inversion Attacks with Adversarial Examples [26.904051413441316]
ブラックボックス設定において、より高い攻撃精度を達成できる学習ベースモデル反転攻撃のための新しい訓練パラダイムを提案する。まず,攻撃モデルの学習過程を,意味的損失関数を追加して規則化する。第2に、学習データに逆例を注入し、クラス関連部の多様性を高める。
論文参考訳（メタデータ） (2023-06-24T13:40:58Z)
Deconstructing Classifiers: Towards A Data Reconstruction Attack Against Text Classification Models [2.9735729003555345]
我々はMix And Match攻撃と呼ばれる新たなターゲットデータ再構成攻撃を提案する。この研究は、分類モデルにおけるデータ再構成攻撃に関連するプライバシーリスクを考慮することの重要性を強調している。
論文参考訳（メタデータ） (2023-06-23T21:25:38Z)
Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation [110.61853418925219]
我々は、データセット再構築攻撃のより強力なバージョンを構築し、無限の幅で設定されたエンペントリアルトレーニングを確実に回復する方法を示す。理論的にも経験的にも再構成された画像は、データセットの「外部」に傾向を示す。これらのリコンストラクション攻撃は, テクストデータセット蒸留において, 再構成画像上で再トレーニングを行い, 高い予測精度を得ることができる。
論文参考訳（メタデータ） (2023-02-02T21:41:59Z)
Reconstructing Training Data from Model Gradient, Provably [68.21082086264555]
ランダムに選択されたパラメータ値で1つの勾配クエリからトレーニングサンプルを再構成する。センシティブなトレーニングデータを示す証明可能な攻撃として、われわれの発見はプライバシーに対する深刻な脅威を示唆している。
論文参考訳（メタデータ） (2022-12-07T15:32:22Z)
RelaxLoss: Defending Membership Inference Attacks without Losing Utility [68.48117818874155]
より達成可能な学習目標を持つ緩和された損失に基づく新しい学習フレームワークを提案する。 RelaxLossは、簡単な実装と無視可能なオーバーヘッドのメリットを加えた任意の分類モデルに適用できる。当社のアプローチはMIAに対するレジリエンスの観点から,常に最先端の防御機構より優れています。
論文参考訳（メタデータ） (2022-07-12T19:34:47Z)
Delving into Data: Effectively Substitute Training for Black-box Attack [84.85798059317963]
本稿では,知識盗むプロセスで使用されるデータの分散設計に焦点をあてた,新しい視点代替トレーニングを提案する。これら2つのモジュールの組み合わせにより、代替モデルとターゲットモデルの一貫性がさらに向上し、敵攻撃の有効性が大幅に向上する。
論文参考訳（メタデータ） (2021-04-26T07:26:29Z)
Exploring the Security Boundary of Data Reconstruction via Neuron Exclusivity Analysis [23.07323180340961]
線形整列ユニット(ReLUs)を用いたニューラルネットワーク上の微視的視点による勾配からのデータ再構成のセキュリティ境界について検討する。ニューラルネットワークの安全性の低い境界にある訓練バッチの再構築において,従来の攻撃よりも大幅に優れる新しい決定論的攻撃アルゴリズムを構築した。
論文参考訳（メタデータ） (2020-10-26T05:54:47Z)
Knowledge-Enriched Distributional Model Inversion Attacks [49.43828150561947]
モデルインバージョン(MI)攻撃は、モデルパラメータからトレーニングデータを再構成することを目的としている。本稿では,パブリックデータからプライベートモデルに対する攻撃を行うのに役立つ知識を抽出する,新しい反転型GANを提案する。実験の結果,これらの手法を組み合わせることで,最先端MI攻撃の成功率を150%向上させることができることがわかった。
論文参考訳（メタデータ） (2020-10-08T16:20:48Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。