Fugu-MT 論文翻訳(概要): PURGE: Projected Unlearning via Retain-Guided Erasure

論文の概要: PURGE: Projected Unlearning via Retain-Guided Erasure

arxiv url: http://arxiv.org/abs/2606.03808v1
Date: Tue, 02 Jun 2026 15:53:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:05.130066
Title: PURGE: Projected Unlearning via Retain-Guided Erasure
Title（参考訳）: PURGE:Retain-Guided Erasureによる未学習の計画
Authors: Vedant Jawandhia, Daksh Ahuja, Ghufran Alam Siddiqui, Prashant Trivedi, Yash Sinha, Pratik Narang,
Abstract要約: PURGEは、単純だが未発見の観測(継続学習(CL)と機械未学習(MU))に基づいて構築された機械学習アルゴリズムである。 CLは古いタスクを忘れずに新しいタスクを学習しようとします。MUは、反対方向に同じ根底にある緊張力を表す保持パフォーマンスを損なうことなく、特定のデータを消去しようとします。 PURGEは、MIA AUROCを0.5(理想)に近い精度で達成し、勾配上昇、KLユニフォーム、プライバシーユーティリティフロンティアのいくつかのベースラインを達成しながら、常に96%以上の精度を維持している。
参考スコア（独自算出の注目度）: 5.069332646113575
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We propose PURGE, a machine unlearning algorithm built on a simple but an under-exploited observation: continual learning (CL) and machine unlearning (MU) which are fundamentally dual problems. CL tries to learn new tasks without forgetting old ones; MU tries to erase specific data without hurting retained performance representing the same underlying tension in opposite directions. PURGE leverages this duality by adapting gradient projection from A-GEM (Chaudhry et al., 2019) so that every unlearning step is constrained to not increase the retain-set loss. On top of this, it performs multi-layer representation erasure, pushing forget-set activations in intermediate layers towards the retain distribution to remove information from hidden representations rather than just suppressing it at the output. A key design choice is the retain-confusion target: rather than pushing forget outputs toward the uniform distribution, which we found to be surprisingly easy for membership inference attacks to detect, we instead target the model's natural confusion pattern on retain data. This makes the unlearned model hard to distinguish from one retrained from scratch. Two self-regulating stopping criteria (a retain-loss budget and a forget-accuracy target) let the algorithm decide on its own when to stop, removing the need for manual epoch tuning. In experiments on five datasets (CIFAR-10, MNIST, SVHN, STL10, PathMNIST) across 22 class-level forgetting tasks, PURGE consistently keeps retain accuracy above 96% while achieving MIA AUROC close to 0.5 (the ideal), outperforming gradient ascent, KL-uniform, and several published baselines on the privacy-utility frontier.
Abstract（参考訳）: 基本的には二重問題である継続学習(CL)と機械アンラーニング(MU)という,単純だが未発見な観測結果に基づいて構築された機械アンラーニングアルゴリズムであるPURGEを提案する。 CLは古いタスクを忘れずに新しいタスクを学習しようとします。MUは、反対方向に同じ根底にある緊張力を表す保持パフォーマンスを損なうことなく、特定のデータを消去しようとします。 PURGEはこの双対性を利用して、A-GEM(Chaudhry et al , 2019)からの勾配投影を適用し、すべての未学習ステップが保持セット損失を増大させないよう制約される。これに加えて、マルチレイヤ表現消去を行い、中間層における忘れられたセットのアクティベーションを保持分布にプッシュすることで、単に出力で抑制するのではなく、隠された表現から情報を除去する。均一分布に出力をプッシュする代わりに、メンバシップ推論攻撃が検出するのが驚くほど容易であることが分かったので、保持データに対するモデルの自然な混乱パターンをターゲットとしています。これにより、未学習のモデルは、スクラッチからトレーニングされた1つのモデルと区別しにくくなります。 2つの自己規制停止基準(余剰予算と忘れ精度目標)は、アルゴリズムが停止するタイミングを決め、手動のエポックチューニングを不要にする。 5つのデータセット(CIFAR-10、MNIST、SVHN、STL10、PathMNIST)の22のクラスレベルの忘れ作業に対する実験では、PURGEは一貫して96%以上の精度を維持し、MIA AUROCは0.5(理想)に近づき、勾配上昇、KL-ユニフォーム、プライバシーユーティリティフロンティアのいくつかの公開ベースラインを達成している。

論文の概要: PURGE: Projected Unlearning via Retain-Guided Erasure

関連論文リスト