Fugu-MT 論文翻訳(概要): Is your algorithm unlearning or untraining?

論文の概要: Is your algorithm unlearning or untraining?

arxiv url: http://arxiv.org/abs/2604.07962v1
Date: Thu, 09 Apr 2026 08:24:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.79915
Title: Is your algorithm unlearning or untraining?
Title（参考訳）: あなたのアルゴリズムは学習していないのか、トレーニングしていないのか?
Authors: Eleni Triantafillou, Ahmed Imtiaz Humayun, Monica Ribero, Alexander Matt Turner, Michael C. Mozer, Georgios Kaissis,
Abstract要約: 我々は「アンラーニング」という用語は、2つの異なる問題定式化にまたがる異なる研究努力によって過負荷になっていると論じている。図1に示すように、未学習と未学習の2つの概念を特定します。本稿では,これらの問題の技術的定義と各文献で研究した問題設定について述べる。
参考スコア（独自算出の注目度）: 57.25293122855155
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As models are getting larger and are trained on increasing amounts of data, there has been an explosion of interest into how we can ``delete'' specific data points or behaviours from a trained model, after the fact. This goal has been referred to as ``machine unlearning''. In this note, we argue that the term ``unlearning'' has been overloaded, with different research efforts spanning two distinct problem formulations, but without that distinction having been observed or acknowledged in the literature. This causes various issues, including ambiguity around when an algorithm is expected to work, use of inappropriate metrics and baselines when comparing different algorithms to one another, difficulty in interpreting results, as well as missed opportunities for pursuing critical research directions. In this note, we address this issue by establishing a fundamental distinction between two notions that we identify as \unlearning and \untraining, illustrated in Figure 1. In short, \untraining aims to reverse the effect of having trained on a given forget set, i.e. to remove the influence that that specific forget set examples had on the model during training. On the other hand, the goal of \unlearning is not just to remove the influence of those given examples, but to use those examples for the purpose of more broadly removing the entire underlying distribution from which those examples were sampled (e.g. the concept or behaviour that those examples represent). We discuss technical definitions of these problems and map problem settings studied in the literature to each. We hope to initiate discussions on disambiguating technical definitions and identify a set of overlooked research questions, as we believe that this a key missing step for accelerating progress in the field of ``unlearning''.
Abstract（参考訳）: モデルが大きくなり、データ量の増加にトレーニングされているため、後にトレーニングされたモデルから‘削除’した特定のデータポイントや振る舞いを、どのようにして‘削除’できるかという関心が爆発的に高まっている。この目標を「機械の未学習」と呼ぶ。本稿では,「アンラーニング」という用語は,2つの異なる問題の定式化にまたがる異なる研究努力を伴って過負荷となったが,その区別は文献で観察・認識されていないことを論じる。これは、アルゴリズムがいつ機能すると予想されるかの曖昧さ、異なるアルゴリズムを互いに比較する際に不適切なメトリクスとベースラインの使用、結果の解釈が困難であること、重要な研究方向を追求する機会の欠如など、様々な問題を引き起こす。本稿では、図1に示す「学習」と「訓練」という2つの概念の根本的な区別を確立することにより、この問題に対処する。言い換えれば、'untraining' は与えられた忘れ物集合に対する訓練の効果、すなわち特定の忘れ物集合の例が訓練中にモデルに与えた影響を逆にすることを目的としている。一方、‘未学習’の目的は、与えられた例の影響を除去するだけでなく、それらの例が示している概念や行動など、サンプルがサンプリングされた基礎的な分布全体をより広範囲に除去する目的のために、これらの例を使用することである。本稿では,これらの問題の技術的定義と各文献で研究した問題設定について述べる。我々は、技術的定義の曖昧化に関する議論を開始し、見落とされがちな研究課題の集合を識別し、これが'アンラーニング'分野の進歩を加速するための重要な欠落したステップであると信じている。

論文の概要: Is your algorithm unlearning or untraining?

関連論文リスト