Fugu-MT 論文翻訳(概要): A Survey on Unlearning in Large Language Models

論文の概要: A Survey on Unlearning in Large Language Models

arxiv url: http://arxiv.org/abs/2510.25117v1
Date: Wed, 29 Oct 2025 02:34:17 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-30 15:50:44.988919
Title: A Survey on Unlearning in Large Language Models
Title（参考訳）: 大規模言語モデルにおけるアンラーニングに関する調査
Authors: Ruichen Qiu, Jiajun Tan, Jiayue Pu, Honglin Wang, Xiao-Shan Gao, Fei Sun,
Abstract要約: 大規模言語モデル(LLM)は自然言語処理に革命をもたらしたが、大規模なコーパスでのトレーニングは重大なリスクをもたらす。これらの問題を緩和し、「忘れられる権利」のような法的・倫理的な基準に合わせるために、機械の非学習は重要なテクニックとして現れてきた。この調査は、2021年以降に出版されたLLMアンラーニングに関する180以上の論文の体系的なレビューを提供する。
参考スコア（独自算出の注目度）: 18.262778815699345
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The advancement of Large Language Models (LLMs) has revolutionized natural language processing, yet their training on massive corpora poses significant risks, including the memorization of sensitive personal data, copyrighted material, and knowledge that could facilitate malicious activities. To mitigate these issues and align with legal and ethical standards such as the "right to be forgotten", machine unlearning has emerged as a critical technique to selectively erase specific knowledge from LLMs without compromising their overall performance. This survey provides a systematic review of over 180 papers on LLM unlearning published since 2021, focusing exclusively on large-scale generative models. Distinct from prior surveys, we introduce novel taxonomies for both unlearning methods and evaluations. We clearly categorize methods into training-time, post-training, and inference-time based on the training stage at which unlearning is applied. For evaluations, we not only systematically compile existing datasets and metrics but also critically analyze their advantages, disadvantages, and applicability, providing practical guidance to the research community. In addition, we discuss key challenges and promising future research directions. Our comprehensive overview aims to inform and guide the ongoing development of secure and reliable LLMs.
Abstract（参考訳）: LLM(Large Language Models)の進歩は自然言語処理に革命をもたらしたが、その大規模なコーパスでの訓練は、機密性の高い個人情報の記憶、著作権のある資料、悪意ある活動を促進する知識など、重大なリスクをもたらす。これらの問題を緩和し、「忘れられる権利」のような法的・倫理的な基準に合わせるために、機械アンラーニングはLLMから特定の知識を選択的に消去する重要な手法として、全体的な性能を損なうことなく登場した。この調査は、2021年以降に出版されたLLMアンラーニングに関する180以上の論文の体系的なレビューを提供する。従来の調査と異なり、未学習の手法と評価の両方に新しい分類法を導入する。本研究では,未学習の訓練段階に基づいて,学習時間,後学習,推論時間に明確に分類する。評価のために、既存のデータセットやメトリクスを体系的にコンパイルするだけでなく、それらの利点、デメリット、適用性を分析し、研究コミュニティに実践的なガイダンスを提供する。また,重要な課題と今後の研究の方向性についても論じる。我々の総合的な概要は、安全で信頼性の高いLCMの継続的な開発を知らせ、ガイドすることを目的としている。

論文の概要: A Survey on Unlearning in Large Language Models

関連論文リスト