Fugu-MT 論文翻訳(概要): LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples

論文の概要: LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples

arxiv url: http://arxiv.org/abs/2512.07375v1
Date: Mon, 08 Dec 2025 10:10:29 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-09 22:03:54.828164
Title: LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples
Title（参考訳）: LUNE: LoRAファインチューニングによる効率的なLLMアンラーニング
Authors: Yezi Liu, Hanning Chen, Wenjun Huang, Yang Ni, Mohsen Imani,
Abstract要約: 大規模言語モデル(LLM)は、広範な訓練コーパスから得られた膨大な知識を持っている。従来のモデルアンラーニングアプローチでは、計算に高価な微調整や直接重み付けが必要となる。 LoRAベースのUnlearning with Negative Examples (LUNE)は、ローランクアダプタのみを更新することで、負のみのアンラーニングを実行する。
参考スコア（独自算出の注目度）: 17.898277374771254
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) possess vast knowledge acquired from extensive training corpora, but they often cannot remove specific pieces of information when needed, which makes it hard to handle privacy, bias mitigation, and knowledge correction. Traditional model unlearning approaches require computationally expensive fine-tuning or direct weight editing, making them impractical for real-world deployment. In this work, we introduce LoRA-based Unlearning with Negative Examples (LUNE), a lightweight framework that performs negative-only unlearning by updating only low-rank adapters while freezing the backbone, thereby localizing edits and avoiding disruptive global changes. Leveraging Low-Rank Adaptation (LoRA), LUNE targets intermediate representations to suppress (or replace) requested knowledge with an order-of-magnitude lower compute and memory than full fine-tuning or direct weight editing. Extensive experiments on multiple factual unlearning tasks show that LUNE: (I) achieves effectiveness comparable to full fine-tuning and memory-editing methods, and (II) reduces computational cost by about an order of magnitude.
Abstract（参考訳）: 大規模言語モデル(LLM)は、広範囲なトレーニングコーパスから取得した膨大な知識を持っているが、必要に応じて特定の情報を削除できないことが多いため、プライバシやバイアス軽減、知識修正の処理が困難である。従来のモデルアンラーニングアプローチでは、計算に高価な微調整や直接重み付けが必要であり、現実のデプロイメントでは実用的ではない。本研究では,LoRAをベースとしたUnlearning with Negative Examples(LUNE)という,低ランクなアダプタのみをフリーズしながら更新し,編集をローカライズし,破壊的なグローバルな変更を避けることにより,負のみのアンラーニングを実現する軽量フレームワークを紹介した。 Low-Rank Adaptation (LoRA)を活用してLUNEは、要求された知識を完全な微調整や直接重み付けよりも低次の計算とメモリで抑制(または置き換え)するために中間表現をターゲットにしている。複数の事実学習タスクに対する大規模な実験により、LUNE:(I)完全な微調整およびメモリ編集手法に匹敵する効率を実現し、(II)計算コストを約1桁削減した。

論文の概要: LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples

関連論文リスト