Fugu-MT 論文翻訳(概要): Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning

論文の概要: Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning

arxiv url: http://arxiv.org/abs/2508.21300v1
Date: Fri, 29 Aug 2025 01:45:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-01 19:45:10.91662
Title: Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning
Title（参考訳）: LoRAに基づくLLMアンラーニングにおける漁獲量推定と効率の改善
Authors: Yejin Kim, Eunwon Kim, Buru Chang, Junsuk Choe,
Abstract要約: VILAは、FILAで見落とされた仮定を明示的に考慮する、新しいアンラーニングフレームワークである。我々はTOFU、WMDP、MUSEなどのベンチマークで最先端のパフォーマンスを新たに設定した。
参考スコア（独自算出の注目度）: 15.880906462267935
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: LLMs have demonstrated remarkable performance across various tasks but face challenges related to unintentionally generating outputs containing sensitive information. A straightforward approach to address this issue is to retrain the model after excluding the problematic data. However, this approach incurs prohibitively high computational costs. To overcome this limitation, machine unlearning has emerged as a promising solution that can effectively remove sensitive information without the need to retrain the model from scratch. Recently, FILA has been proposed as a parameter-efficient unlearning method by integrating LoRA adapters. Specifically, it calculates the Fisher information to identify parameters associated with the forget set and assigns them to LoRA adapters for updates. Despite its innovative approach, FILA still requires access to all model parameters and does not adequately account for fundamental assumptions underlying Fisher information, leading to inaccuracies in importance estimation. To address these limitations, we propose VILA, a novel unlearning framework that explicitly considers the assumptions overlooked in FILA, thereby enhancing the accuracy of parameter identification for the forget set. Moreover, VILA significantly reduces computational costs by enabling parameter identification without accessing the entire model. Our method achieves up to 100x higher parameter efficiency and 40x faster training speed compared to FILA, and sets new state-of-the-art performance on benchmarks including TOFU, WMDP, and MUSE. Our code is available at https://github.com/kyj93790/VILA.
Abstract（参考訳）: LLMは様々なタスクにおいて顕著な性能を示してきたが、機密情報を含む意図しない出力に関する課題に直面している。この問題に対処するための簡単なアプローチは、問題のあるデータを除外した後、モデルを再トレーニングすることです。しかし、このアプローチは計算コストを極端に高める。この制限を克服するために、マシン・アンラーニングは、モデルをスクラッチから再トレーニングすることなく、センシティブな情報を効果的に除去できる有望なソリューションとして登場した。 FILAは,LoRAアダプタを組み込んだパラメータ効率の高い未学習手法として提案されている。具体的には、Fisher情報を計算して、Deaksetに関連するパラメータを識別し、更新のためにLoRAアダプタに割り当てる。その革新的なアプローチにもかかわらず、FILAは依然として全てのモデルパラメータへのアクセスを必要としており、フィッシャー情報に基づく基本的な仮定を適切に説明していない。これらの制約に対処するために,FILAで見過ごされた仮定を明示的に考慮し,未知集合に対するパラメータ識別の精度を高める新しいアンラーニングフレームワークであるVILAを提案する。さらに、VILAは、モデル全体にアクセスすることなくパラメータ識別を可能にすることにより、計算コストを大幅に削減する。提案手法はFILAと比較して最大100倍のパラメータ効率と40倍のトレーニング速度を実現し,TOFU, WMDP, MUSEなどのベンチマーク上での最先端性能を新たに設定する。私たちのコードはhttps://github.com/kyj93790/VILAで公開されています。

論文の概要: Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning

関連論文リスト