Fugu-MT 論文翻訳(概要): Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations

論文の概要: Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations

arxiv url: http://arxiv.org/abs/2511.21022v1
Date: Wed, 26 Nov 2025 03:36:34 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-27 18:37:58.950154
Title: Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations
Title（参考訳）: LLMの軽量モデル編集による非推奨APIレコメンデーションの修正
Authors: Guancheng Lin, Xiao Yu, Jacky Keung, Xing Hu, Xin Xia, Alex X. Liu,
Abstract要約: 事前訓練された大規模言語モデル(LLM)は、コード補完タスクにおいて強力なパフォーマンスを示している。 LLMは、将来のサードパーティライブラリではサポートされない非推奨のAPIを頻繁に生成する。本稿では,AdaLoRA-Lを提案する。AdaLoRA-Lは"共通APIレイヤ"(すべてのAPIで高い重要性を持ち,一般的な知識を保存し,編集から除外されたレイヤ)を定義し,編集を"特定APIレイヤ"に限定する。実験の結果、AdaLoRA-Lは他の評価指標と同等の性能を維持しながら、比例性を大幅に改善することが示された。
参考スコア（独自算出の注目度）: 15.586818028794942
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Pre-trained or fine-tuned on large code corpora, Large Language Models (LLMs) have demonstrated strong performance in code completion tasks. However, their embedded knowledge is constrained by the timeliness of training data, which often includes code using deprecated APIs. Consequently, LLMs frequently generate deprecated APIs that will no longer be supported in future versions of third-party libraries. While retraining LLMs on updated codebases could refresh their API knowledge, this approach is computationally expensive. Recently, lightweight model editing methods have emerged to efficiently correct specific knowledge in LLMs. However, it remains unclear whether these methods can effectively update deprecated API knowledge and enable edited models to generate up-to-date APIs. To address this gap, we conduct the first systematic study applying 10 state-of-the-art model editing techniques to update deprecated API knowledge in three LLMs: Qwen2.5-Coder, StarCoder2, and DeepSeek-Coder. We introduce EDAPIBench, a dedicated benchmark featuring over 70 deprecated APIs from 8 popular Python libraries, with more than 3,000 editing instances. Our results show that the parameter-efficient fine-tuning method AdaLoRA achieves the best performance in enabling edited models to generate correct, up-to-date APIs, but falls short in Specificity (i.e., the editing influences untargeted knowledge). To resolve this, we propose AdaLoRA-L, which defines "Common API Layers" (layers within the LLMs with high importance across all APIs, storing general knowledge and excluded from editing) and restricts edits exclusively to "Specific API Layers" (layers with high importance only for the target API, storing the API-specific knowledge). Experimental results demonstrate that AdaLoRA-L significantly improves Specificity while maintaining comparable performance across other evaluation metrics.
Abstract（参考訳）: 大規模コードコーパス、LLM(Large Language Models)の事前訓練または微調整は、コード補完タスクにおいて強力なパフォーマンスを示している。しかしながら、その組み込み知識は、非推奨のAPIを使用したコードを含む、トレーニングデータのタイムラインによって制限されている。その結果、LLMは非推奨のAPIを頻繁に生成し、将来のサードパーティライブラリではサポートされなくなる。更新されたコードベースでLLMを再トレーニングすることは、APIの知識を更新する可能性があるが、このアプローチは計算的に高価である。近年,LLMの特定の知識を効率よく補正するために,軽量なモデル編集手法が出現している。しかし、これらのメソッドが非推奨のAPI知識を効果的に更新し、編集されたモデルが最新のAPIを生成することができるかどうかは不明だ。このギャップに対処するため,まず10種類の最先端モデル編集技術を適用し,非推奨のAPI知識を3つのLLM(Qwen2.5-Coder,StarCoder2,DeepSeek-Coder)で更新する。 EDAPIBenchは8つの人気のあるPythonライブラリから70以上の非推奨APIを特徴とする専用ベンチマークで、3,000以上の編集インスタンスがある。この結果から,パラメータ効率のよい微調整手法であるAdaLoRAは,編集されたモデルに対して,最新のAPIを生成可能であるが,特異性に乏しい(すなわち,未学習の知識に影響を及ぼす)ことが示唆された。この問題を解決するために、AdaLoRA-Lを提案する。これは"Common API Layers"(すべてのAPIにおいて高い重要性を持ち、一般的な知識を保存し、編集から除外されたレイヤ)を定義し、編集を"Specific API Layers"(ターゲットAPIにのみ重要で、API固有の知識を格納するレイヤ)に限定する。実験の結果、AdaLoRA-Lは他の評価指標と同等の性能を維持しながら、比例性を大幅に改善することが示された。

論文の概要: Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations

関連論文リスト