Fugu-MT 論文翻訳(概要): HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning

論文の概要: HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning

arxiv url: http://arxiv.org/abs/2604.11214v1
Date: Mon, 13 Apr 2026 09:08:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.449942
Title: HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning
Title（参考訳）: HiEdit:階層的強化学習による生涯モデル編集
Authors: Yangfan Wang, Tianyang Sun, Chen Tang, Jie Liu, Wei Cai, Jingchi Jiang,
Abstract要約: HiEditは階層的な強化学習フレームワークで、編集インスタンスごとに最も知識に関連のあるレイヤを適応的に識別する。実験の結果、HiEditは競合するRLEditのパフォーマンスを平均8.48%向上させ、編集するレイヤの半分しか摂動しないことがわかった。
参考スコア（独自算出の注目度）: 16.114622048845895
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Lifelong model editing (LME) aims to sequentially rectify outdated or inaccurate knowledge in deployed LLMs while minimizing side effects on unrelated inputs. However, existing approaches typically apply parameter perturbations to a static and dense set of LLM layers for all editing instances. This practice is counter-intuitive, as we hypothesize that different pieces of knowledge are stored in distinct layers of the model. Neglecting this layer-wise specificity can impede adaptability in integrating new knowledge and result in catastrophic forgetting for both general and previously edited knowledge. To address this, we propose HiEdit, a hierarchical reinforcement learning framework that adaptively identifies the most knowledge-relevant layers for each editing instance. By enabling dynamic, instance-aware layer selection and incorporating an intrinsic reward for sparsity, HiEdit achieves precise, localized updates. Experiments on various LLMs show that HiEdit boosts the performance of the competitive RLEdit by an average of 8.48% with perturbing only half of the layers per edit. Our code is available at: https://github.com/yangfanww/hiedit.
Abstract（参考訳）: ライフロングモデル編集(LME)は、非関係な入力に対する副作用を最小限に抑えつつ、デプロイされたLLMにおいて、時代遅れまたは不正確な知識を逐次修正することを目的としている。しかし、既存のアプローチでは、通常、全ての編集インスタンスに対して静的で高密度なLCM層にパラメータ摂動を適用する。このプラクティスは直感に反するものであり、異なる知識がモデルの異なるレイヤに格納されているという仮説を立てています。この階層的特異性を無視することは、新しい知識を統合する際の適応性を阻害し、一般的な知識と以前に編集された知識の両方を破滅的に忘れてしまう。そこで本稿では,階層型強化学習フレームワークであるHiEditを提案する。動的でインスタンス対応の層選択を可能とし、スパーシリティに固有の報酬を組み込むことで、HiEditは正確なローカライズされた更新を実現する。様々な LLM の実験では、HiEdit は競合する RLEdit のパフォーマンスを平均8.48%向上させ、編集毎に半分のレイヤーしか摂動しないことを示した。私たちのコードは、https://github.com/yangfanww/hiedit.comで利用可能です。

論文の概要: HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning

関連論文リスト