Fugu-MT 論文翻訳(概要): Energy-Regularized Sequential Model Editing on Hyperspheres

論文の概要: Energy-Regularized Sequential Model Editing on Hyperspheres

arxiv url: http://arxiv.org/abs/2510.01172v1
Date: Wed, 01 Oct 2025 17:55:43 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.709503
Title: Energy-Regularized Sequential Model Editing on Hyperspheres
Title（参考訳）: ハイパースフィア上でのエネルギー規則化シーケンスモデル編集
Authors: Qingyuan Liu, Jia-Chen Gu, Yunzhi Yao, Hong Wang, Nanyun Peng,
Abstract要約: 大規模言語モデル(LLM)は、進化する現実世界の知識と整合性を維持するために、定期的な更新を必要とする。逐次編集はしばしば表現を不安定にし、破滅的な忘れを誘発する。ニューロンの重量分布を安定化するHE駆動正規化戦略であるSPHERE(Sparse Projection for Hyperspherical Energy-Regularized Editing)を提案する。
参考スコア（独自算出の注目度）: 59.47007547581175
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) require constant updates to remain aligned with evolving real-world knowledge. Model editing offers a lightweight alternative to retraining, but sequential editing often destabilizes representations and induces catastrophic forgetting. In this work, we seek to better understand and mitigate performance degradation caused by sequential editing. We hypothesize that hyperspherical uniformity, a property that maintains uniform distribution of neuron weights on a hypersphere, helps the model remain stable, retain prior knowledge, while still accommodate new updates. We use Hyperspherical Energy (HE) to quantify neuron uniformity during editing, and examine its correlation with editing performance. Empirical studies across widely used editing methods reveals a strong correlation between HE dynamics and editing performance, with editing failures consistently coinciding with high HE fluctuations. We further theoretically prove that HE dynamics impose a lower bound on the degradation of pretrained knowledge, highlighting why HE stability is crucial for knowledge retention. Motivated by these insights, we propose SPHERE (Sparse Projection for Hyperspherical Energy-Regularized Editing), an HE-driven regularization strategy that stabilizes neuron weight distributions, ultimately preserving prior knowledge while enabling reliable sequential updates. Specifically, SPHERE identifies a sparse space complementary to the principal hyperspherical directions of the pretrained weight matrices and projects new knowledge onto it, attenuating perturbations on the principal directions. Extensive experiments on LLaMA3 (8B) and Qwen2.5 (7B) show that SPHERE outperforms the best baseline in editing capability by an average of 16.41%, while most faithfully preserving general model performance, thereby offering a principled path toward reliable large-scale knowledge editing.
Abstract（参考訳）: 大規模言語モデル(LLM)は、進化する現実世界の知識と整合性を維持するために、定期的な更新を必要とする。モデル編集は、リトレーニングの軽量な代替手段を提供するが、シーケンシャルな編集は、しばしば表現を不安定にし、破滅的な忘れを誘発する。本研究では,逐次編集による性能劣化の理解と軽減を図る。我々は、超球面上のニューロン重みの均一分布を維持する性質である超球面均一性は、モデルが安定し、事前の知識を維持しつつ、新しい更新を許容するのに役立つと仮定する。我々は超球面エネルギー(HE)を用いて、編集中のニューロンの均一性を定量化し、その編集性能との相関について検討する。広く使われている編集手法における実証研究は、HEのダイナミクスと編集性能の相関が強く、編集失敗は高いHE変動と一貫して一致していることを示している。さらに、HE力学が事前学習した知識の劣化に低い限界を課すことを理論的に証明し、He安定性が知識保持に不可欠である理由を明らかにした。これらの知見を生かしたSPHERE(Sparse Projection for Hyperspherical Energy-Regularized Editing)を提案する。具体的には、SPHEREは、事前訓練された重量行列の主超球面方向を補完するスパース空間を特定し、その上に新しい知識を投射し、主方向の摂動を減衰させる。 LLaMA3 (8B) と Qwen2.5 (7B) の広範な実験により、SPHEREは平均16.41%の編集能力で最高のベースラインを上回り、最も忠実に一般的なモデル性能を保ち、信頼性の高い大規模知識編集への原則化された経路を提供する。

論文の概要: Energy-Regularized Sequential Model Editing on Hyperspheres

関連論文リスト