Fugu-MT 論文翻訳(概要): PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

論文の概要: PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

arxiv url: http://arxiv.org/abs/2605.28819v1
Date: Wed, 27 May 2026 17:59:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:56.272811
Title: PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective
Title（参考訳）: PEFT-Arena:安定性・塑性の観点からのパラメータ効率の高いファインタニング
Authors: Yangyi Huang, Ruotian Peng, Zeju Qiu, Jiale Kang, Yandong Wen, Bernhard Schölkopf, Weiyang Liu,
Abstract要約: PEFTは安定性・塑性ジレンマにより評価されるべきである。本稿では,下流性能と一般能力の維持を計測するベンチマークPEFT-Arenaを紹介する。そこで本研究では,パスワイド巻き戻しによるポストホック改善の事例研究を行った。
参考スコア（独自算出の注目度）: 52.693471818837395
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Parameter-efficient finetuning (PEFT) has become the standard approach for adapting large language models, yet evaluations largely emphasize downstream accuracy while overlooking the retention of pretrained capabilities. We argue that PEFT should be assessed through the stability-plasticity dilemma: the trade-off between target-task adaptation and resistance to forgetting. We introduce PEFT-Arena, a benchmark that jointly measures downstream performance and general capability retention. Across methods, we find distinct stability-plasticity profiles; under comparable parameter budgets, orthogonal finetuning achieves the most favorable Pareto frontier. To explain these differences, we analyze PEFT updates from two geometric perspectives. In weight space, spectral analysis reveals how parameterizations interact with the pretrained singular-value structure. In activation space, retention metrics show whether finetuning preserves or distorts general-capability representations, with forgetting linked to non-isometric representation distortion. Finally, an analysis shows that final SFT checkpoints often overshoot a better target-retention operating point. Inspired by this, we present case studies of a post-hoc improvement with path-wise rewinding.
Abstract（参考訳）: パラメータ効率ファインタニング(PEFT)は大規模言語モデルに適応する標準的な手法となっているが、評価は事前訓練された能力の維持を目立たせながら、下流の精度を重視している。我々は,PEFTは,目標タスク適応と,忘れることへの抵抗のトレードオフである安定性・塑性ジレンマによって評価されるべきであると主張している。本稿では,下流性能と一般能力維持を共同で測定するベンチマークPEFT-Arenaを紹介する。パラメータ予算では直交微調整は最も好ましいパレートフロンティアを実現する。これらの違いを説明するために,2つの幾何学的視点からPEFT更新を解析した。重み空間において、スペクトル解析はパラメータ化が事前訓練された特異値構造とどのように相互作用するかを明らかにする。アクティベーション空間において、保持度は、非等尺的表現歪みに関連付けて、微調整が一般能力表現を保存するか歪曲するかを示す。最後に、分析の結果、最終的なSFTチェックポイントは、しばしばより良い目標保持動作ポイントをオーバーシュートすることが示された。そこで本研究では,パスワイド巻き戻しによるポストホック改善の事例研究を行った。

論文の概要: PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

関連論文リスト