Fugu-MT 論文翻訳(概要): Theoretical Foundations of Continual Learning via Drift-Plus-Penalty

論文の概要: Theoretical Foundations of Continual Learning via Drift-Plus-Penalty

arxiv url: http://arxiv.org/abs/2606.08452v1
Date: Sun, 07 Jun 2026 04:51:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.115211
Title: Theoretical Foundations of Continual Learning via Drift-Plus-Penalty
Title（参考訳）: ドリフト・プラス・ペナルティによる連続学習の理論的基礎
Authors: Nazreen Shah, Govinda Arya, Bharath B. N., Ranjitha Prasad,
Abstract要約: 継続的な学習(CL)は、破滅的な忘れを緩和しながら新しいタスクを取り入れることでこの課題に対処する。我々は,忘れることの進化を明示的に制御する制御理論的な視点をCLに導入する。我々は,Drift-PlusPenalty原則を最適化した継続的フレームワークであるCentinual Learning with Drift-PlusPenalty(COLD)を提案する。
参考スコア（独自算出の注目度）: 6.614755043607776
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In many real-world settings, data streams are nonstationary and arrive sequentially, requiring learning systems to adapt continuously without retraining from scratch. Continual learning (CL) addresses this challenge by incorporating new tasks while mitigating catastrophic forgetting, where learning new information degrades performance on previously acquired knowledge. We introduce a control-theoretic perspective on CL that explicitly regulates the evolution of forgetting, framing adaptation as a controlled process subject to long-term stability constraints. We focus on replay-based CL, where a finite memory buffer stores representative samples from prior tasks. We propose COntinual Learning with Drift-Plus-Penalty (COLD), a continual learning framework based on the Drift-Plus-Penalty (DPP) principle from stochastic optimization. To facilitate analysis, we also consider an oracle variant, COLD-ORACLE, as a reference benchmark. At each task, both methods minimize the current task loss while maintaining a virtual queue that tracks deviations from long-term stability on previously learned tasks, capturing the stability-plasticity trade-off as a regulated dynamical process. We establish stability and convergence guarantees that characterize this trade-off through a tunable control parameter. Experiments on standard benchmarks demonstrate that COLD consistently outperforms a broad range of state-of-the-art CL methods while providing competitive and controllable forgetting behavior through explicit regulation of stability and plasticity.
Abstract（参考訳）: 多くの実世界の環境では、データストリームは非定常的であり、逐次到着するので、学習システムはスクラッチから再トレーニングすることなく継続的に適応する必要がある。連続学習(CL)は、破滅的な忘れを緩和しながら、新しいタスクを取り入れることで、この課題に対処する。本稿では,長期安定制約を受ける制御プロセスとして,忘れ,フレーミング適応の進化を明示的に規制するCLの制御理論的視点を紹介する。メモリバッファが先行タスクから代表サンプルを格納するリプレイベースのCLに着目した。確率的最適化からDPP(Drift-Plus-Penalty)の原理に基づく連続的な学習フレームワークであるCOLD(Centinual Learning with Drift-Plus-Penalty)を提案する。分析を容易にするため, オラクル変種である COLD-ORACLE を基準ベンチマークとして検討した。各タスクにおいて、どちらの手法も現在のタスク損失を最小限に抑えつつ、以前に学習したタスクの長期的な安定性から逸脱を追跡する仮想キューを維持し、安定塑性トレードオフを規制された動的プロセスとして捉えている。我々は、調整可能な制御パラメータを通じて、このトレードオフを特徴付ける安定性と収束性を確立する。標準ベンチマークの実験では、COLDは安定性と可塑性の明示的な規制により、競争的で制御可能な忘れ行動を提供しながら、最先端のCL手法の幅広い性能を一貫して上回っていることが示されている。

論文の概要: Theoretical Foundations of Continual Learning via Drift-Plus-Penalty

関連論文リスト