Fugu-MT 論文翻訳(概要): Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

論文の概要: Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

arxiv url: http://arxiv.org/abs/2606.20431v1
Date: Thu, 18 Jun 2026 16:10:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-19 18:23:39.975843
Title: Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning
Title（参考訳）: 空間性, 重ね, 予測:連続学習における表現保持の力学的研究
Authors: Jan Wasilewski, Jędrzej Kozal, Michał Woźniak, Bartosz Krawczyk,
Abstract要約: 連続学習システムは、しばしば以前取得した知識を忘れる。我々は, 忘れる機構を観察し, テスト可能にする, 制御されたおもちゃの世界フレームワークを提案する。
参考スコア（独自算出の注目度）: 6.113106953880908
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Continual learning (CL) systems often forget previously acquired knowledge, yet the mechanisms driving forgetting remain hard to isolate in practice because real datasets entangle many factors. We present a controlled, toy-world framework that makes these mechanisms observable and testable. Using a synthetic generator-separator pipeline, we define ground-truth latent features, build tasks with tunable sparsity and overlap, and introduce measurable quantities for representation strength and superposition (directional overlap among features). We then study retention dynamics-the temporal change of representation strength by fitting sparse dynamical relations (via SINDy) between retention, superposition, and exposure history. A complementary task-level analysis based on effective rank characterizes how representational capacity is allocated across tasks. Our controlled experiments yield three takeaways. (1) Superposition tends to increase over time with transient dips at task boundaries, suggesting boundary-specific interference rather than steady drift. (2) Higher feature sparsity induces more superposition yet does not inevitably cause forgetting; when representations remain strong, forgetting can be reduced despite overlap. (3) Task-level effective rank grows with sparsity, indicating broader capacity usage under sparse regimes. Together, these results nuance the common intuition that more superposition leads to more forgetting by showing that overlap interacts with representation strength and capacity allocation. Our toy analysis provides falsifiable hypotheses and diagnostic tools for CL.
Abstract（参考訳）: 連続学習(CL)システムは、しばしば以前取得した知識を忘れるが、実際のデータセットが多くの要因を絡み合わせるため、忘れることを促すメカニズムは、実際には分離が難しいままである。我々は,これらの機構を観察し,テスト可能にする,制御されたおもちゃの世界フレームワークを提案する。合成ジェネレータ・セパレータパイプラインを用いて, 地中連続潜伏特性を定義し, 調整可能な間隔と重なりを持つタスクを構築し, 表現強度と重なり量(特徴間の方向重なり)について測定可能な量を導入する。次に、保持、重ね合わせ、露出履歴の間の(SINDyを介して)疎ダイナミックな関係を組み込むことにより、表現強度の時間的変化について検討する。効果的なランクに基づく補完的なタスクレベル分析は、タスク間での表現能力の割り当てを特徴付ける。制御された実験では3つのテイクアウトが得られます。 1) 重畳はタスク境界における過渡的なディップによって時間とともに増加する傾向にあり, 定常的なドリフトよりも境界固有の干渉が示唆される。 2) 高い特徴空間は重畳を誘発するが、必然的に忘れを生じさせることはない。 3) タスクレベルの有効ランクは,スパース体制下でのキャパシティ使用量の増加とともに増大する。これらの結果は、重なり合いが表現強度とキャパシティアロケーションと相互作用することを示すことによって、より多くの重ね合わせがより忘れることにつながる共通の直観をニュアンスさせる。我々の玩具分析はCLのための偽装仮説と診断ツールを提供する。

論文の概要: Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

関連論文リスト