Fugu-MT paper translation (abstract): When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry

Paper overview: When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry

arxiv url: http://arxiv.org/abs/2604.27656v1
Date: Thu, 30 Apr 2026 09:50:01 GMT
Status: translation complete
Last updated in system: 2026-05-01 16:31:54.029447
Title: When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry
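Title (reference translation): When does structure matter in continual learning? Dimensionality controls when modularity shapes representational geometry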
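Authors: Kathrin Korte, Joachim Winter Pedersen, Eleni Nisioti, Sebastian Risi
Abstract summary: The paper examines how network architecture, task similarity, and representational dimensionality jointly shape learning in a sequential task paradigm. The findings suggest that representational dimensionality acts as a key organizing variable governing when structural separation becomes functionally relevant.
Reference score (independently calculated attention measure): 4.121514039516763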
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: To preserve previously learned representations, continual learning systems must strike a balance between plasticity, the ability to acquire new knowledge, and stability. This stability-plasticity dilemma affects how representations can be reused across tasks: shared structure enables transfer when tasks are similar but may also induce interference when new learning disrupts existing representations. However, it remains unclear when and why structural separation influences this trade-off. In this study, we examine how network architecture, task similarity, and representational dimensionality jointly shape learning in a sequential task paradigm inspired by transfer-interference studies. We compare a task-partitioned modular recurrent network with a single-module baseline by systematically varying task similarity (low, medium, high) and the scale of weight initialization, which induces different learning regimes that we empirically characterize through the effective dimensionality of the learned representations. We find that architecture has minimal impact in high-dimensional regimes where representations are sufficiently unconstrained to accommodate multiple tasks without strong interference. In contrast, in lower-dimensional (rich) regimes, architectural separation is decisive: modular networks exhibit graded alignment of task-specific subspaces with overlap for similar tasks, partial orthogonalization for moderately dissimilar tasks, and stronger separation for dissimilar tasks. This graded geometry is absent in the single network baseline. Our findings suggest that representational dimensionality acts as a key organizing variable governing when structural separation becomes functionally relevant, and highlight adaptive geometry as a central principle for designing continual learning systems.
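Abstract (reference translation): To preserve previously learned representations, continual learning systems must balance plasticity, the ability to acquire new knowledge, against stability. This stability-plasticity dilemma affects how representations can be reused across tasks: shared structure enables transfer when tasks are similar, but can cause interference when new learning disrupts existing representations. However, it remains unclear when and why structural separation influences this trade-off. This study examines how network architecture, task similarity, and representational dimensionality jointly shape learning in a sequential task paradigm inspired by transfer-interference studies. A task-partitioned modular recurrent network is compared with a single-module baseline by systematically varying task similarity (low, medium, high) and the scale of weight initialization, which induces different learning regimes that are characterized empirically through the effective dimensionality of the learned representations. Architecture has minimal impact in high-dimensional regimes, where representations are sufficiently unconstrained to accommodate multiple tasks without strong interference. In contrast, in lower-dimensional (rich) regimes, architectural separation is decisive: modular networks show a graded alignment of task-specific subspaces, with overlap for similar tasks, partial orthogonalization for moderately dissimilar tasks, and stronger separation for dissimilar tasks. This graded geometry is absent in the single-network baseline. The findings suggest that representational dimensionality acts as a key organizing variable governing when structural separation becomes functionally relevant, and highlight adaptive geometry as a central principle for designing continual learning systems.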
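
The abstract refers to the effective dimensionality of learned representations and to the alignment of task-specific subspaces, but does not say how either quantity is computed. As a rough illustration only, the sketch below assumes two common choices: the participation ratio of the covariance eigenvalues for effective dimensionality, and the mean squared cosine of the principal angles between top-k PCA subspaces for alignment. The function names `participation_ratio` and `subspace_overlap` are hypothetical and are not taken from the paper.

```python
import numpy as np


def participation_ratio(activations: np.ndarray) -> float:
    """Effective dimensionality of a (samples x units) activation matrix.

    Assumed measure (not confirmed by the abstract):
    PR = (sum_i lambda_i)^2 / sum_i lambda_i^2,
    where lambda_i are the eigenvalues of the covariance matrix.
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / (activations.shape[0] - 1)
    # eigvalsh handles symmetric matrices; clip tiny negative round-off values.
    eigvals = np.clip(np.linalg.eigvalsh(cov), 0.0, None)
    return float(eigvals.sum() ** 2 / np.square(eigvals).sum())


def subspace_overlap(acts_a: np.ndarray, acts_b: np.ndarray, k: int) -> float:
    """Mean squared cosine of the principal angles between top-k PCA subspaces.

    Returns a value near 1 for identical subspaces and near 0 for orthogonal
    ones. This is an assumed alignment measure, chosen for illustration.
    """
    def top_k_basis(x: np.ndarray) -> np.ndarray:
        centered = x - x.mean(axis=0, keepdims=True)
        # Rows of vt are the principal directions in unit space.
        _, _, vt = np.linalg.svd(centered, full_matrices=False)
        return vt[:k].T  # shape: units x k, orthonormal columns

    basis_a, basis_b = top_k_basis(acts_a), top_k_basis(acts_b)
    # Singular values of basis_a^T basis_b are the cosines of the principal angles.
    cosines = np.linalg.svd(basis_a.T @ basis_b, compute_uv=False)
    return float(np.mean(cosines ** 2))
```

Under these assumptions, a low participation ratio would correspond to the lower-dimensional (rich) regime described in the abstract, and the graded geometry would appear as an overlap score that decreases as tasks become more dissimilar.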


Related papers