Fugu-MT 論文翻訳(概要): Unlocking the Potential of Continual Model Merging: An ODE Perspective

論文の概要: Unlocking the Potential of Continual Model Merging: An ODE Perspective

arxiv url: http://arxiv.org/abs/2605.19409v3
Date: Sat, 23 May 2026 16:39:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 16:32:37.748657
Title: Unlocking the Potential of Continual Model Merging: An ODE Perspective
Title（参考訳）: 連続モデルマージの可能性の解き放つ: ODE の視点から
Authors: Lihong Lin, Haidong Kang,
Abstract要約: CMM(Continuous Model Merging)は、反復的な再トレーニングなしにタスク適応モデルを順次組み込むことで、基礎モデルの迅速なカスタマイズを可能にする。本稿では, 1ステップのエンドポイント更新ではなく, パラメータ空間の軌跡として各マージを定式化する,制御可能なフレームワークODE-Mを提案する。
参考スコア（独自算出の注目度）: 0.42970700836450487
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Continual Model Merging (CMM) enables rapid customization of foundation models by sequentially incorporating task-adapted models without repeated retraining. However, existing merging rules usually update the deployed model through fixed algebraic or projection-based operations, providing limited control over how much previously accumulated knowledge should be retained relative to the incoming task model. This limitation leads to unstable retention and performance degradation in long task streams, and becomes more pronounced when tasks have heterogeneous utilities. We propose ODE-driven Merging (ODE-M), a controllable framework that formulates each continual merge as a trajectory in parameter space rather than a one-step endpoint update. Motivated by mode connectivity, ODE-M constructs a barrier-aware trajectory using a rectified time-dependent velocity field, where lightweight first-order feedback from a small calibration set suppresses loss-increasing motion while preserving progress toward the incoming model. The next merged model is then obtained by selecting an operating point along this trajectory through a utility-aware time schedule, providing an explicit mechanism for balancing retained historical knowledge and incoming task expertise. Extensive experiments on standard CMM benchmarks show that ODE-M consistently improves over strong continual merging baselines across CLIP ViT backbones, stream lengths, and heterogeneous task-utility settings.
Abstract（参考訳）: CMM(Continuous Model Merging)は、反復的な再トレーニングなしにタスク適応モデルを順次組み込むことで、基礎モデルの迅速なカスタマイズを可能にする。しかし、既存のマージルールは通常、固定された代数的または射影的操作を通じてデプロイされたモデルを更新する。この制限は、長いタスクストリームで不安定な保持とパフォーマンスの低下をもたらし、タスクが不均一なユーティリティを持つ場合、より顕著になる。本稿では, 1ステップのエンドポイント更新ではなく, パラメータ空間における各連続的なマージを軌跡として定式化するための, ODE-driven Merging (ODE-M) を提案する。モード接続によりモチベーションされたODE-Mは、補正時間依存速度場を用いてバリア対応軌道を構築し、小さなキャリブレーションセットからの軽量な1次フィードバックは、入ってくるモデルへの進捗を保ちながら損失増加運動を抑制する。次に、この軌道に沿った動作点を実用性を考慮した時間スケジュールで選択し、保持された履歴知識と受信タスクの専門知識のバランスをとるための明確なメカニズムを提供する。標準CMMベンチマークでの大規模な実験により、ODE-MはCLIP ViTバックボーン、ストリーム長、および不均一なタスクユーティリティ設定を含む強力な連続的なマージベースラインよりも一貫して改善されている。

論文の概要: Unlocking the Potential of Continual Model Merging: An ODE Perspective

関連論文リスト