Fugu-MT 論文翻訳(概要): Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning

論文の概要: Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning

arxiv url: http://arxiv.org/abs/2605.20803v1
Date: Wed, 20 May 2026 06:50:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.537932
Title: Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning
Title（参考訳）: Tunable MAGMAX:継続的学習のための参照認識モデルの統合
Authors: Kei Hiroshima, Kento Uchida, Shinichi Shirakawa,
Abstract要約: 連続学習(CL)は、以前に学習した知識の破滅的な忘れを軽減しつつ、複数のタスクでモデルを逐次訓練することを目的としている。近年の大規模事前学習モデル(LPM)とMAGMAXのようなモデルマージ技術は,タスク固有のパラメータを組み合わせることで,効果的なCL性能を実証している。本稿では、CLにおけるタスク固有性能の優先制御を可能にするTunable MAGMAXと呼ばれるモデル統合フレームワークを提案する。
参考スコア（独自算出の注目度）: 1.6822770693792826
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Continual learning (CL) aims to train models sequentially on multiple tasks while mitigating catastrophic forgetting of previously learned knowledge. Recent advances in large pre-trained models (LPMs) and model merging techniques, such as MAGMAX, have demonstrated effective CL performance by combining task-specific parameters. However, existing methods primarily focus on average performance across all tasks and do not adequately address how to construct models accommodating different deployment environments or varying user preferences. This paper proposes a model merging framework, termed Tunable MAGMAX, which enables preference-aware control of task-specific performance in CL. Our method introduces a preference vector that controls the number of elements selected from each task vector during model merging, allowing us to adjust the merged model performance according to their deployment needs. We further propose a method for automatically constructing appropriate preference vectors by leveraging small amounts of target environment data and datasets from model training tasks, thereby eliminating the need for manual specification. The experimental result on CL benchmark tasks demonstrates that Tunable MAGMAX effectively controls task-wise performance and successfully adapts merged models to various target environments. The proposed Tunable MAGMAX achieves superior or comparable performance to baseline methods, making it a practical solution for deploying CL models to various environments where the preferences of each task performance differ.
Abstract（参考訳）: 連続学習(CL)は、以前に学習した知識の破滅的な忘れを軽減しつつ、複数のタスクでモデルを逐次訓練することを目的としている。近年の大規模事前学習モデル(LPM)とMAGMAXのようなモデルマージ技術は,タスク固有のパラメータを組み合わせることで,効果的なCL性能を実証している。しかし、既存のメソッドは主にすべてのタスクの平均的なパフォーマンスに重点を置いており、異なるデプロイメント環境やさまざまなユーザの好みに合わせてモデルを適切に構築する方法には対処していない。本稿では,CLにおけるタスク固有性能を優先的に制御できるTunable MAGMAXというモデル統合フレームワークを提案する。本手法では,モデルマージ時に各タスクベクトルから選択した要素数を制御する選好ベクトルを導入する。また、モデルトレーニングタスクから少量のターゲット環境データとデータセットを活用することにより、適切な選好ベクトルを自動的に構築する手法を提案し、手動仕様の必要性を排除した。 CLベンチマークタスクの実験結果は、Tunable MAGMAXがタスクワイドのパフォーマンスを効果的に制御し、マージしたモデルを様々なターゲット環境に適応させることを実証している。提案したTunable MAGMAXは,各タスク性能の選好が異なる環境にCLモデルをデプロイするための実用的なソリューションである。

論文の概要: Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning

関連論文リスト