Fugu-MT Paper Translation (Abstract): PACT: Preserving Anchored Cores in Task-vectors for Model Merging

Paper abstract: PACT: Preserving Anchored Cores in Task-vectors for Model Merging

arxiv url: http://arxiv.org/abs/2606.18627v2
Date: Fri, 19 Jun 2026 07:07:33 GMT
Status: Translation complete
Last updated in system: 2026-06-23 13:41:30.928145
Title: PACT: Preserving Anchored Cores in Task-vectors for Model Merging
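Title (reference translation): PACT: Preserving Anchored Cores in Task Vectors for Model Merging
Authors: Ningyuan Shi, Zhipeng Zhou, Hao Wang, Chunyan Miao, Peilin Zhao
Abstract summary: Model merging aims to combine multiple task-specific fine-tuned models into a single multi-task model. Most existing model merging approaches follow the Task Arithmetic paradigm. This work proposes PACT, which preserves the anchored task-specific cores (LBW dimensions) within task vectors by aligning their orthogonal complements with the subspace of the pre-trained weights.
Reference score (independently computed attention score): 68.52455853496585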
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Model merging has emerged as a training-free alternative to multi-task learning, aiming to combine multiple task-specific fine-tuned models into a single multi-task model. Most existing model merging approaches follow the Task Arithmetic paradigm, which decomposes fine-tuned weights into pre-trained parameters and task vectors, and performs merging exclusively in the task-vector space. The effectiveness of this paradigm implicitly relies on the assumption that task-specific knowledge is encoded solely within task vectors. We argue that this assumption generally does not hold due to the intrinsic task preferences of pre-trained models. Specifically, we identify Load-Bearing Wall (LBW) dimensions, namely some task-critical knowledge that remains embedded in the pre-trained weights rather than being fully transferred into task vectors. We characterize LBW dimensions from both scalar-weight and subspace perspectives, thereby covering the major paradigms of existing model merging methods. Our analysis reveals that, by ignoring LBW dimensions, task-vector-based approaches fail to fully resolve task conflicts and may inadvertently damage task-specific knowledge encoded in the pre-trained model, leading to degradation. To address this issue, we propose PACT, which preserves the anchored task-specific cores (i.e., LBW dimensions) within task vectors by aligning their orthogonal complements with the subspace of the pre-trained weights. These aligned subspace components are then removed from the task vectors before applying existing model merging algorithms. Furthermore, we develop an efficient variant based on randomized SVD to improve scalability. PACT can be seamlessly integrated with existing methods. Extensive experiments across multiple benchmarks demonstrate that PACT consistently enhances mainstream model merging approaches and establishes new state-of-the-art performance.
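Abstract (reference translation): Model merging has emerged as a training-free alternative to multi-task learning, with the goal of combining multiple task-specific fine-tuned models into a single multi-task model. Most existing model merging approaches follow the Task Arithmetic paradigm, which decomposes fine-tuned weights into pre-trained parameters and task vectors and merges only in the task-vector space. The effectiveness of this paradigm implicitly depends on the assumption that task-specific knowledge is encoded only in the task vectors. We argue that this assumption generally does not hold, because pre-trained models have intrinsic task preferences. Specifically, we identify Load-Bearing Wall (LBW) dimensions: task-critical knowledge that remains embedded in the pre-trained weights rather than being fully transferred into the task vectors. We characterize LBW dimensions from both the scalar-weight and the subspace perspective, covering the major paradigms of existing model merging methods. Our analysis shows that task-vector-based approaches that ignore LBW dimensions cannot fully resolve task conflicts and may inadvertently damage task-specific knowledge encoded in the pre-trained model, leading to degradation. To address this issue, we propose PACT, which preserves the anchored task-specific cores (i.e., LBW dimensions) within task vectors by aligning their orthogonal complements with the subspace of the pre-trained weights. These aligned subspace components are removed from the task vectors before existing model merging algorithms are applied. In addition, we develop an efficient variant based on randomized SVD to improve scalability. PACT can be integrated seamlessly with existing methods. Extensive experiments across multiple benchmarks demonstrate that PACT consistently improves mainstream model merging approaches and establishes new state-of-the-art performance.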
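
The abstract describes the Task Arithmetic decomposition (fine-tuned weights = pre-trained weights + task vector) and a preprocessing step that removes subspace components from task vectors before merging. The following is a minimal NumPy sketch of that general shape, not the paper's algorithm. The function names, the choice of the top-`rank` left singular subspace of the pre-trained matrix, the value of `rank`, and the scaling coefficient `scale` are all assumptions made for illustration; the abstract does not specify how PACT selects the subspace or the rank.

```python
import numpy as np


def task_vectors(pretrained, finetuned_list):
    """Task Arithmetic decomposition: tau_i = W_i - W_0."""
    return [ft - pretrained for ft in finetuned_list]


def remove_subspace_component(tau, pretrained, rank):
    """Hypothetical filter (an assumption, not the paper's procedure):
    remove the part of tau lying in the top-`rank` left singular
    subspace of the pre-trained weight matrix."""
    U, _, _ = np.linalg.svd(pretrained, full_matrices=False)
    U_k = U[:, :rank]                 # orthonormal basis of the subspace
    return tau - U_k @ (U_k.T @ tau)  # subtract the projection onto it


def task_arithmetic_merge(pretrained, taus, scale=0.3):
    """Plain Task Arithmetic merge: W_0 + scale * sum_i tau_i."""
    return pretrained + scale * sum(taus)


# Toy example with random matrices standing in for one weight layer.
rng = np.random.default_rng(0)
W0 = rng.normal(size=(8, 6))
finetuned = [W0 + 0.1 * rng.normal(size=W0.shape) for _ in range(3)]

taus = task_vectors(W0, finetuned)
filtered = [remove_subspace_component(t, W0, rank=2) for t in taus]
merged = task_arithmetic_merge(W0, filtered, scale=0.3)
```

Because the filter only rewrites the task vectors, any merging rule that consumes task vectors could be substituted for `task_arithmetic_merge` here. For large layers, the full SVD could be replaced by a randomized approximation, which is the kind of efficiency variant the abstract mentions.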

Related papers