Fugu-MT 論文翻訳(概要): Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

論文の概要: Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

arxiv url: http://arxiv.org/abs/2604.28109v1
Date: Thu, 30 Apr 2026 16:58:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-01 16:31:54.207644
Title: Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression
Title（参考訳）: Auto-FlexSwitch: 学習可能なタスクベクトル圧縮による効率的な動的モデルマージ
Authors: Junqi Gao, Dazhi Zhang, Zhichang Guo, Biqing Qi, Yi Ran, Wangmeng Zuo,
Abstract要約: タスクベクトルを3つのコンパクトなコンポーネントに分解するT-Switchを提案する。次に,タスクベクトルを自動生成するトレーニングフリーマージスキームであるAuto-Switchを紹介する。本稿では,高効率なタスクベクトル圧縮をサポートする動的モデルマージ手法であるAuto-FlexSwitchを提案する。
参考スコア（独自算出の注目度）: 54.12624130407679
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Model merging has attracted attention as an effective path toward multi-task adaptation by integrating knowledge from multiple task-specific models. Among existing approaches, dynamic merging mitigates performance degradation caused by conflicting parameter updates across tasks by flexibly combining task-specific parameters at inference time, thereby maintaining high performance. However, these methods require storing independent parameters for each task, resulting in prohibitive storage overhead. To address this issue, we first experimentally demonstrate that the fine-tuned weight increments (referred to as task vectors) exhibit an impulse-like activation pattern and high robustness to low-bit representations. Driven by this insight, we propose T-Switch, which decomposes task vectors into three compact components: a binary sparse mask, a sign vector, and a scalar scaling factor, achieving high-fidelity approximation at high compression ratios. We then introduce Auto-Switch, a training-free merging scheme that automatically composes task vectors via feature similarity retrieval. Building on this, we develop Auto-Switch, a training-free merging scheme that automatically assembles task vectors through feature similarity retrieval. Furthermore, to transform task vector sparsification and quantization from static rules to adaptive learning, we propose FlexSwitch, a learnable framework which jointly optimizes the compression strategy for each model unit via Learnable Gating Sparsification (LGS) and Bit-width Adaptive Selection (BAS), while employing the Sparsity-Aware Storage Strategy (SASS) to select the optimal storage encoding structure. Finally, by incorporating a K-Nearest Neighbor (KNN) inference scheme with a learnable low-rank metric, we present Auto-FlexSwitch, a dynamic model merging approach that supports highly efficient task vector compression.
Abstract（参考訳）: モデルマージは、複数のタスク固有のモデルから知識を統合することで、マルチタスク適応への効果的な経路として注目されている。従来のアプローチでは、動的マージはタスク間のパラメータ更新の矛盾に起因するパフォーマンス劣化を軽減し、推論時にタスク固有のパラメータを柔軟に組み合わせ、高いパフォーマンスを維持する。しかし、これらの方法は各タスクに独立したパラメータを格納する必要があるため、結果としてストレージのオーバーヘッドは禁じられる。この問題に対処するために、まず、微調整された重み付け(タスクベクトルと呼ばれる)がインパルス的な活性化パターンを示し、低ビット表現に対して高い堅牢性を示すことを実験的に実証した。この知見に基づいて,タスクベクトルを2次元スパースマスク,符号ベクトル,スカラースケーリング係数の3つのコンパクト成分に分解し,高い圧縮比で高忠実度近似を実現するT-Switchを提案する。次に、特徴類似性検索によってタスクベクトルを自動的に構成する、トレーニング不要なマージスキームであるAuto-Switchを紹介する。そこで我々は,特徴類似性検索によってタスクベクトルを自動生成する,トレーニング不要なマージスキームであるAuto-Switchを開発した。さらに,静的な規則から適応学習へのタスクベクトルスペーシフィケーションと量子化を変換するために,学習可能ゲーティングスペーシフィケーション(LGS)とビット幅適応選択(BAS)を通じて各モデルユニットの圧縮戦略を協調的に最適化する学習可能なフレームワークであるFlexSwitchを提案する。最後に、K-Nearest Neighbor(KNN)推論スキームを学習可能な低ランクメトリックに組み込むことで、高効率なタスクベクトル圧縮をサポートする動的モデルマージアプローチであるAuto-FlexSwitchを提案する。

論文の概要: Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

関連論文リスト