Fugu-MT Paper Translation (Abstract): Bilinear Coordinate Alignment for Training-Free Task-Vector Transfer

Paper overview: Bilinear Coordinate Alignment for Training-Free Task-Vector Transfer

arxiv url: http://arxiv.org/abs/2605.28444v1
Date: Wed, 27 May 2026 13:10:34 GMT
Status: Translation complete
Last updated in system: 2026-05-28 17:38:56.067885
Title: Bilinear Coordinate Alignment for Training-Free Task-Vector Transfer
Title (reference translation): Bilinear Coordinate Alignment for Training-Free Task-Vector Transfer
Authors: Jungyong Son, Jinwook Jung, Minhee Park, Sungyong Baik
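Abstract summary: When a new version of a pre-trained model becomes available, expertise acquired through fine-tuning cannot be directly reused. This paper proposes BiCo, a training-free framework for transferring task vectors through Bilinear Coordinate alignment. BiCo consistently outperforms existing transfer methods across models that differ in width, depth, and pre-training configuration.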
Reference score (independently computed attention score): 13.823003260600663
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Fine-tuning large-scale pre-trained models is a recent prevalent paradigm for adapting general representations to specialized tasks. However, when a new version of a pre-trained model becomes available, expertise acquired through fine-tuning cannot be directly reused because it is tied to the parameterization of the original model, requiring another costly fine-tuning. To address this inefficiency, recent work uses task vectors, defined as the parameter difference between a fine-tuned model and its base model, to transfer expertise across models. While existing methods bridge disparate models by matching activations or gradients, a significant performance gap remains relative to direct fine-tuning, suggesting that these partial correspondences are insufficient. In this work, instead of viewing a task vector merely as a parameter offset, we revisit the formation of task vectors and show that they can be derived as accumulated bilinear interactions between input-side activations and output-side gradients. Motivated by this observation, we formulate task-vector transfer as a dual-space alignment problem and propose BiCo, a training-free framework for transferring task vectors through Bilinear Coordinate alignment. BiCo estimates orthogonal Procrustes mappings in both spaces using a single forward-backward pass on a small calibration set, without any parameter update. Across extensive computer vision and natural language processing benchmarks, BiCo consistently outperforms existing transfer methods across models that differ in width, depth, and pre-training configuration.
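Abstract (reference translation): Fine-tuning large-scale pre-trained models has recently become a prevalent paradigm for adapting general representations to specialized tasks. However, when a new version of a pre-trained model becomes available, the expertise acquired through fine-tuning cannot be directly reused, because it is tied to the parameterization of the original model and another costly round of fine-tuning is required. To address this inefficiency, recent work uses task vectors, defined as the parameter difference between a fine-tuned model and its base model, to transfer expertise across models. Existing methods bridge different models by matching activations or gradients, but a large performance gap remains relative to direct fine-tuning, which suggests that these partial correspondences are insufficient. Rather than viewing a task vector merely as a parameter offset, this work revisits how task vectors are formed and shows that they can be derived as accumulated bilinear interactions between input-side activations and output-side gradients. Based on this observation, the authors formulate task-vector transfer as a dual-space alignment problem and propose BiCo, a training-free framework that transfers task vectors through Bilinear Coordinate alignment. BiCo estimates orthogonal Procrustes mappings in both spaces using a single forward-backward pass on a small calibration set, without updating any parameters. Across extensive computer vision and natural language processing benchmarks, BiCo consistently outperforms existing transfer methods across models that differ in width, depth, and pre-training configuration.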
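
Illustrative sketch (not the authors' implementation): the abstract describes a task vector as an accumulation of bilinear interactions between input-side activations and output-side gradients, and aligns both spaces with orthogonal Procrustes mappings. The NumPy example below is a minimal toy version under strong assumptions: a single linear layer, a task vector formed by one plain gradient step, source and target models of equal width, and target activations and gradients that are exact rotations of the source ones. All names (`procrustes`, `transfer_task_vector`, `acts_src`, and so on) are hypothetical and do not come from the paper. Differing widths or depths, which the paper addresses, are not covered here.

```python
import numpy as np


def procrustes(src, tgt):
    """Orthogonal R minimizing ||src @ R - tgt||_F, with rows as samples."""
    u, _, vt = np.linalg.svd(src.T @ tgt)
    return u @ vt


def transfer_task_vector(delta_w, acts_src, acts_tgt, grads_src, grads_tgt):
    """Map a (d_out, d_in) task vector into the target model's coordinates."""
    r_in = procrustes(acts_src, acts_tgt)     # aligns the input (activation) space
    r_out = procrustes(grads_src, grads_tgt)  # aligns the output (gradient) space
    # As column vectors, a_tgt = r_in.T @ a_src and g_tgt = r_out.T @ g_src,
    # so a sum of outer products g a^T maps to r_out.T @ (sum g a^T) @ r_in.
    return r_out.T @ delta_w @ r_in


rng = np.random.default_rng(0)
n, d_in, d_out, lr = 64, 6, 4, 0.1  # toy calibration-set size and layer shape

# Hypothetical ground-truth rotations relating source and target coordinates.
p_true, _ = np.linalg.qr(rng.normal(size=(d_in, d_in)))
q_true, _ = np.linalg.qr(rng.normal(size=(d_out, d_out)))

acts_src = rng.normal(size=(n, d_in))    # input-side activations, one row per sample
grads_src = rng.normal(size=(n, d_out))  # output-side gradients, one row per sample
acts_tgt = acts_src @ p_true.T
grads_tgt = grads_src @ q_true.T

# One gradient step on a linear layer: delta_W = -lr * sum_i g_i a_i^T.
delta_src = -lr * grads_src.T @ acts_src
delta_tgt = -lr * grads_tgt.T @ acts_tgt

delta_hat = transfer_task_vector(delta_src, acts_src, acts_tgt, grads_src, grads_tgt)
max_err = float(np.max(np.abs(delta_hat - delta_tgt)))
print("max abs error:", max_err)
```

In this idealized setting the transferred task vector matches the target one up to floating-point error, because both alignment maps are recovered exactly. With real models the relation between the two coordinate systems is only approximate, so the same computation would give an approximation rather than an exact match.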
