Fugu-MT 論文翻訳(概要): Understanding and Enforcing Weight Disentanglement in Task Arithmetic

論文の概要: Understanding and Enforcing Weight Disentanglement in Task Arithmetic

arxiv url: http://arxiv.org/abs/2604.17078v1
Date: Sat, 18 Apr 2026 17:34:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.322213
Title: Understanding and Enforcing Weight Disentanglement in Task Arithmetic
Title（参考訳）: タスク算術における重みの絡み合いの理解と強制
Authors: Shangge Liu, Yuehan Yin, Lei Wang, Qi Fan, Yinghuan Shi, Wenbin Li, Yang Gao, Dacheng Tao,
Abstract要約: 私たちは、異なるタスクに異なる内部機能を割り当てるモデルの機能であるTask-Feature(TFS)を紹介します。そこで我々はOrthoRegを提案する。OrthoRegはシンプルで効果的な正規化手法で、内部構造を重みに積極的に適用する。
参考スコア（独自算出の注目度）: 72.17785699918092
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Task arithmetic provides an efficient, training-free way to edit pre-trained models, yet lacks a fundamental theoretical explanation for its success. The existing concept of ``weight disentanglement" describes the ideal outcome of non-interfering task composition but does not reveal its underlying cause. Crucially, what intrinsic properties of the pre-trained model ($θ_0$) or the task vectors ($τ_t$) enable this disentanglement remains underexplored. In this paper, we introduce Task-Feature Specialization (TFS), a model's ability to allocate distinct internal features to different tasks, as the fundamental principle. We first prove that TFS is a sufficient condition for weight disentanglement. More importantly, we find that TFS also gives rise to an observable geometric consequence: weight vector orthogonality. This positions TFS as the common cause for both the desired functional outcome (disentanglement) and a measurable geometric property (orthogonality). This relationship provides the key insight for our method: since the abstract TFS property is intractable to enforce directly, we can instead promote weight disentanglement by shaping its concrete geometric consequence, orthogonality. Therefore, we propose OrthoReg, a simple and effective regularization method that actively enforces an internal orthogonal structure on weight updates ($ΔW$) that constitute $τ_t$ during fine-tuning. And we theoretically prove that OrthoReg promotes disentanglement. Extensive experiments demonstrate that OrthoReg consistently and significantly enhances the performance of various task arithmetic methods. Code is available at \href{https://github.com/RL-MIND/OrthoReg}{https://github.com/RL-MIND/OrthoReg}.
Abstract（参考訳）: タスク算術は、事前訓練されたモデルを編集するための効率的で訓練のない方法を提供するが、その成功に対する基本的な理論的説明は欠如している。既存の「重みの絡み合い」の概念は、非干渉的タスク構成の理想的な結果を示すものであるが、その根本原因は明らかになっていない。重要なことに、事前訓練されたモデル(θ_0$)やタスクベクトル(τ_t$)の固有の性質は、この非絡み合いを過小評価することを可能にしている。本稿では,タスク・フィーチャー・スペシャライゼーション(TFS, Task-Feature Specialization)を基本原理として,異なるタスクに異なる内部機能を割り当てるモデルの能力を紹介する。まず、TFSが重みの絡み合うのに十分な条件であることを証明します。さらに重要なのは、TFSが観測可能な幾何学的結果、すなわち重みベクトル直交をもたらすことだ。これは TFS を所望の関数結果(アンタングルメント)と測定可能な幾何学的性質(直交性)の両方の共通原因として位置付ける。抽象TFSプロパティは直接的に強制することができるため、具体的幾何学的結果、直交性を形作ることで重みの絡み合いを促進することができる。そこで我々はOrthoRegを提案する。OrthoRegは、微調整中に$τ_t$を構成する重量更新(ΔW$)において、内部直交構造を積極的に強制するシンプルで効果的な正規化手法である。そして、理論上OrthoRegが絡み合いを促進することを証明します。大規模な実験により、OrthoRegは、様々なタスク演算手法の性能を一貫して、そして大幅に向上させることが示された。コードは \href{https://github.com/RL-MIND/OrthoReg}{https://github.com/RL-MIND/OrthoReg} で公開されている。

論文の概要: Understanding and Enforcing Weight Disentanglement in Task Arithmetic

関連論文リスト