Fugu-MT 論文翻訳(概要): W2T: LoRA Weights Already Know What They Can Do

論文の概要: W2T: LoRA Weights Already Know What They Can Do

arxiv url: http://arxiv.org/abs/2603.15990v1
Date: Mon, 16 Mar 2026 23:01:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.032056
Title: W2T: LoRA Weights Already Know What They Can Do
Title（参考訳）: W2T:LoRAは自分ができることをすでに知っている
Authors: Xiaolong Han, Ferrante Neri, Zijian Jiang, Fang Wu, Yanfang Ye, Lu Yin, Zehong Wang,
Abstract要約: QR分解により各LoRA更新を正準形式にマッピングする手法を提案する。生成されたコンポーネントはトークン化され、Transformerによって処理され、重み空間の埋め込みを生成する。言語と視覚のLoRAコレクション全体にわたって、W2Tは分類、性能予測、アダプタ検索において強力な結果を得る。
参考スコア（独自算出の注目度）: 27.25969992553623
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Each LoRA checkpoint compactly stores task-specific updates in low-rank weight matrices, offering an efficient way to adapt large language models to new tasks and domains. In principle, these weights already encode what the adapter does and how well it performs. In this paper, we ask whether this information can be read directly from the weights, without running the base model or accessing training data. A key obstacle is that a single LoRA update can be factorized in infinitely many ways. Without resolving this ambiguity, models trained on the factors may fit the particular factorization rather than the underlying update. To this end, we propose \methodfull, which maps each LoRA update to a provably canonical form via QR decomposition followed by SVD, so that all equivalent factorizations share the same representation. The resulting components are then tokenized and processed by a Transformer to produce a weight-space embedding. Across language and vision LoRA collections, W2T achieves strong results on attribute classification, performance prediction, and adapter retrieval, demonstrating that LoRA weights reliably indicate model behavior once factorization ambiguity is removed. Code is available at https://github.com/xiaolonghan2000/Weight2Token.
Abstract（参考訳）: 各LoRAチェックポイントは、タスク固有の更新を低ランクの重み付け行列にコンパクトに格納し、大きな言語モデルを新しいタスクやドメインに適応する効率的な方法を提供する。原則として、これらの重みはすでにアダプタの動作と性能をコード化しています。本稿では,ベースモデルを実行したり,トレーニングデータにアクセスしたりすることなく,重みからこの情報を直接読み取ることができるかどうかを問う。重要な障害は、単一のLoRA更新を無限に多くの方法で分解できることだ。この曖昧さを解決せずに、これらの要因に基づいてトレーニングされたモデルは、基盤となる更新よりも特定の要因化に適合する可能性がある。この目的のために,各 LoRA 更新を QR 分解に続いて SVD による証明可能な正準形式にマッピングする \methodfull を提案する。生成されたコンポーネントはトークン化され、Transformerによって処理され、重み空間の埋め込みを生成する。言語と視覚のLoRAコレクション全体にわたって、W2Tは属性分類、性能予測、アダプタ検索において強力な結果を達成し、因数分解の曖昧さが除去されると、LoRAの重みがモデルの振る舞いを確実に示すことを示した。コードはhttps://github.com/xiaolonghan2000/Weight2Token.comから入手できる。

論文の概要: W2T: LoRA Weights Already Know What They Can Do

関連論文リスト