Fugu-MT 論文翻訳(概要): LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis

論文の概要: LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis

arxiv url: http://arxiv.org/abs/2510.24561v1
Date: Tue, 28 Oct 2025 15:55:36 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-29 15:35:37.262231
Title: LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis
Title（参考訳）: LoRA-DA:漸近解析による低ランク適応のためのデータ認識初期化
Authors: Qingyue Zhang, Chang Chu, Tianren Peng, Qi Li, Xiangyang Luo, Zhihao Jiang, Shao-Lun Huang,
Abstract要約: データ認識型LoRAの理論的枠組みを確立した。そこで我々は,目標領域の少数のサンプルから最適化問題の項を推定するアルゴリズムLoRA-DAを開発した。さらなる研究により、より速く、より安定な収束、ランク間の堅牢性、LoRA-DAのオーバーヘッドはわずかである。
参考スコア（独自算出の注目度）: 33.708800231646606
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the widespread adoption of LLMs, LoRA has become a dominant method for PEFT, and its initialization methods have attracted increasing attention. However, existing methods have notable limitations: many methods do not incorporate target-domain data, while gradient-based methods exploit data only at a shallow level by relying on one-step gradient decomposition, which remains unsatisfactory due to the weak empirical performance of the one-step fine-tuning model that serves as their basis, as well as the fact that these methods either lack a rigorous theoretical foundation or depend heavily on restrictive isotropic assumptions. In this paper, we establish a theoretical framework for data-aware LoRA initialization based on asymptotic analysis. Starting from a general optimization objective that minimizes the expectation of the parameter discrepancy between the fine-tuned and target models, we derive an optimization problem with two components: a bias term, which is related to the parameter distance between the fine-tuned and target models, and is approximated using a Fisher-gradient formulation to preserve anisotropy; and a variance term, which accounts for the uncertainty introduced by sampling stochasticity through the Fisher information. By solving this problem, we obtain an optimal initialization strategy for LoRA. Building on this theoretical framework, we develop an efficient algorithm, LoRA-DA, which estimates the terms in the optimization problem from a small set of target domain samples and obtains the optimal LoRA initialization. Empirical results across multiple benchmarks demonstrate that LoRA-DA consistently improves final accuracy over existing initialization methods. Additional studies show faster, more stable convergence, robustness across ranks, and only a small initialization overhead for LoRA-DA. The source code will be released upon publication.
Abstract（参考訳）: LLMの普及に伴い、LoRAはPEFTの主要な手法となり、その初期化手法が注目されている。しかし、既存の手法では、多くの手法がターゲットドメインデータを組み込んでいないが、勾配に基づく手法は1ステップの勾配分解に頼り、基礎となる1ステップの微調整モデルの弱い経験的性能や、厳密な理論的基礎が欠如している、あるいは制限的な等方的仮定に大きく依存しているという事実により、不満足なままである。本稿では,漸近解析に基づくデータ認識型LoRAの初期化のための理論的枠組みを確立する。微調整モデルと対象モデル間のパラメータ差の期待を最小化する汎用最適化目標から、微調整モデルと対象モデル間のパラメータ距離に関連するバイアス項と、異方性を保存するためにフィッシャー勾配の定式化を用いて近似された偏差項と、フィッシャー情報を通して確率性をサンプリングすることによって生じる不確実性を説明する分散項の2つの成分を用いて最適化問題を導出する。この問題を解決することで、LoRAの最適初期化戦略を得る。この理論的枠組みに基づいて、最適化問題の項を少数の対象領域サンプルから推定し、最適なLoRA初期化を求める効率的なアルゴリズムであるLoRA-DAを開発する。複数のベンチマークにまたがる実験結果から、LoRA-DAは既存の初期化手法よりも常に最終的な精度を向上することが示された。さらなる研究により、より速く、より安定な収束、ランク間の堅牢性、そしてLoRA-DAの小さな初期化オーバーヘッドのみが示される。ソースコードは公開時に公開される。

論文の概要: LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis

関連論文リスト