Fugu-MT 論文翻訳(概要): MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

論文の概要: MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

arxiv url: http://arxiv.org/abs/2605.07850v1
Date: Fri, 08 May 2026 15:13:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:39.148271
Title: MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning
Title（参考訳）: MatryoshkaLora: LLMファインチューニングのための正確な階層的低ランク表現の学習
Authors: Ionut-Vlad Modoranu, Mher Safaryan, Dan Alistarh,
Abstract要約: MatryoshkaLoRAは、固定された慎重に作られた対角行列を既存のLoRAアダプタ間で$P$で挿入することで、正確な階層的な低ランク表現を学習する。以上の結果から,MatryoshkaLoraは従来のランク適応手法よりも,より正確な階層的な低ランク表現を学習できることが示唆された。
参考スコア（独自算出の注目度）: 45.41768510181293
License: http://creativecommons.org/licenses/by/4.0/
Abstract: With the rise in scale for deep learning models to billions of parameters, the computational cost of fine-tuning remains a significant barrier to deployment. While Low-Rank Adaptation (LoRA) has become the standard for parameter-efficient fine-tuning, the need to set a predefined, static rank $r$ requires exhaustive grid searches to balance efficiency and performance. Existing rank-adaptive solutions such as DyLoRA mitigate this by sampling ranks during the training from a predefined distribution. However, they often yield sub-optimal results at higher ranks due to lack of consistent gradient signals across the full hierarchy of ranks, thus making these methods data-inefficient. In this paper, we propose MatryoshkaLoRA, a general, Matryoshka-inspired training framework for LoRA that learns accurate hierarchical low-rank representations by inserting a fixed, carefully crafted diagonal matrix $P$ between the existing LoRA adapters to scale their sub-ranks accordingly. By introducing this simple modification, our general framework recovers LoRA and DyLoRA only by changing $P$ and ensures all sub-ranks embed the available gradient information efficiently. Our MatryoshkaLoRA supports dynamic rank selection with minimal degradation in accuracy. We further propose Area Under the Rank Accuracy Curve (AURAC), a metric that consistently evaluates the performance of hierarchical low-rank adapters. Our results demonstrate that MatryoshkaLoRA learns more accurate hierarchical low-rank representations than prior rank-adaptive approaches and achieves superior accuracy-performance trade-offs across ranks on the evaluated datasets. Our code is available at https://github.com/IST-DASLab/MatryoshkaLoRA.
Abstract（参考訳）: ディープラーニングモデルの数十億のパラメータへのスケールの増大により、微調整の計算コストは、デプロイメントにとって大きな障壁となっている。 Low-Rank Adaptation (LoRA) はパラメータ効率の高い微調整の標準となっているが、事前定義された静的ランクを$r$に設定する必要があるため、効率と性能のバランスをとるために網羅的なグリッド探索が必要である。 DyLoRAのような既存のランク適応解は、事前定義された分布からトレーニング中にランクをサンプリングすることによってこれを緩和する。しかし、ランクの完全な階層にわたって一貫した勾配信号が欠如しているため、より高いランクで準最適結果を得ることが多いため、これらの手法はデータ非効率である。本稿では, 既存のLoRAアダプタ間で, 厳密な対角行列を挿入して, 精度の高い階層的低ランク表現を学習する, 汎用のMateryoshka-inspireed training frameworkであるMatryoshkaLoRAを提案する。この簡単な修正を導入することで、我々の一般的なフレームワークは、$P$を変更するだけでLoRAとDyLoRAを回復し、利用可能な勾配情報を効率的に埋め込めるようにします。私たちのMatryoshkaLoRAは、精度の低下を最小限に抑えた動的ランク選択をサポートしています。さらに、階層的な低ランクアダプタの性能を継続的に評価する指標であるAURAC(Area Under the Rank Accuracy Curve)を提案する。以上の結果から,MetryoshkaLoRAは従来のランク適応手法よりも高精度な階層的低ランク表現を学習し,評価されたデータセット上のランク間での精度・性能のトレードオフを達成できることが示唆された。私たちのコードはhttps://github.com/IST-DASLab/MatryoshkaLoRAで公開されています。

論文の概要: MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

関連論文リスト