Fugu-MT 論文翻訳(概要): Low-Rank Adaptation Redux for Large Models

論文の概要: Low-Rank Adaptation Redux for Large Models

arxiv url: http://arxiv.org/abs/2604.21905v1
Date: Thu, 23 Apr 2026 17:50:23 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.806386
Title: Low-Rank Adaptation Redux for Large Models
Title（参考訳）: 大規模モデルに対する低ランク適応型Redux
Authors: Bingcong Li, Yilang Zhang, Georgios B. Giannakis,
Abstract要約: 低ランク適応(LoRA)は、基礎モデルのパラメータ効率微調整(PEFT)のデファクトスタンダードとして登場した。実証的な成功と変種の普及にもかかわらず、どのアーキテクチャの選択や最適化手法、デプロイの制約が実用的な方法の選択を導くべきかは、いまだ解明されていない。この概要は、信号処理(SP)のレンズを通してLoRAを再検討し、古典的な低ランクモデリングツールと逆問題で近代的なアダプタ設計をブリッジする。
参考スコア（独自算出の注目度）: 41.28925127311434
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Low-rank adaptation (LoRA) has emerged as the de facto standard for parameter-efficient fine-tuning (PEFT) of foundation models, enabling the adaptation of billion-parameter networks with minimal computational and memory overhead. Despite its empirical success and rapid proliferation of variants, it remains elusive which architectural choices, optimization techniques, and deployment constraints should guide practical method selection. This overview revisits LoRA through the lens of signal processing (SP), bridging modern adapter designs with classical low-rank modeling tools and inverse problems, as well as highlighting how SP principles can inform principled advances of fine-tuning approaches. Rather than providing a comprehensive enumeration and empirical comparisons of LoRA variants, emphasis is placed on the technical mechanisms underpinning these approaches to justify their effectiveness. These advances are categorized into three complementary axes: architectural design, efficient optimization, and pertinent applications. The first axis builds on singular value decomposition (SVD)-based factorization, rank-augmentation constructions, and cross-layer tensorization, while the second axis deals with initialization, alternating solvers, gauge-invariant optimization, and parameterization-aware methods. Beyond fine-tuning, emerging applications of LoRA are accounted across the entire lifecycle of large models, ranging from pre- and post-training to serving/deployment. Finally, open research directions are outlined at the confluence of SP and deep learning to catalyze a bidirectional frontier: classical SP tools provide a principled vocabulary for designing principled PEFT methods, while the unique challenges facing modern deep learning, especially the overwhelming scale and prohibitive overhead, also offer new research lines benefiting the SP community in return.
Abstract（参考訳）: 低ランク適応(LoRA)は、基礎モデルのパラメータ効率のよい微調整(PEFT)のデファクトスタンダードとして登場し、計算とメモリのオーバーヘッドを最小限に抑えた数十億パラメータネットワークの適応を可能にした。実証的な成功と変種の普及にもかかわらず、どのアーキテクチャの選択や最適化手法、デプロイの制約が実用的な方法の選択を導くべきかは、いまだ解明されていない。この概要は、信号処理(SP)のレンズを通してLoRAを再検討し、古典的な低ランクモデリングツールと逆問題で近代的なアダプタ設計をブリッジし、SP原則が微調整アプローチの原則的な進歩にどのように役立つかを強調した。 LoRA変異の包括的列挙と経験的比較を提供するのではなく、これらのアプローチを正当化する技術的なメカニズムに重点を置いている。これらの進歩は、アーキテクチャ設計、効率的な最適化、関連するアプリケーションという3つの相補的な軸に分類される。第1軸は特異値分解(SVD)に基づく分解、階数拡大構造、および層間テンソル化に基づいて構築され、第2軸は初期化、交互化ソルバ、ゲージ不変最適化、パラメータ化対応法を扱う。微調整以外にも、LoRAの新しいアプリケーションは、事前トレーニングやポストトレーニングからサービス/デプロイまで、大規模なモデルのライフサイクル全体にわたって説明されている。古典的なSPツールは、原則化されたPEFTメソッドを設計するための原則的な語彙を提供する一方、現代のディープラーニング、特に圧倒的な規模と禁止的なオーバーヘッドに直面する固有の課題は、SPコミュニティの見返りとしてSPコミュニティに恩恵を与える新しい研究ラインを提供する。

論文の概要: Low-Rank Adaptation Redux for Large Models

関連論文リスト