Fugu-MT Paper Translation (Summary): FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

Paper summary: FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

arxiv url: http://arxiv.org/abs/2605.22869v1
Date: Tue, 19 May 2026 22:11:25 GMT
Status: Translation complete
Last updated in system: 2026-05-25 17:29:20.00864
Title: FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning
Authors: Yequan Zhao, Ruijie Zhang, Liyan Tan, Niall Moran, Tong Qin, Zheng Zhang
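Abstract summary: Both full fine-tuning (Full FT) and parameter-efficient fine-tuning introduce weight updates without accounting for the spectral structure established during pretraining. This paper proposes FuRA, an efficient full-rank adaptation framework based on a block tensor-train factorization W = LSR. This design simultaneously provides full-rank spectral preconditioning, preserves full-rank update expressivity, and achieves parameter, memory, and step-time efficiency comparable to LoRA.
Reference score (independently computed attention score): 7.497481049158277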
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Both full fine-tuning (Full FT) and parameter-efficient fine-tuning methods such as LoRA introduce weight updates without accounting for the spectral structure established during pretraining. As a result, noisy gradients from limited fine-tuning data can perturb robust pretrained features. We identify spectral preconditioning as the missing ingredient: reparameterizing each weight matrix through its full-rank singular value decomposition (SVD) and freezing one singular basis constrains updates to the pretrained column space, yielding a preconditioned optimization scheme that outperforms unconstrained Full FT at the same trainable parameter count. Building on this insight, we propose FuRA (Full-Rank Adaptation), an efficient full-rank adaptation framework based on a block tensor-train factorization W = LSR, where the large core L is fixed to the pretrained block-wise SVD basis, while only the compact core R and the block-wise singular values S are optimized. This design simultaneously provides full-rank spectral preconditioning, preserves full-rank update expressivity, and achieves parameter, memory, and step-time efficiency comparable to LoRA. FuRA consistently outperforms Full FT across multiple settings, including LLM fine-tuning (+1.37 on LLaMA-3-8B commonsense reasoning), LLM reinforcement learning for mathematical reasoning, and visual instruction tuning for VLMs. Furthermore, the 4-bit quantized variant, QFuRA, also surpasses QLoRA. Code is available at https://github.com/olokevin/FuRA-NIPS
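The spectral preconditioning idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation and does not reproduce the block tensor-train factorization W = LSR; it only shows the plain SVD reparameterization under stated assumptions: the weight is a tall m x n matrix, the frozen basis is taken to be the left singular basis U, and a random perturbation stands in for an optimizer step. All variable names here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical "pretrained" weight. It is tall (m > n) so that its
# column space is a proper n-dimensional subspace of R^m.
m, n = 8, 4
W = rng.standard_normal((m, n))

# Thin SVD: W = U @ diag(s) @ Vt, with U (m x n), s (n,), Vt (n x n).
U, s, Vt = np.linalg.svd(W, full_matrices=False)


def reconstruct(U, s, Vt):
    """Rebuild the weight from its factors: U @ diag(s) @ Vt."""
    return (U * s) @ Vt


# Assumption: U is frozen, while s and Vt are the trainable parameters.
# A random perturbation stands in for a gradient step on s and Vt.
s_new = s + 0.1 * rng.standard_normal(n)
Vt_new = Vt + 0.1 * rng.standard_normal((n, n))

W_new = reconstruct(U, s_new, Vt_new)
delta_W = W_new - W

# Part of the update that falls outside span(U), the pretrained column space.
residual = delta_W - U @ (U.T @ delta_W)
outside_norm = np.linalg.norm(residual)

# Rank of the update; it is not restricted to a low rank r < n as in LoRA.
update_rank = np.linalg.matrix_rank(delta_W)

print("norm of update outside span(U):", outside_norm)
print("rank of update:", update_rank)
```

Because every reachable weight has the form U @ M for some n x n matrix M, the update stays inside the pretrained column space by construction, while M itself is unconstrained in rank. The memory and step-time savings claimed for FuRA come from the block-wise structure described in the abstract, which this sketch leaves out.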

Related papers