Fugu-MT 論文翻訳(概要): Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

論文の概要: Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

arxiv url: http://arxiv.org/abs/2602.19111v1
Date: Sun, 22 Feb 2026 09:54:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.484587
Title: Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models
Title（参考訳）: Astra: 大規模言語モデルのアクティベーション空間Tail-Eigenvector Low-Rank適応
Authors: Kainan Liu, Yong Zhang, Ning Cheng, Yun Zhu, Yanmeng Wang, Shaojun Wang, Jing Xiao,
Abstract要約: 本稿では,事前学習したモデルを下流タスクに適用するための新しいPEFT手法であるAstraを提案する。 Astraは16のベンチマークで既存のPEFTベースラインを一貫して上回り、特定のシナリオではFFT(Full Fine-tuning)を超えている。
参考スコア（独自算出の注目度）: 37.89933976872072
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods, especially LoRA, are widely used for adapting pre-trained models to downstream tasks due to their computational and storage efficiency. However, in the context of LoRA and its variants, the potential of activation subspaces corresponding to tail eigenvectors remains substantially under-exploited, which may lead to suboptimal fine-tuning performance. In this work, we propose Astra (Activation-Space Tail-Eigenvector Low-Rank Adaptation), a novel PEFT method that leverages the tail eigenvectors of the model output activations-estimated from a small task-specific calibration set-to construct task-adaptive low-rank adapters. By constraining updates to the subspace spanned by these tail eigenvectors, Astra achieves faster convergence and improved downstream performance with a significantly reduced parameter budget. Extensive experiments across natural language understanding (NLU) and natural language generation (NLG) tasks demonstrate that Astra consistently outperforms existing PEFT baselines across 16 benchmarks and even surpasses full fine-tuning (FFT) in certain scenarios.
Abstract（参考訳）: パラメータ効率のよいファインチューニング(PEFT)法、特にLoRA法は、計算と記憶効率のため、トレーニング済みのモデルを下流のタスクに適応するために広く用いられている。しかし、LoRAとその変種(英語版)の文脈では、尾固有ベクトルに対応する活性化部分空間のポテンシャルは、ほとんど明らかにされていないままであり、これは準最適微調整性能をもたらす可能性がある。本研究では,タスク固有キャリブレーションセットから推定したモデル出力アクティベーションのテール固有ベクトルを利用して,タスク適応型低ランクアダプタを構築する,新しいPEFT手法であるAstraを提案する。これらの尾の固有ベクトルにまたがる部分空間の更新を制限することにより、Astraはより高速な収束を実現し、パラメータ予算を大幅に削減して下流のパフォーマンスを向上する。自然言語理解(NLU)と自然言語生成(NLG)タスクにわたる広範な実験により、Astraは16のベンチマークで既存のPEFTベースラインを一貫して上回り、特定のシナリオにおいて完全な微調整(FFT)を超えている。

論文の概要: Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

関連論文リスト