Fugu-MT 論文翻訳(概要): Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection

論文の概要: Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection

arxiv url: http://arxiv.org/abs/2605.08177v1
Date: Tue, 05 May 2026 08:26:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:49.429269
Title: Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection
Title（参考訳）: Echo-LoRA:クロスレイヤ表現注入によるパラメータ効率の良いファインチューニング
Authors: Yihang Peng, Peng Jin, Jie Gong, Xingyuan Chen, Lingjiao Xu, Ning Su, Yan Ran,
Abstract要約: パラメータ効率の良い微調整のための層間表現注入法であるEcho-LoRAを提案する。トレーニング中、Echo-LoRAは、深いソース層から境界隠れ状態を収集し、それらをサンプルレベルのエコー表現に集約し、軽量なプロジェクションネットワークを使用して、結果の信号を浅いLoRAまたはDoRAモジュールに注入する。 8つのコモンセンス推論ベンチマークにおいて、Echo-LoRAはLLaMA-7B、LLaMA2-7B、LLaMA3-8Bで報告されたLoRAベースラインを超える。
参考スコア（独自算出の注目度）: 13.686308852232857
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Parameter-efficient fine-tuning (PEFT) has become a practical route for adapting large language models to downstream tasks, with LoRA-style methods being particularly attractive because they are inexpensive to train and easy to deploy. Most LoRA variants, however, revise the update rule within the weight space of each layer and leave the intermediate representations formed by deeper layers largely unused. We propose Echo-LoRA, a cross-layer representation injection method for parameter-efficient fine-tuning. During training, Echo-LoRA collects boundary hidden states from deeper source layers, aggregates them into a sample-level echo representation, and uses lightweight projection and gating networks to inject the resulting signal into shallow LoRA or DoRA modules. Answer-only masking, masked distillation, and stochastic routing are used to keep this auxiliary path stable and to reduce the gap between training and inference. On eight commonsense reasoning benchmarks, Echo-LoRA exceeds the reported LoRA baselines by 5.7 percentage points on average across LLaMA-7B, LLaMA2-7B, and LLaMA3-8B. Under reproduced LoRA baselines in our unified implementation, the average gain is 3.0 points; when combined with DoRA, the gain is 2.7 points. The Echo path is discarded after training, so the deployed model keeps the original low-rank LoRA/DoRA form and adds neither inference-time parameters nor inference computation.
Abstract（参考訳）: パラメータ効率のよい微調整(PEFT)は、大規模言語モデルを下流タスクに適応させる実用的な方法となっている。しかし、ほとんどの LoRA 変種は、各層の重み空間内で更新規則を改訂し、より深い層によって形成された中間表現をほとんど使用しないままにしている。パラメータ効率の良い微調整のための層間表現注入法であるEcho-LoRAを提案する。トレーニング中、Echo-LoRAは、深いソース層から境界隠れ状態を収集し、それらをサンプルレベルのエコー表現に集約し、軽量なプロジェクションとゲーティングネットワークを使用して、結果の信号を浅いLoRAまたはDoRAモジュールに注入する。補助経路を安定に保ち、トレーニングと推論のギャップを減らすために、解答のみのマスク、マスク付き蒸留、確率的ルーティングが使用される。 8つのコモンセンス推論ベンチマークにおいて、Echo-LoRAはLLaMA-7B、LLaMA2-7B、LLaMA3-8Bの平均で報告されたLoRAベースラインを5.7%上回る。我々の統一実装における再現されたLoRAベースラインの下では、平均利得は3.0ポイントであり、DoRAと組み合わせると、利得は2.7ポイントである。 Echoパスはトレーニング後に破棄されるため、デプロイされたモデルはオリジナルのローランクのLoRA/DoRAフォームを保持し、推論時間パラメータも推論計算も追加しない。

論文の概要: Echo-LoRA: Parameter-Efficient Fine-Tuning via Cross-Layer Representation Injection

関連論文リスト