Fugu-MT 論文翻訳(概要): LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning

論文の概要: LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning

arxiv url: http://arxiv.org/abs/2602.13073v1
Date: Fri, 13 Feb 2026 16:32:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-16 23:37:54.035579
Title: LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning
Title（参考訳）: LCSB: メモリ効率の良いオンデバイスLCM微細調整のための層状回路選択バックプロパゲーション
Authors: Juneyoung Park, Eunbeen Yoon, Seongwan Kim. Jaeho Lee,
Abstract要約: メモリ効率のよいバックプロパゲーション(MeBP)により、1GB未満のメモリを持つモバイルデバイス上での大規模言語モデル(LLM)の1次微調整が可能になった。本稿では,ステップごとのレイヤのサブセットのみの勾配を計算するLCSB(Layer-Cyclic Selective Backpropagation)を提案する。
参考スコア（独自算出の注目度）: 3.179758551591901
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Memory-efficient backpropagation (MeBP) has enabled first-order fine-tuning of large language models (LLMs) on mobile devices with less than 1GB memory. However, MeBP requires backward computation through all transformer layers at every step, where weight decompression alone accounts for 32--42% of backward time. We propose Layer-Cyclic Selective Backpropagation (LCSB), which computes gradients for only a subset of layers per step. Our key insight is that residual connections guarantee gradient flow through identity paths, while AdamW momentum provides implicit updates for non-selected layers. We interpret LCSB as Block Coordinate Descent on the LoRA parameter space, providing theoretical justification for convergence. LCSB achieves up to 1.40$\times$ speedup with less than 2\% quality degradation across five models and three tasks. Surprisingly, in 4-bit quantized settings, LCSB exhibits superior stability: a 3B model that completely diverges under full backpropagation converges smoothly with LCSB, suggesting an implicit regularization effect from selective gradient computation.
Abstract（参考訳）: メモリ効率のよいバックプロパゲーション(MeBP)により、1GB未満のメモリを持つモバイルデバイス上での大規模言語モデル(LLM)の1次微調整が可能になった。しかし、MeBPは全てのステップで全ての変圧器層を通して逆方向の計算を必要とし、重量減圧だけで32～42%の逆方向の時間を消費する。本稿では,ステップごとのレイヤのサブセットのみの勾配を計算するLCSB(Layer-Cyclic Selective Backpropagation)を提案する。私たちのキーとなる洞察は、残余接続はアイデンティティパスの勾配フローを保証するのに対して、AdamWモメンタは非選択層に対して暗黙の更新を提供する、ということです。我々は LCSB を LoRA パラメータ空間上のブロック座標 Descent と解釈し、収束の理論的正当化を与える。 LCSBは最大1.40$\times$のスピードアップを実現し、5つのモデルと3つのタスクで2\%以下の品質劣化を実現している。驚くべきことに、LCSBは4ビット量子化設定において優れた安定性を示し、完全なバックプロパゲーションの下で完全に分岐する3BモデルはLCSBと滑らかに収束し、選択的勾配計算から暗黙の正規化効果が示唆される。

論文の概要: LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning

関連論文リスト