Fugu-MT 論文翻訳(概要): FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning

論文の概要: FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning

arxiv url: http://arxiv.org/abs/2510.12078v1
Date: Tue, 14 Oct 2025 02:40:45 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 19:02:32.152655
Title: FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning
Title（参考訳）: FedLoDrop: LLMファインチューニングのためのフェデレーションLoRAとドロップアウト
Authors: Sijing Xie, Dingzhu Wen, Changsheng You, Qimei Chen, Mehdi Bennis, Kaibin Huang,
Abstract要約: 大規模言語モデル(LLM)は、特定のタスクに汎用モデルを適用するために不可欠である。本稿では,FedLoDropを用いたFederated LoRAを提案する。FedLoDropは,Federated LoRAのトレーニング可能な行列の行と列にドロップアウトを適用する新しいフレームワークである。
参考スコア（独自算出の注目度）: 65.26899091946417
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Fine-tuning (FT) large language models (LLMs) is crucial for adapting general-purpose models to specific tasks, enhancing accuracy and relevance with minimal resources. To further enhance generalization ability while reducing training costs, this paper proposes Federated LoRA with Dropout (FedLoDrop), a new framework that applies dropout to the rows and columns of the trainable matrix in Federated LoRA. A generalization error bound and convergence analysis under sparsity regularization are obtained, which elucidate the fundamental trade-off between underfitting and overfitting. The error bound reveals that a higher dropout rate increases model sparsity, thereby lowering the upper bound of pointwise hypothesis stability (PHS). While this reduces the gap between empirical and generalization errors, it also incurs a higher empirical error, which, together with the gap, determines the overall generalization error. On the other hand, though dropout reduces communication costs, deploying FedLoDrop at the network edge still faces challenges due to limited network resources. To address this issue, an optimization problem is formulated to minimize the upper bound of the generalization error, by jointly optimizing the dropout rate and resource allocation subject to the latency and per-device energy consumption constraints. To solve this problem, a branch-and-bound (B\&B)-based method is proposed to obtain its globally optimal solution. Moreover, to reduce the high computational complexity of the B\&B-based method, a penalized successive convex approximation (P-SCA)-based algorithm is proposed to efficiently obtain its high-quality suboptimal solution. Finally, numerical results demonstrate the effectiveness of the proposed approach in mitigating overfitting and improving the generalization capability.
Abstract（参考訳）: 微細チューニング(FT)大言語モデル(LLM)は、汎用モデルを特定のタスクに適応させ、最小限のリソースで正確性と関連性を高めるために重要である。本稿では,FedLoDropを用いたFederated LoRAを提案する。FedLoDropは,FedLoRAのトレーニング可能な行列の行と列にドロップアウトを適用した新しいフレームワークである。疎度正則化の下での一般化誤差境界と収束解析が得られ、不適合と過適合の基本的なトレードオフを解明する。誤差境界は、より高いドロップアウト率がモデル間隔を増大させ、ポイントワイズ仮説安定性(PHS)の上限を下げることを明らかにする。これにより、経験的誤差と一般化誤差のギャップが減るが、経験的誤差も高くなり、その差と合わせて全体の一般化誤差が決定される。一方、ドロップアウトは通信コストを削減しますが、ネットワークエッジにFedLoDropをデプロイすることは、ネットワークリソースが限られているため、依然として課題に直面しています。この問題に対処するために、レイテンシとデバイス毎のエネルギー消費制約を考慮したドロップアウト率とリソース割り当てを協調的に最適化することにより、一般化誤差の上限を最小化する最適化問題を定式化する。この問題を解決するために,ブランチ・アンド・バウンド(B\&B)に基づく手法を提案し,その大域的最適解を求める。さらに,B\&B法における計算複雑性を低減するため,高次凸近似法(P-SCA)に基づくアルゴリズムを提案し,高品質な準最適解を求める。最後に, 数値計算により, オーバーフィッティングを緩和し, 一般化能力を向上する手法の有効性を示した。

論文の概要: FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning

関連論文リスト