Fugu-MT 論文翻訳(概要): Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems

論文の概要: Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems

arxiv url: http://arxiv.org/abs/2601.00566v1
Date: Fri, 02 Jan 2026 04:42:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-05 15:04:33.518977
Title: Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems
Title（参考訳）: 低ランクでセキュリティが低下する - 分散LoRAベースのLLMシステムに対するGradient Assembly Poisoning攻撃
Authors: Yueyan Dong, Minghui Xu, Qin Hu, Yinhao Xiao, Qi Luo, Yechao Zhang, Yue Zhang, Xiuzhen Cheng,
Abstract要約: Low-Rank Adaptation (LoRA) は、フェデレートされた設定で大規模言語モデル(LLM)を微調整する一般的なソリューションとなっている。我々は、この盲点を利用した新しい攻撃である、Gradient Assembly Poisoning (GAP)を提案する。 GAPは、表面流速を保ちながら劣化または偏りのある出力を一貫して誘導し、BLEUを最大14.5%削減し、事実的および文法的誤りを800%以上増加させ、92.6%の長文応答長を維持する。
参考スコア（独自算出の注目度）: 21.075158459478622
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Low-Rank Adaptation (LoRA) has become a popular solution for fine-tuning large language models (LLMs) in federated settings, dramatically reducing update costs by introducing trainable low-rank matrices. However, when integrated with frameworks like FedIT, LoRA introduces a critical vulnerability: clients submit $A$ and $B$ matrices separately, while only their product $AB$ determines the model update, yet this composite is never directly verified. We propose Gradient Assembly Poisoning (GAP), a novel attack that exploits this blind spot by crafting individually benign $A$ and $B$ matrices whose product yields malicious updates. GAP operates without access to training data or inter-client coordination and remains undetected by standard anomaly detectors. We identify four systemic vulnerabilities in LoRA-based federated systems and validate GAP across LLaMA, ChatGLM, and GPT-2. GAP consistently induces degraded or biased outputs while preserving surface fluency, reducing BLEU by up to 14.5\%, increasing factual and grammatical errors by over 800\%, and maintaining 92.6\% long-form response length. These results reveal a new class of stealthy, persistent threats in distributed LoRA fine-tuning.
Abstract（参考訳）: Low-Rank Adaptation (LoRA)は、フェデレートされた設定で大規模言語モデル(LLM)を微調整する一般的なソリューションとなり、トレーニング可能な低ランク行列を導入することで、更新コストを劇的に削減している。しかし、FedITのようなフレームワークと統合すると、LoRAは重大な脆弱性を導入している。クライアントが個別に$A$と$B$の行列を提出するのに対して、製品の$AB$だけがモデルの更新を決定するが、このコンポジットは直接検証されることはない。我々は、この盲点を悪用する新しい攻撃として、製品が悪意のある更新をもたらす行列を個別に$A$と$B$で作ることを提案する。 GAPは、訓練データやクライアント間調整へのアクセスなしに動作し、標準の異常検知器によって検出されないままである。また,LLaMA,ChatGLM,GPT-2の4つのシステム脆弱性を同定し,GAPをLLaMA,ChatGLM,GPT-2で検証した。 GAPは、表面流速を保ちながら劣化または偏りのある出力を一貫して誘導し、BLEUを最大14.5\%まで減少させ、事実的および文法的誤りを800\%以上増加させ、92.6\%の長文応答長を維持する。これらの結果から,分散LoRAファインチューニングにおける新たなステルス,永続的な脅威が明らかになった。

論文の概要: Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems

関連論文リスト