Fugu-MT 論文翻訳(概要): Widening the Gap: Exploiting LLM Quantization via Outlier Injection

論文の概要: Widening the Gap: Exploiting LLM Quantization via Outlier Injection

arxiv url: http://arxiv.org/abs/2605.15152v1
Date: Thu, 14 May 2026 17:50:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.995774
Title: Widening the Gap: Exploiting LLM Quantization via Outlier Injection
Title（参考訳）: ギャップを広げる:アウトリアインジェクションによるLDM量子化の爆発
Authors: Xiaohua Zhan, Kazuki Egashira, Robin Staab, Mark Vero, Martin Vechev,
Abstract要約: 悪意のある振る舞いを継続的に引き起こす最初の量子化条件攻撃を導入する。我々の攻撃は、多くの現代的な量子化法で共有される単純な性質を利用する。我々の攻撃は、前回の攻撃が失敗する広範囲の量子化手法に対して高い成功率を達成することを示す。
参考スコア（独自算出の注目度）: 16.503478819196115
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: LLM quantization has become essential for memory-efficient deployment. Recent work has shown that quantization schemes can pose critical security risks: an adversary may release a model that appears benign in full precision but exhibits malicious behavior once quantized by users. However, existing quantization-conditioned attacks have been limited to relatively simple quantization methods, where the attacker can estimate weight regions that remain invariant under the target quantization. Notably, prior attacks have consistently failed to compromise more popular and sophisticated schemes, limiting their practical impact. In this work, we introduce the first quantization-conditioned attack that consistently induces malicious behavior that can be triggered by a broad range of advanced quantization techniques, including AWQ, GPTQ, and GGUF I-quants. Our attack exploits a simple property shared by many modern quantization methods: large outliers can cause other weights to be rounded to zero. Consequently, by injecting outliers into specific weight blocks, an adversary can therefore induce a targeted, predictable weight collapse in the model. This effect can be used to craft seemingly benign full-precision models that exhibit a wide range of malicious behaviors after quantization. Through extensive evaluation across three attack scenarios and LLMs, we show that our attack achieves high success rates against a broad range of quantization methods on which prior attacks fail. Our results demonstrate, for the first time, that the security risks of quantization are not restricted to simpler schemes but are broadly relevant across complex, widely-used quantization methods.
Abstract（参考訳）: LLM量子化は、メモリ効率の確保に不可欠である。近年の研究では、量子化スキームは重大なセキュリティリスクを引き起こす可能性があることが示されている。しかし、既存の量子化条件付き攻撃は比較的単純な量子化法に限られており、攻撃者はターゲット量子化の下で不変な重み領域を推定することができる。特に、以前の攻撃は、よりポピュラーで洗練されたスキームの妥協に一貫して失敗し、その実践的影響を制限した。本研究では、AWQ、GPTQ、GGUF I-quantsなど、幅広い高度な量子化技術によって引き起こされる有害な振る舞いを継続的に誘発する最初の量子化条件攻撃を提案する。我々の攻撃は、多くの現代的な量子化法で共有される単純な性質を利用する。したがって、特定の重みブロックに外周を注入することにより、敵はモデルにおいて標的となる、予測可能な重み崩壊を誘導することができる。この効果は、量子化後の広範囲の悪意のある振る舞いを示す、明らかに良質な完全精度のモデルを構築するために使用できる。 3つの攻撃シナリオとLLMの広範な評価を通じて、我々の攻撃は、前回の攻撃が失敗する広範囲な量子化手法に対して高い成功率を達成することを示す。我々の結果は、量子化のセキュリティリスクは、単純なスキームに限らず、複雑で広く使われている量子化手法に広く関係していることを示している。

論文の概要: Widening the Gap: Exploiting LLM Quantization via Outlier Injection

関連論文リスト