Fugu-MT 論文翻訳(概要): Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models

論文の概要: Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models

arxiv url: http://arxiv.org/abs/2605.13338v1
Date: Wed, 13 May 2026 10:57:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.995359
Title: Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models
Title（参考訳）: 階層型遺伝的アルゴリズムに基づくDoS攻撃によるブラックボックス大言語推論モデルの構築
Authors: Shuqiang Wang, Wei Cao, Jiaqi Weng, Jialing Tao, Licheng Pan, Hui Xue, Zhixuan Chu,
Abstract要約: 大規模推論モデル(LRM)は、信頼性の高いマルチステップ推論を必要とするシステムにますます統合されている。 LRMは「過大な考え」を示す傾向があり、過度に長く冗長な推論の痕跡を生み出している。入力問題の論理構造を体系的に摂動させることにより, LRMにおける過大な思考を誘発する自動ブラックボックスフレームワークを提案する。
参考スコア（独自算出の注目度）: 26.4508439363827
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Reasoning Models (LRMs) are increasingly integrated into systems requiring reliable multi-step inference, yet this growing dependence exposes new vulnerabilities related to computational availability. In particular, LRMs exhibit a tendency to "overthink", producing excessively long and redundant reasoning traces, when confronted with incomplete or logically inconsistent inputs. This behavior significantly increases inference latency and energy consumption, forming a potential vector for denial-of-service (DoS) style resource exhaustion. In this work, we investigate this attack surface and propose an automated black-box framework that induces overthinking in LRMs by systematically perturbing the logical structure of input problems. Our method employs a hierarchical genetic algorithm (HGA) operating on structured problem decompositions, and optimizes a composite fitness function designed to maximize both response length and reflective overthinking markers. Across four state-of-the-art reasoning models, the proposed method substantially amplifies output length, achieving up to a 26.1x increase on the MATH benchmark and consistently outperforming benign and manually crafted missing-premise baselines. We further demonstrate strong transferability, showing that adversarial inputs evolved using a small proxy model retain high effectiveness against large commercial LRMs. These findings highlight overthinking as a shared and exploitable vulnerability in modern reasoning systems, underscoring the need for more robust defenses.
Abstract（参考訳）: 大規模推論モデル(LRM)は、信頼性の高いマルチステップ推論を必要とするシステムにますます統合されているが、この依存度の増加は、計算可用性に関連する新たな脆弱性を公開する。特に、LRMは、不完全または論理的に矛盾した入力に直面したときに、過度に長く冗長な推論トレースを生じる「過大な考え」を示す傾向がある。この挙動は、推論遅延とエネルギー消費を著しく増加させ、DoS(DoS)スタイルのリソース枯渇の潜在的なベクターを形成する。本研究では, この攻撃面を解析し, 入力問題の論理構造を体系的に摂動することで, LRMにおける過度な思考を誘発する自動ブラックボックスフレームワークを提案する。提案手法では, 階層型遺伝的アルゴリズム(HGA)を用い, 応答長を最大化し, 反射的過大化マーカーを最大化するために, 合成適合度関数を最適化する。 4つの最先端推論モデルにおいて,提案手法は出力長を大幅に向上し,MATHベンチマークの26.1倍の精度向上を実現した。さらに,小型のプロキシモデルを用いて敵の入力が発展し,大型の商用LEMに対して高い効果を保っていることを示す。これらの発見は、現代の推論システムにおいて共有され、悪用可能な脆弱性として過度に考え、より堅牢な防御の必要性を強調している。

論文の概要: Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models

関連論文リスト