Fugu-MT 論文翻訳(概要): ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering

論文の概要: ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering

arxiv url: http://arxiv.org/abs/2603.13950v1
Date: Sat, 14 Mar 2026 13:54:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.504845
Title: ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering
Title（参考訳）: ToolFlood: セレクションを超えて -- セマンティックカバレッジを通じてLLMエージェントからバリデーションツールを隠蔽する
Authors: Hussein Jawad, Nicolas J-B Brunel,
Abstract要約: 本稿では,ツール拡張型Large Language Model (LLM)エージェントに対する検索層攻撃であるToolFloodを紹介する。検索後にどのツールが選択されるかを変更するのではなく、ToolFloodは、いくつかのアタッカー制御ツールを注入することで、検索自体を圧倒する。 ToolFloodは、95%のアタック成功率と低インジェクション率を実現している。
参考スコア（独自算出の注目度）: 2.6928305857508974
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Model (LLM) agents increasingly use external tools for complex tasks and rely on embedding-based retrieval to select a small top-k subset for reasoning. As these systems scale, the robustness of this retrieval stage is underexplored, even though prior work has examined attacks on tool selection. This paper introduces ToolFlood, a retrieval-layer attack on tool-augmented LLM agents. Rather than altering which tool is chosen after retrieval, ToolFlood overwhelms retrieval itself by injecting a few attacker-controlled tools whose metadata is carefully placed by exploiting the geometry of embedding space. These tools semantically span many user queries, dominate the top-k results, and push all benign tools out of the agent's context. ToolFlood uses a two-phase adversarial tool generation strategy. It first samples subsets of target queries and uses an LLM to iteratively generate diverse tool names and descriptions. It then runs an iterative greedy selection that chooses tools maximizing coverage of remaining queries in embedding space under a cosine-distance threshold, stopping when all queries are covered or a budget is reached. We provide theoretical analysis of retrieval saturation and show on standard benchmarks that ToolFlood achieves up to a 95% attack success rate with a low injection rate (1% in ToolBench). The code will be made publicly available at the following link: https://github.com/as1-prog/ToolFlood
Abstract（参考訳）: 大きな言語モデル(LLM)エージェントは、複雑なタスクに外部ツールを使い、推論のために小さなトップkサブセットを選択するために埋め込みベースの検索に依存している。これらのシステムの規模が拡大するにつれて、ツール選択に対する攻撃を事前に検討したにもかかわらず、この検索段階のロバスト性は過小評価されている。本稿では,ツール拡張LDMエージェントに対する検索層攻撃であるToolFloodを紹介する。検索後にどのツールを選択するかを変更する代わりに、ToolFloodは、埋め込みスペースの幾何学を利用してメタデータを注意深く配置するアタッカー制御ツールを注入することで、検索自体を圧倒する。これらのツールは、多くのユーザクエリにセマンティックに分散し、トップkの結果を支配し、すべての良質なツールをエージェントのコンテキストから追い出す。 ToolFloodは2段階のツール生成戦略を使用する。まずターゲットクエリのサブセットをサンプリングし、LLMを使用してさまざまなツール名と記述を反復的に生成する。次に、反復的な欲求選択を実行し、すべてのクエリがカバーされたり、予算が到達した時に停止する、余分なクエリをcosine-distanceのしきい値の下に埋め込んだスペースでカバレッジを最大化するツールを選択する。本稿では,検索飽和の理論的解析を行い,ToolFloodが95%の攻撃成功率(ToolBenchでは1%)を達成できる標準ベンチマークを示す。コードは以下のリンクで公開される。 https://github.com/as1-prog/ToolFlood

論文の概要: ToolFlood: Beyond Selection -- Hiding Valid Tools from LLM Agents via Semantic Covering

関連論文リスト