Fugu-MT 論文翻訳(概要): When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

論文の概要: When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

arxiv url: http://arxiv.org/abs/2606.20023v1
Date: Thu, 18 Jun 2026 09:54:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-19 18:23:39.78174
Title: When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents
Title（参考訳）: 下顎前立腺が十分であった場合 : LLMエージェントの過剰手術ツール選択の検討
Authors: Kaiyue Yang, Yuyan Bu, Jingwei Yi, Yuchi Wang, Biyu Zhou, Juntao Dai, Songlin Hu, Yaodong Yang,
Abstract要約: LLMエージェントは、ますます自律的にツールを選択するようになり、異なる特権を持つツールの中からの選択が安全関連になる。エージェントが選択またはエスカレートするオーバープライレジツールの選択について検討するが、十分な低プライレジ代替手段にもかかわらず、高プライレジツールを選択するかエスカレーションする。エージェントに十分な低特権のツールを好み、必要な時にのみエスカレートするように教える。
参考スコア（独自算出の注目度）: 24.231912493421948
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As LLM agents increasingly select tools autonomously, their choices among tools with different privileges become safety-relevant. However, prior tool-selection studies focus on safety-agnostic metadata preferences, leaving privilege-sensitive choices underexplored. To address this gap, we study over-privileged tool selection, in which an agent selects or escalates to a higher-privilege tool despite a sufficient lower-privilege alternative. We introduce ToolPrivBench to evaluate whether agents choose higher-privilege tools despite sufficient lower-privilege alternatives, measuring both initial selection and escalation after transient tool failures. Across eight domains and five recurring risk patterns, we find that over-privileged tool selection is common among mainstream LLM agents and is further amplified by transient failures. We further find that general safety alignment does not reliably transfer to least-privilege tool choice, while prompt-level controls provide only limited mitigation under transient failures. We therefore introduce a privilege-aware post-training defense that teaches agents to prefer sufficient lower-privilege tools and escalate only when necessary. Our mitigation experiments show that this defense substantially reduces unnecessary high-privilege tool use while preserving general capabilities.
Abstract（参考訳）: LLMエージェントが自律的にツールを選択するようになると、異なる特権を持つツールの選択が安全関連になる。しかし、以前のツール選択研究では、安全に依存しないメタデータの嗜好に焦点が当てられており、特権に敏感な選択は未調査のままである。このギャップに対処するため,エージェントが選択またはエスカレーションを行うツール選択について検討した。本稿では,ツールPrivBenchを導入し,エージェントが十分な低特権の代替手段にもかかわらず,高特権のツールを選択するかどうかを評価するとともに,過渡的ツール障害後の初期選択とエスカレーションの両方を測定する。 8つのドメインと5つの繰り返し発生するリスクパターンにまたがって、過度に特権化されたツールの選択は、主要なLSMエージェントに共通しており、過度な障害によってさらに増幅されている。さらに, 汎用安全アライメントが最小限のツール選択に確実に移行しないのに対して, プロンプトレベルの制御は過渡的障害下では限定的な緩和しか提供しないことがわかった。そこで我々は,エージェントに十分な低特権ツールを優先し,必要時にのみエスカレートするように指導する特権意識のポストトレーニングディフェンスを導入する。我々の緩和実験は、この防御が汎用性を保ちながら不要な高特権ツールの使用を著しく減少させることを示している。

論文の概要: When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

関連論文リスト