Fugu-MT 論文翻訳(概要): Evaluating Privilege Usage of Agents on Real-World Tools

論文の概要: Evaluating Privilege Usage of Agents on Real-World Tools

arxiv url: http://arxiv.org/abs/2603.28166v1
Date: Mon, 30 Mar 2026 08:35:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.30624
Title: Evaluating Privilege Usage of Agents on Real-World Tools
Title（参考訳）: 実世界のツールにおけるエージェントの民生利用評価
Authors: Quan Zhang, Lianhang Fu, Lvsi Lian, Gwihwan Go, Yujue Wang, Chijin Zhou, Yu Jiang, Geguang Pu,
Abstract要約: GrantBoxはエージェントの特権利用を分析するためのセキュリティ評価サンドボックスである。 GrantBoxは、現実世界のツールを自動的に統合し、LLMエージェントが真の特権を呼び出せるようにする。
参考スコア（独自算出の注目度）: 20.792970933124305
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents autonomy over tool use also transfers the associated privileges to both the agent and the underlying LLM. Improper privilege usage may lead to serious consequences, including information leakage and infrastructure damage. While several benchmarks have been built to study agents' security, they often rely on pre-coded tools and restricted interaction patterns. Such crafted environments differ substantially from the real-world, making it hard to assess agents' security capabilities in critical privilege control and usage. Therefore, we propose GrantBox, a security evaluation sandbox for analyzing agent privilege usage. GrantBox automatically integrates real-world tools and allows LLM agents to invoke genuine privileges, enabling the evaluation of privilege usage under prompt injection attacks. Our results indicate that while LLMs exhibit basic security awareness and can block some direct attacks, they remain vulnerable to more sophisticated attacks, resulting in an average attack success rate of 84.80% in carefully crafted scenarios.
Abstract（参考訳）: LLMエージェントを現実世界のツールで取得すると、生産性が大幅に向上する。しかし、ツール使用よりもエージェントの自主性を認めることは、エージェントと基礎となるLLMの両方に関連する特権を転送する。不適切な特権使用は、情報漏洩やインフラの損傷など、深刻な結果をもたらす可能性がある。エージェントのセキュリティを研究するためにいくつかのベンチマークが作成されているが、プリコードされたツールや制限されたインタラクションパターンに依存していることが多い。このような工芸的な環境は現実世界とは大きく異なり、重要な特権管理と使用法においてエージェントのセキュリティ能力を評価することは困難である。そこで我々は,エージェントの特権使用状況を分析するセキュリティ評価サンドボックスであるGrantBoxを提案する。 GrantBoxは、現実世界のツールを自動的に統合し、LLMエージェントが真の特権を呼び出せるようにし、プロンプトインジェクション攻撃による特権使用の評価を可能にする。 LLMは基本的なセキュリティ意識を示し、いくつかの直接攻撃をブロックできるが、より高度な攻撃に弱いままであり、慎重に構築されたシナリオでは平均84.80%の攻撃成功率となる。

論文の概要: Evaluating Privilege Usage of Agents on Real-World Tools

関連論文リスト