Fugu-MT 論文翻訳(概要): Reframing LLM Agent Security as an Agent-Human Interaction Problem

論文の概要: Reframing LLM Agent Security as an Agent-Human Interaction Problem

arxiv url: http://arxiv.org/abs/2605.24309v1
Date: Sat, 23 May 2026 00:36:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:17.864753
Title: Reframing LLM Agent Security as an Agent-Human Interaction Problem
Title（参考訳）: エージェント・ヒューマンインタラクション問題としてのLLMエージェントセキュリティ
Authors: Peiran Wang, Ying Li, Yuan Tian,
Abstract要約: 我々はエージェント・セキュリティは基本的にエージェント・ヒューマン・インタラクション(AHI)の問題であり、純粋にアルゴリズム的な問題ではないと主張している。 2026年4月現在、59の学術論文、21の生産エージェントシステム、26のセキュリティプラグインを体系的に分析している。広く展開されている3つの人間中心のセキュリティメカニズムが業界プラクティスを支配しています。
参考スコア（独自算出の注目度）: 11.867868278947524
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We argue that LLM agent security is fundamentally an agent-human interaction (AHI) problem, not a purely algorithmic one. To substantiate this position, we conduct a systematic analysis of 59 academic papers, 21 production agent systems, and 26 security plugins as of April 2026. Our analysis reveals a striking pattern: the three widely deployed human-centric security mechanisms (policy specification, runtime approval, and scope configuration) dominate industry practice, each adopted by at least 14 of 21 systems (14, 15, and 16, respectively), while the categories most heavily studied in academia (intent anchoring and trust labeling) see zero production deployment. Yet current human participation mechanisms are far from satisfactory: they suffer from a fundamental trade-off between cognitive burden and security guarantees, leaving users caught between approval fatigue and uncontrolled agent autonomy. We make three contributions. First, through a systematic comparison of LLM-based and human-based intent alignment, we argue that human participation in agent security decisions is indispensable given current capabilities. Second, we quantify a pronounced industry-academia mismatch: the security mechanisms that practitioners actually deploy receive scant research attention, while the approaches that researchers favor remain undeployed. Third, we propose a three-direction research agenda and call for AHI security to be recognized as a first-class research citizen, one that demands its own design principles, evaluation methods, and theoretical foundations.
Abstract（参考訳）: LLMエージェントのセキュリティは基本的にエージェント・ヒューマン・インタラクション(AHI)の問題であり、純粋にアルゴリズム的な問題ではない、と我々は主張する。 2026年4月現在、59の学術論文、21の生産エージェントシステム、26のセキュリティプラグインを体系的に分析している。広く展開されている3つの人間中心のセキュリティメカニズム(ポリシー仕様、ランタイム承認、スコープ構成)が、それぞれ21のシステム(14、15、16)のうち少なくとも14のシステムで採用されている。認知的負担とセキュリティ保証の根本的なトレードオフに悩まされ、ユーザーは承認疲労と制御不能なエージェントの自律性に悩まされる。私たちは3つの貢献をします。まず、LLMと人間による意図的アライメントの体系的な比較を通じて、エージェントのセキュリティ決定への人間の関与は、現在の能力を考えると不可欠である、と論じる。第二に、業界と学界の明確なミスマッチを定量化する:実践者が実際に展開するセキュリティメカニズムは、研究の注目を集める一方で、研究者が好むアプローチは、まだデプロイされていないままである。第3に,AHIの安全を第一級研究市民として認定し,独自の設計原則,評価方法,理論基盤を求める3つの方向性研究課題を提案する。

論文の概要: Reframing LLM Agent Security as an Agent-Human Interaction Problem

関連論文リスト