Fugu-MT Paper Translation (Abstract): Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems

Paper abstract: Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems

arxiv url: http://arxiv.org/abs/2510.14133v1
Date: Wed, 15 Oct 2025 22:02:30 GMT
Status: Translation complete
Last updated in system: 2025-10-17 21:15:14.637028
Title: Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems
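Title (reference translation): Formalization of the Safety, Security, and Functional Properties of Agentic AI Systems
Authors: Edoardo Allegrini, Ananth Shreekumar, Z. Berkay Celik
Abstract summary: This paper proposes a modeling framework for agentic AI systems composed of two foundational models. The first, the host agent model, formalizes the top-level entity that interacts with the user, decomposes tasks, and orchestrates their execution by leveraging external agents and tools. The second, the task lifecycle model, details the states of individual sub-tasks and their transitions from creation to completion, providing a fine-grained view of task management and error handling.
Reference score (independently computed attention score): 10.734711935895225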
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Agentic AI systems, which leverage multiple autonomous agents and Large Language Models (LLMs), are increasingly used to address complex, multi-step tasks. The safety, security, and functionality of these systems are critical, especially in high-stakes applications. However, the current ecosystem of inter-agent communication is fragmented, with protocols such as the Model Context Protocol (MCP) for tool access and the Agent-to-Agent (A2A) protocol for coordination being analyzed in isolation. This fragmentation creates a semantic gap that prevents the rigorous analysis of system properties and introduces risks such as architectural misalignment and exploitable coordination issues. To address these challenges, we introduce a modeling framework for agentic AI systems composed of two foundational models. The first, the host agent model, formalizes the top-level entity that interacts with the user, decomposes tasks, and orchestrates their execution by leveraging external agents and tools. The second, the task lifecycle model, details the states and transitions of individual sub-tasks from creation to completion, providing a fine-grained view of task management and error handling. Together, these models provide a unified semantic framework for reasoning about the behavior of multi-AI agent systems. Grounded in this framework, we define 17 properties for the host agent and 14 for the task lifecycle, categorized into liveness, safety, completeness, and fairness. Expressed in temporal logic, these properties enable formal verification of system behavior, detection of coordination edge cases, and prevention of deadlocks and security vulnerabilities. Through this effort, we introduce the first rigorously grounded, domain-agnostic framework for the systematic analysis, design, and deployment of correct, reliable, and robust agentic AI systems.
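Abstract (reference translation): Agentic AI systems, which leverage multiple autonomous agents and large language models (LLMs), are increasingly used to tackle complex, multi-step tasks. The safety, security, and functionality of these systems are critical, particularly in high-stakes applications. However, the current ecosystem of inter-agent communication is fragmented: protocols such as the Model Context Protocol (MCP) for tool access and the Agent-to-Agent (A2A) protocol for coordination are analyzed in isolation. This fragmentation creates a semantic gap that prevents rigorous analysis of system properties and introduces risks such as architectural misalignment and exploitable coordination issues. To address these challenges, we introduce a modeling framework for agentic AI systems composed of two foundational models. The first, the host agent model, formalizes the top-level entity that interacts with the user, decomposes tasks, and orchestrates their execution by leveraging external agents and tools. The second, the task lifecycle model, details the states of individual sub-tasks and their transitions from creation to completion, providing a fine-grained view of task management and error handling. Together, these models provide a unified semantic framework for reasoning about the behavior of multi-AI agent systems. Based on this framework, we define 17 properties for the host agent and 14 for the task lifecycle, classified into liveness, safety, completeness, and fairness. Expressed in temporal logic, these properties enable formal verification of system behavior, detection of coordination edge cases, and prevention of deadlocks and security vulnerabilities. Through this effort, we introduce the first rigorously grounded, domain-agnostic framework for the systematic analysis, design, and deployment of correct, reliable, and robust agentic AI systems.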
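
As a rough illustration of what a task lifecycle with safety and liveness properties might look like, the sketch below models a sub-task as a small state machine and checks two properties over a finite execution trace. The state names, the transition relation, and the helper functions `is_safe` and `is_live` are all hypothetical: the abstract does not list the paper's actual states or its 14 task-lifecycle properties, and the paper expresses its properties in temporal logic rather than as trace checks in code.

```python
from enum import Enum, auto


class TaskState(Enum):
    # Hypothetical state names; the abstract does not give the paper's state set.
    CREATED = auto()
    RUNNING = auto()
    COMPLETED = auto()
    FAILED = auto()


# States from which no further transition is expected (assumption).
TERMINAL = {TaskState.COMPLETED, TaskState.FAILED}

# Assumed transition relation, for illustration only.
ALLOWED = {
    TaskState.CREATED: {TaskState.RUNNING, TaskState.FAILED},
    TaskState.RUNNING: {TaskState.COMPLETED, TaskState.FAILED},
    TaskState.COMPLETED: set(),
    TaskState.FAILED: set(),
}


def is_safe(trace):
    """Safety-style check: every consecutive pair is an allowed transition."""
    return all(b in ALLOWED[a] for a, b in zip(trace, trace[1:]))


def is_live(trace):
    """Liveness-style check on a finished finite trace: it ends in a terminal state."""
    return bool(trace) and trace[-1] in TERMINAL
```

A trace such as `[TaskState.CREATED, TaskState.RUNNING, TaskState.COMPLETED]` passes both checks, while a trace that leaves `COMPLETED` fails `is_safe` and a trace that stops at `RUNNING` fails `is_live`. Checking finite traces only approximates temporal-logic liveness, which is defined over infinite executions.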


Related papers