Fugu-MT Paper Translation (Abstract): Open Agent Specification (Agent Spec): A Unified Representation for AI Agents

Paper overview: Open Agent Specification (Agent Spec): A Unified Representation for AI Agents

arxiv url: http://arxiv.org/abs/2510.04173v4
Date: Fri, 07 Nov 2025 14:02:33 GMT
Status: Translation complete
Last updated in system: 2025-11-10 21:00:44.518533
Title: Open Agent Specification (Agent Spec): A Unified Representation for AI Agents
Title (reference translation): Open Agent Specification (Agent Spec): A Unified Representation for AI Agents
Authors: Soufiane Amini, Yassine Benajiba, Cesare Bernardis, Paul Cayet, Hassan Chafi, Abderrahim Fathan, Louis Faucon, Damien Hilloulin, Sungpack Hong, Ingo Kossyk, Tran Minh Son Le, Rhicheek Patra, Sujith Ravi, Jonas Schweizer, Jyotika Singh, Shailender Singh, Weiyi Sun, Kartik Talamadupula, Jerry Xu
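Abstract summary: We introduce Open Agent Specification (Agent Spec), a declarative language for defining AI agents and agentic workflows. Agent Spec defines a common set of components, control and data flow semantics, and schemas that allow an agent to be defined once and executed across different runtimes.
Reference score (independently computed attention measure): 10.685555728094338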
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The proliferation of agent frameworks has led to fragmentation in how agents are defined, executed, and evaluated. Existing systems differ in their abstractions, data flow semantics, and tool integrations, making it difficult to share or reproduce workflows. We introduce Open Agent Specification (Agent Spec), a declarative language that defines AI agents and agentic workflows in a way that is compatible across frameworks, promoting reusability, portability and interoperability of AI agents. Agent Spec defines a common set of components, control and data flow semantics, and schemas that allow an agent to be defined once and executed across different runtimes. Agent Spec also introduces a standardized Evaluation harness to assess agent behavior and agentic workflows across runtimes - analogous to how HELM and related harnesses standardized LLM evaluation - so that performance, robustness, and efficiency can be compared consistently across frameworks. We demonstrate this using four distinct runtimes (LangGraph, CrewAI, AutoGen, and WayFlow) evaluated over three different benchmarks (SimpleQA Verified, $\tau^2$-Bench and BIRD-SQL). We provide accompanying toolsets: a Python SDK (PyAgentSpec), a reference runtime (WayFlow), and adapters for popular frameworks (e.g., LangGraph, AutoGen, CrewAI). Agent Spec bridges the gap between model-centric and agent-centric standardization & evaluation, laying the groundwork for reliable, reusable, and portable agentic systems.
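Abstract (reference translation): The proliferation of agent frameworks has fragmented how agents are defined, executed, and evaluated. Existing systems differ in their abstractions, data flow semantics, and tool integrations, which makes workflows difficult to share or reproduce. We introduce Open Agent Specification (Agent Spec), a declarative language that defines AI agents and agentic workflows in a way that is compatible across frameworks, promoting the reusability, portability, and interoperability of AI agents. Agent Spec defines a common set of components, control and data flow semantics, and schemas that allow an agent to be defined once and executed across different runtimes. Agent Spec also introduces a standardized evaluation harness for assessing agent behavior and agentic workflows across runtimes. We demonstrate this using four distinct runtimes (LangGraph, CrewAI, AutoGen, and WayFlow) evaluated on three benchmarks (SimpleQA Verified, $\tau^2$-Bench, and BIRD-SQL). We provide accompanying toolsets: a Python SDK (PyAgentSpec), a reference runtime (WayFlow), and adapters for popular frameworks (e.g., LangGraph, AutoGen, CrewAI). Agent Spec bridges the gap between model-centric and agent-centric standardization and evaluation, laying the groundwork for reliable, reusable, and portable agentic systems.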
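
The "define once, execute across different runtimes" idea in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `ToolSpec` and `AgentSpec` classes, their fields, and the two toy loader functions are hypothetical and are not the actual Agent Spec schema, the PyAgentSpec API, or any real framework adapter. The sketch only shows the general pattern of serializing one declarative definition and having independent consumers read the same document.

```python
import json
from dataclasses import asdict, dataclass, field


# Hypothetical schema for illustration only; NOT the real Agent Spec schema.
@dataclass
class ToolSpec:
    name: str
    description: str


@dataclass
class AgentSpec:
    name: str
    system_prompt: str
    tools: list[ToolSpec] = field(default_factory=list)


def to_json(spec: AgentSpec) -> str:
    """Serialize the declarative definition to a runtime-neutral JSON string."""
    return json.dumps(asdict(spec), sort_keys=True)


def from_json(text: str) -> AgentSpec:
    """Rebuild the definition from its JSON form."""
    data = json.loads(text)
    tools = [ToolSpec(**tool) for tool in data["tools"]]
    return AgentSpec(
        name=data["name"],
        system_prompt=data["system_prompt"],
        tools=tools,
    )


# Two toy "runtime adapters" (hypothetical). Each one reads the same
# serialized definition and builds its own internal representation.
def load_for_runtime_a(text: str) -> dict:
    spec = from_json(text)
    return {"runtime": "A", "agent": spec.name,
            "tool_names": [t.name for t in spec.tools]}


def load_for_runtime_b(text: str) -> dict:
    spec = from_json(text)
    return {"runtime": "B", "agent": spec.name,
            "tool_names": [t.name for t in spec.tools]}


# Define the agent once, then hand the same document to both runtimes.
spec = AgentSpec(
    name="qa_agent",
    system_prompt="Answer concisely.",
    tools=[ToolSpec(name="search", description="Look up facts.")],
)
serialized = to_json(spec)
loaded_a = load_for_runtime_a(serialized)
loaded_b = load_for_runtime_b(serialized)
```

In this sketch both loaders see the same agent name and tool list because they consume the same serialized definition; a real adapter would additionally map components and control and data flow onto framework-specific constructs.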

