Fugu-MT Paper Translation (Abstract): Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

Paper overview: Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

arxiv url: http://arxiv.org/abs/2603.14332v2
Date: Thu, 19 Mar 2026 19:46:56 GMT
Status: Translation complete
Last updated in system: 2026-03-23 15:23:15.640522
Title: Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use
Title (reference translation): Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use
Authors: Ziling Zhou
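Abstract summary: No existing security layer verifies what an AI agent can do, whether it executed what it claims, or what happened in a multi-agent interaction. Existing frameworks conflate tool definitions with user context, enabling silent capability escalation and leaving interactions without verifiable provenance. We derive three Agent Governance Requirements: capability integrity (G1), behavioral verifiability (G2), and interaction auditability (G3). We validate with two crypto-agnostic instantiations: basic (Ed25519, SHA-256; 97 us verify) and enhanced (BBS+ selective disclosure, Groth16 DV-SNARK; 13.8 ms).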
Reference score (independently computed attention score): 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: AI agents dynamically acquire tools, orchestrate sub-agents, and transact across organizational boundaries, yet no existing security layer verifies what an agent can do, whether it executed what it claims, or what happened in a multi-agent interaction. We trace this gap to the capability-context separation: inside a transformer, tool definitions and user context are indistinguishable tokens, but at the orchestration layer they have fundamentally different security semantics. Existing frameworks conflate the two, enabling silent capability escalation and leaving interactions without verifiable provenance. From this principle we derive three Agent Governance Requirements: capability integrity (G1), behavioral verifiability (G2), and interaction auditability (G3), defining what a governed agent ecosystem must enforce, independent of how. We prove two structural results: the Chain Verifiability Theorem (one unverifiable interior agent breaks end-to-end verification for all downstream nodes) and the Bounded Divergence Theorem (replay-based verification yields a probabilistic safety certificate, epsilon <= 1 - alpha^{1/n}). We validate with two crypto-agnostic instantiations -- basic (Ed25519, SHA-256; 97 us verify) and enhanced (BBS+ selective disclosure, Groth16 DV-SNARK; 13.8 ms) -- both satisfying nine security properties. A reproducibility study (9 models, 7 providers) reveals 5.8x variance in inference determinism, connecting model characteristics to governance architecture. End-to-end evaluation over 5-20 agent pipelines confirms <0.02% overhead and detection of all attack scenarios with zero false positives.
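Abstract (reference translation): AI agents dynamically acquire tools, orchestrate sub-agents, and transact across organizational boundaries, yet no existing security layer verifies what an agent can do, whether it executed what it claims, or what happened in a multi-agent interaction. Inside a transformer, tool definitions and user context are indistinguishable tokens, but at the orchestration layer they have fundamentally different security semantics. Existing frameworks conflate the two, enabling silent capability escalation and leaving interactions without verifiable provenance. From this principle we derive three Agent Governance Requirements: capability integrity (G1), behavioral verifiability (G2), and interaction auditability (G3). We prove two structural results: the Chain Verifiability Theorem (one unverifiable interior agent breaks end-to-end verification for all downstream nodes) and the Bounded Divergence Theorem (replay-based verification yields a probabilistic safety certificate, epsilon <= 1 - alpha^{1/n}). We validate with two crypto-agnostic instantiations: basic (Ed25519, SHA-256; 97 us verify) and enhanced (BBS+ selective disclosure, Groth16 DV-SNARK; 13.8 ms). A reproducibility study (9 models, 7 providers) shows 5.8x variance in inference determinism, connecting model characteristics to governance architecture. End-to-end evaluation over 5-20 agent pipelines confirms an overhead below 0.02% and detection of all attack scenarios with zero false positives.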
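The bound quoted for the Bounded Divergence Theorem, epsilon <= 1 - alpha^{1/n}, can be evaluated numerically to see how it behaves. The abstract does not define alpha or n, so the sketch below only computes the right-hand side of the inequality. It assumes alpha lies in (0, 1] and n is a positive integer; the function name `divergence_bound` is hypothetical and not taken from the paper.

```python
def divergence_bound(alpha: float, n: int) -> float:
    """Evaluate 1 - alpha**(1/n), the right-hand side of the quoted bound.

    Assumptions (not stated in the abstract): alpha is in (0, 1] and
    n is a positive integer.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if n < 1:
        raise ValueError("n must be a positive integer")
    return 1.0 - alpha ** (1.0 / n)


if __name__ == "__main__":
    # Tabulate the bound for a fixed alpha and growing n.
    for n in (1, 5, 10, 50):
        print(f"alpha=0.95, n={n:>2}: bound={divergence_bound(0.95, n):.6f}")
```

Under these assumptions, the expression shrinks as n grows for a fixed alpha below 1, and it equals zero when alpha is exactly 1.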
