Fugu-MT Paper Translation (Abstract): Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

Paper overview: Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use

arxiv url: http://arxiv.org/abs/2603.14332v1
Date: Sun, 15 Mar 2026 11:46:57 GMT
Status: Translation complete
Last updated in system: 2026-03-17 16:19:35.753216
Title: Governing Dynamic Capabilities: Cryptographic Binding and Reproducibility Verification for AI Agent Tool Use
Authors: Ziling Zhou
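Abstract summary: AI agents dynamically acquire capabilities at runtime via MCP and A2A. This enables silent capability escalation and violates EU AI Act traceability requirements. Capability-bound agent certificates extend X.509 v3 with a skills manifest hash. A verifiable interaction ledger provides hash-linked, signed records for multi-agent forensic reconstruction.
Reference score (independently computed attention score): 0.0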
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: AI agents dynamically acquire capabilities at runtime via MCP and A2A, yet no framework detects when capabilities change post-authorization. We term this the capability-identity gap: it enables silent capability escalation and violates EU AI Act traceability requirements. We propose three mechanisms. Capability-bound agent certificates extend X.509 v3 with a skills manifest hash; any tool change invalidates the certificate. Reproducibility commitments leverage the near-determinism of LLM inference for post-hoc replay verification. A verifiable interaction ledger provides hash-linked, signed records for multi-agent forensic reconstruction. We formalize nine security properties and prove they hold under a realistic adversary model. Our Rust prototype achieves 97 µs certificate verification (<1 ns capability binding overhead, ~1,200,000× faster than BAID's zkVM), 0.62 ms total governance overhead per tool call (0.1–1.2% of typical latency), and 4.7× separation from cross-provider outputs (Cohen's d > 1.0 on all four metrics), with best classification at F_1 = 0.876 (Jaccard, θ = 0.408); single-provider deployments achieve F_1 = 0.990 with 11.5× separation. We evaluate 12 attack scenarios (silent escalation, tool trojanization, phantom delegation, evidence tampering, collusion, and runtime behavioral attacks validated against NVIDIA's Nemotron-AIQ traces); each is detected with a traceable mechanism, while the MCP+OAuth 2.1 baseline detects none. An end-to-end evaluation over a 5-to-20-agent pipeline with real LLM calls confirms that full governance (G1–G3) adds ~10.8 ms per pipeline run (0.12% overhead), scales sub-linearly per agent, and detects all five in-situ attacks with zero false positives.
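Illustrative sketch (not from the paper): the abstract names two mechanisms that are easy to picture in code, a certificate bound to a hash of the agent's skills manifest and a hash-linked, signed interaction ledger. The Python below is a minimal sketch under stated assumptions, not the authors' Rust implementation. A plain dictionary stands in for an X.509 v3 certificate extension, HMAC-SHA256 stands in for a real digital signature, and every function and field name (`manifest_hash`, `issue_certificate`, `append_record`, `skills_manifest_hash`, and so on) is hypothetical.

```python
import hashlib
import hmac
import json

GENESIS = "0" * 64  # assumed placeholder "previous hash" for the first ledger record


def manifest_hash(tools):
    """SHA-256 over a canonical JSON encoding of the tool (skills) manifest."""
    ordered = sorted(tools, key=lambda t: t["name"])
    canonical = json.dumps(ordered, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def issue_certificate(agent_id, tools):
    """Dictionary stand-in for a certificate carrying the manifest hash."""
    return {"agent_id": agent_id, "skills_manifest_hash": manifest_hash(tools)}


def certificate_valid(cert, current_tools):
    """Any change to the tool set changes the hash and invalidates the binding."""
    return hmac.compare_digest(cert["skills_manifest_hash"], manifest_hash(current_tools))


def _record_hash(prev, payload):
    body = json.dumps({"prev": prev, "payload": payload}, sort_keys=True)
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def append_record(ledger, key, payload):
    """Append a record linked to the previous record's hash and 'signed' with HMAC."""
    prev = ledger[-1]["record_hash"] if ledger else GENESIS
    record_hash = _record_hash(prev, payload)
    signature = hmac.new(key, record_hash.encode("utf-8"), hashlib.sha256).hexdigest()
    ledger.append({"prev": prev, "payload": payload,
                   "record_hash": record_hash, "signature": signature})


def verify_ledger(ledger, key):
    """Recompute every hash and signature; fail on any broken link or edit."""
    prev = GENESIS
    for rec in ledger:
        expected_hash = _record_hash(rec["prev"], rec["payload"])
        expected_sig = hmac.new(key, expected_hash.encode("utf-8"), hashlib.sha256).hexdigest()
        if rec["prev"] != prev or rec["record_hash"] != expected_hash:
            return False
        if not hmac.compare_digest(rec["signature"], expected_sig):
            return False
        prev = rec["record_hash"]
    return True
```

In this sketch, adding a tool to the manifest after issuance makes `certificate_valid` return `False`, and editing the payload of any earlier ledger record makes `verify_ledger` return `False`. A real deployment would presumably use asymmetric signatures so that verifiers do not hold the signing key; HMAC is used here only because it is available in the Python standard library.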

Related papers