Fugu-MT 論文翻訳(概要): From Admission to Invariants: Measuring Deviation in Delegated Agent Systems

論文の概要: From Admission to Invariants: Measuring Deviation in Delegated Agent Systems

arxiv url: http://arxiv.org/abs/2604.17517v2
Date: Tue, 21 Apr 2026 18:18:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:10.357925
Title: From Admission to Invariants: Measuring Deviation in Delegated Agent Systems
Title（参考訳）: 許可から不変性へ:分解剤系における偏差の測定
Authors: Marcelo Fernandez,
Abstract要約: エージェントの行動が入場時に確立された許容行動空間A0内に留まっているかどうかを,施行に基づくガバナンスでは判断できないことを示す。実施に基づく監視のための情報理論上の不可能性を証明し、IMLが検出遅延を許容できる限り有限に抑えた入場時間ドリフトを検出することを示す。
参考スコア（独自算出の注目度）: 0.4929694290403903
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Autonomous agent systems are governed by enforcement mechanisms that flag hard constraint violations at runtime. The Agent Control Protocol identifies a structural limit of such systems: a correctly-functioning enforcement engine can enter a regime in which behavioral drift is invisible to it, because the enforcement signal operates below the layer where deviation is measurable. We show that enforcement-based governance is structurally unable to determine whether an agent behavior remains within the admissible behavior space A0 established at admission time. Our central result, the Non-Identifiability Theorem, proves that A0 is not in the sigma-algebra generated by the enforcement signal g under the Local Observability Assumption, which every practical enforcement system satisfies. The impossibility arises from a fundamental mismatch: g evaluates actions locally against a point-wise rule set, while A0 encodes global, trajectory-level behavioral properties set at admission time. An agent can therefore drift -- systematically shifting its behavioral distribution away from admission-time expectations -- while every individual action remains within the permitted action space. We define the Invariant Measurement Layer (IML), which bypasses this limitation by retaining direct access to the generative model of A0, restoring observability precisely in the region where enforcement is structurally blind. We prove an information-theoretic impossibility for enforcement-based monitoring and show IML detects admission-time drift with provably finite detection delay. Validated across four settings: three drift scenarios (300 and 1000 steps), a live n8n webhook pipeline, and a LangGraph StateGraph agent -- enforcement triggers zero violations while IML detects each drift type within 9-258 steps of drift onset.
Abstract（参考訳）: 自律エージェントシステムは、実行時に厳しい制約違反をフラグする執行機構によって管理される。エージェント制御プロトコル(Agen Control Protocol)は、そのようなシステムの構造的限界を識別する: 正しく機能する機関は、偏差が測定可能な層の下にあるため、動作のドリフトが見えない状態に入ることができる。エージェントの動作が入場時に確立された許容行動空間A0内に留まっているかどうかを,施行に基づくガバナンスが構造的に決定できないことを示す。我々の中心的な結果であるNon-Identifiability Theoremは、A0が局所可観測性推定(Local Observability Assumption)の下での強制信号gによって生成されるシグマ代数にはないことを証明している。 g はポイントワイドな規則集合に対して局所的に行動を評価するが、A0 は入場時に設定されたグローバルな軌道レベルの行動特性を符号化する。エージェントは、許可されたアクション空間内に個々のアクションが残されている間、その行動分布を入院時の期待から体系的にシフトさせることができる。 Invariant Measurement Layer (IML) は、A0の生成モデルへの直接アクセスを維持することにより、この制限を回避し、構造的に盲目な領域において、正確に可観測性を回復する。実施に基づく監視のための情報理論上の不可能性を証明し、IMLが検出遅延を許容できる限り有限に抑えた入場時間ドリフトを検出することを示す。 3つのドリフトシナリオ(300ステップと1000ステップ)、ライブのn8n webhookパイプライン、LangGraph StateGraphエージェントの4つの設定で検証されている。

論文の概要: From Admission to Invariants: Measuring Deviation in Delegated Agent Systems

関連論文リスト