Fugu-MT Paper Translation (Abstract): Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

Paper summary: Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

arxiv url: http://arxiv.org/abs/2604.27358v1
Date: Thu, 30 Apr 2026 03:15:05 GMT
Status: Translation complete
Last updated in system: 2026-05-01 16:31:53.897279
Title: Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems
Title (reference translation): Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems
Authors: Yuan Sun
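Abstract summary: This paper proposes a formal framework for runtime delegation safety in hierarchical multi-agent systems. It instantiates Safe Bilevel Delegation (SBD) in three high-stakes domains: medical AI (MIMIC-III), financial risk control (S&P 500), and educational agent supervision (ASSISTments).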
Reference score (independently computed attention measure): 4.161562398794914
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi-agent architecture selection at design time or provides broad empirical guidelines, but neither provides a runtime mechanism that dynamically adjusts the safety-efficiency trade-off as task context changes during execution. We propose Safe Bilevel Delegation (SBD), a formal framework for runtime delegation safety in hierarchical multi-agent systems. SBD formulates task delegation as a bilevel optimization problem: an outer meta-weight network phi learns context-dependent safety-efficiency weights lambda(s) in [0,1]; an inner loop optimizes the delegation policy pi subject to a probabilistic safety constraint P(safe) >= 1-delta. The continuous delegation degree alpha in [0, 1] controls how much decision authority is transferred to each sub-agent, interpolating smoothly between full human override (alpha=0) and fully autonomous execution (alpha=1). We establish three theoretical results: (1) Safety Monotonicity--higher outer safety weight produces a weakly safer inner policy; (2) Inner Policy Convergence--projected gradient descent on the inner problem converges linearly under standard smoothness assumptions; (3) an Accountability Propagation bound that distributes responsibility across multi-hop delegation chains with a provable per-agent ceiling. We instantiate SBD in three high-stakes domains--medical AI (MIMIC-III), financial risk control (S&P 500), and educational agent supervision (ASSISTments)--specifying datasets, safety constraint sets, baselines, and evaluation protocols. This manuscript presents the formal framework and theoretical results in full; empirical validation following the protocols described herein is planned and will be reported in a forthcoming revision.
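Abstract (reference translation): As large language model (LLM) agents are deployed in high-stakes environments, the question of how to delegate subtasks safely to specialized sub-agents becomes critical. Existing work either addresses multi-agent architecture selection at design time or offers broad empirical guidelines, but neither provides a runtime mechanism that dynamically adjusts the safety-efficiency trade-off as the task context changes during execution. The paper proposes Safe Bilevel Delegation (SBD), a formal framework for runtime delegation safety in hierarchical multi-agent systems. SBD formulates task delegation as a bilevel optimization problem: an outer meta-weight network phi learns context-dependent safety-efficiency weights lambda(s) in [0,1], and an inner loop optimizes the delegation policy pi subject to the probabilistic safety constraint P(safe) >= 1-delta. The continuous delegation degree alpha in [0, 1] controls how much decision authority is transferred to each sub-agent, interpolating smoothly between full human override (alpha=0) and fully autonomous execution (alpha=1). Three theoretical results are established: (1) Safety Monotonicity: a higher outer safety weight produces a weakly safer inner policy; (2) Inner Policy Convergence: projected gradient descent on the inner problem converges linearly under standard smoothness assumptions; (3) an Accountability Propagation bound that distributes responsibility across multi-hop delegation chains with a provable per-agent ceiling. SBD is instantiated in three high-stakes domains, namely medical AI (MIMIC-III), financial risk control (S&P 500), and educational agent supervision (ASSISTments), with datasets, safety constraint sets, baselines, and evaluation protocols specified for each. The manuscript presents the formal framework and theoretical results in full; empirical validation following the described protocols is planned and will be reported in a forthcoming revision.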
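
The inner problem described in the abstract can be illustrated with a toy example. The sketch below is not the paper's formulation: the abstract does not state the objective, so the quadratic cost `lam * alpha**2 + (1 - lam) * (1 - alpha)**2`, the linear failure model `P(safe) = 1 - alpha * p_fail`, the fixed scalar `lam` standing in for the learned weight lambda(s), and all function and parameter names are assumptions made purely for illustration. It shows projected gradient descent on a scalar delegation degree alpha, where the projection clips alpha to the interval allowed by the safety constraint P(safe) >= 1-delta.

```python
def safe_alpha_cap(p_fail: float, delta: float) -> float:
    """Largest alpha satisfying the toy constraint 1 - alpha * p_fail >= 1 - delta.

    p_fail is a hypothetical sub-agent failure probability (assumption).
    """
    if p_fail <= 0.0:
        return 1.0  # a sub-agent that never fails imposes no cap
    return min(1.0, delta / p_fail)


def inner_pgd(lam, p_fail, delta, alpha0=0.5, step=0.25, iters=50):
    """Projected gradient descent on a toy inner objective (assumption).

    Minimizes lam * alpha**2 + (1 - lam) * (1 - alpha)**2 over
    alpha in [0, safe_alpha_cap(p_fail, delta)].
    Returns the final alpha and the list of iterates.
    """
    cap = safe_alpha_cap(p_fail, delta)
    alpha = min(max(alpha0, 0.0), cap)  # project the starting point
    history = [alpha]
    for _ in range(iters):
        # Gradient of the toy objective with respect to alpha.
        grad = 2.0 * lam * alpha - 2.0 * (1.0 - lam) * (1.0 - alpha)
        # Gradient step followed by projection onto the feasible interval.
        alpha = min(max(alpha - step * grad, 0.0), cap)
        history.append(alpha)
    return alpha, history


if __name__ == "__main__":
    # A larger safety weight lam should yield a smaller delegation degree.
    for lam in (0.1, 0.5, 0.9):
        alpha, _ = inner_pgd(lam, p_fail=0.05, delta=0.1)
        print(f"lam={lam:.1f} -> alpha={alpha:.4f}")
```

In this toy model the constrained minimizer is `min(1 - lam, cap)`, so raising `lam` never increases alpha, which mirrors the Safety Monotonicity statement in spirit only. With `step=0.25` the distance to the unconstrained minimizer halves on every iteration, a simple instance of linear convergence for a smooth, strongly convex objective.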

Related papers