Fugu-MT 論文翻訳(概要): Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems

論文の概要: Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems

arxiv url: http://arxiv.org/abs/2603.13256v1
Date: Tue, 24 Feb 2026 21:39:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:42.237411
Title: Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems
Title（参考訳）: 学習自由エージェントAI:多エージェントLLMシステムにおける確率的制御とコーディネーション
Authors: Mohammad Parsa Hosseini, Ankit Shah, Saiyra Qureshi, Alex Huang, Connie Miao, Wei Wei,
Abstract要約: マルチエージェントLLMコラボレーションのための軽量かつトレーニング不要なコントローラであるREDEREFを紹介する。信念誘導ルーティングはトークンの使用量を28%減らし,エージェントコールを17%減らし,タイム・ツー・サクセスを19%減らした。その結果, 簡易で解釈可能な確率的制御は, 訓練や微調整を伴わずに, マルチエージェントLLMシステムの効率と堅牢性を有意義に向上させることができることを示した。
参考スコア（独自算出の注目度）: 6.036652381757588
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Multi-agent large language model (LLM) systems enable complex, long-horizon reasoning by composing specialized agents, but practical deployment remains hindered by inefficient routing, noisy feedback, and high interaction cost. We introduce REDEREF, a lightweight and training-free controller for multi-agent LLM collaboration that improves routing efficiency during recursive delegation. REDEREF integrates (i) belief-guided delegation via Thompson sampling to prioritize agents with historically positive marginal contributions, (ii) reflection-driven re-routing using a calibrated LLM or programmatic judge, (iii) evidence-based selection rather than output averaging, and (iv) memory-aware priors to reduce cold-start inefficiency. Across multi-agent split-knowledge tasks, we show that while recursive retry alone saturates task success, belief-guided routing reduces token usage by 28%, agent calls by 17%, and time-to-success by 19% compared to random recursive delegation, and adapts gracefully under agent or judge degradation. These results demonstrate that simple, interpretable probabilistic control can meaningfully improve the efficiency and robustness of multi-agent LLM systems without training or fine-tuning.
Abstract（参考訳）: マルチエージェント大規模言語モデル (LLM) システムでは, 特殊エージェントを構成することで複雑で長期的推論が可能であるが, 非効率なルーティング, ノイズフィードバック, 高い相互作用コストによって, 実用的展開が妨げられている。我々は、再帰的デリゲート時のルーティング効率を向上させるマルチエージェントLLM協調のための軽量でトレーニング不要なコントローラであるREDEREFを紹介した。 REDEREFが統合 (i)トンプソンサンプリングによる信仰誘導代表団は、歴史的に肯定的な貢献をしたエージェントを優先する。二校正LDM又はプログラムジャッジを用いた反射駆動再描画三出力平均よりも証拠に基づく選択 (4) コールドスタートの非効率を抑えるために、メモリアウェアが先行する。マルチエージェント分割知識タスク全体では、再帰的再帰だけでタスク成功が飽和する一方で、信念誘導ルーティングはトークン使用量を28%削減し、エージェント呼び出しを17%削減し、ランダム再帰的デリゲートと比較してタイム・ツー・サクセスを19%削減し、エージェントや判断の劣化に対して適切に適応することを示した。これらの結果から, 簡易, 解釈可能な確率的制御は, 訓練や微調整を伴わずに, マルチエージェントLLMシステムの効率と堅牢性を有意義に向上できることが示された。

論文の概要: Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems

関連論文リスト