Fugu-MT 論文翻訳(概要): Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support

論文の概要: Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support

arxiv url: http://arxiv.org/abs/2601.22662v1
Date: Fri, 30 Jan 2026 07:29:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-02 18:28:15.296905
Title: Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support
Title（参考訳）: 適応的決定経路を有するタスク対応LCM協議会による意思決定支援
Authors: Wei Zhu, Lixing Yu, Hao-Ren Yao, Zhiwen Tang, Kun Yue,
Abstract要約: Task-Aware LLM Council (TALC) はモンテカルロ木探索 (MCTS) と大規模言語モデルのカウンシルを統合している。 TALCは、強いベースラインよりも優れたタスク成功率と検索効率の向上を実現している。
参考スコア（独自算出の注目度）: 6.468209380404613
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have shown strong capabilities across diverse decision-making tasks. However, existing approaches often overlook the specialization differences among available models, treating all LLMs as uniformly applicable regardless of task characteristics. This limits their ability to adapt to varying reasoning demands and task complexities. In this work, we propose Task-Aware LLM Council (TALC), a task-adaptive decision framework that integrates a council of LLMs with Monte Carlo Tree Search (MCTS) to enable dynamic expert selection and efficient multi-step planning. Each LLM is equipped with a structured success memory profile derived from prior task trajectories, enabling semantic matching between current reasoning context and past successes. At each decision point, TALC routes control to the most contextually appropriate model and estimates node value using a dual-signal mechanism that fuses model-based evaluations with historical utility scores. These signals are adaptively weighted based on intra-node variance and used to guide MCTS selection, allowing the system to balance exploration depth with planning confidence. Experiments on WebShop, HumanEval, and the Game of 24 demonstrate that TALC achieves superior task success rates and improved search efficiency compared to strong baselines, validating the benefits of specialization-aware routing and adaptive planning.
Abstract（参考訳）: 大規模言語モデル(LLM)は、さまざまな意思決定タスクにまたがる強力な能力を示している。しかし、既存のアプローチでは、利用可能なモデル間の特殊化の違いを見落とし、全てのLCMをタスク特性に関係なく均一に適用できるとみなすことが多い。これにより、さまざまな推論要求やタスクの複雑さに適応する能力が制限される。本研究では,LCMのカウンシルとモンテカルロ木探索(MCTS)を統合し,動的専門家の選択と効率的なマルチステップ計画を可能にするタスク適応型LCMカウンシル(TALC)を提案する。各LSMは、以前のタスク軌跡から派生した構造化された成功メモリプロファイルを備えており、現在の推論コンテキストと過去の成功とのセマンティックマッチングを可能にする。各決定点において、TALCは、制御を最も文脈的に適切なモデルにルートし、履歴ユーティリティスコアとモデルに基づく評価を融合させる二重信号機構を用いてノード値を推定する。これらの信号は、ノード内分散に基づいて適応的に重み付けされ、MCTS選択を誘導するために使用され、システムは探索深度と計画信頼性のバランスをとることができる。 WebShop、HumanEval、Game of 24の実験では、TALCは強力なベースラインよりも優れたタスク成功率と検索効率の向上を実現し、特殊化対応ルーティングと適応計画の利点を検証している。

論文の概要: Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support

関連論文リスト