Fugu-MT 論文翻訳(概要): Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

論文の概要: Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

arxiv url: http://arxiv.org/abs/2603.28959v1
Date: Mon, 30 Mar 2026 20:05:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:02.76141
Title: Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization
Title（参考訳）: ベイズ最適化における適応的獲得のためのマルチエージェントLLM
Authors: Andrea Carbonati, Mohammadsina Almasi, Hadis Anahideh,
Abstract要約: 本稿では,Large Language Models (LLM) が探索・探索戦略をどのように構築し,適応するかを示す。本稿では,探索・探索制御を戦略的政策仲介と戦術的候補生成に分解する多エージェントフレームワークを提案する。
参考スコア（独自算出の注目度）: 2.6954666679827137
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, yet how Large Language Models (LLMs) reason about and manage this trade-off remains poorly understood. Unlike Bayesian Optimization, where exploration and exploitation are explicitly encoded through acquisition functions, LLM-based optimization relies on implicit, prompt-based reasoning over historical evaluations, making search behavior difficult to analyze or control. In this work, we present a metric-level study of LLM-mediated search policy learning, studying how LLMs construct and adapt exploration-exploitation strategies under multiple operational definitions of exploration, including informativeness, diversity, and representativeness. We show that single-agent LLM approaches, which jointly perform strategy selection and candidate generation within a single prompt, suffer from cognitive overload, leading to unstable search dynamics and premature convergence. To address this limitation, we propose a multi-agent framework that decomposes exploration-exploitation control into strategic policy mediation and tactical candidate generation. A strategy agent assigns interpretable weights to multiple search criteria, while a generation agent produces candidates conditioned on the resulting search policy defined as weights. This decomposition renders exploration-exploitation decisions explicit, observable, and adjustable. Empirical results across various continuous optimization benchmarks indicate that separating strategic control from candidate generation substantially improves the effectiveness of LLM-mediated search.
Abstract（参考訳）: 探索と探索のトレードオフは、シーケンシャルな意思決定とブラックボックス最適化の中心であるが、Large Language Models (LLMs)がいかにしてこのトレードオフを推論し、管理するかは理解されていない。ベイズ最適化とは異なり、LLMに基づく最適化は歴史的評価に対する暗黙的かつ急進的な推論に依存しており、探索行動の分析や制御が困難である。本研究では, LLMによる探索政策学習のメトリクスレベル研究を行い, 情報性, 多様性, 代表性など, 探索の複数の操作的定義の下で, LLMが探索-探索戦略を構築し, 適応する方法について検討する。一つのプロンプト内で戦略選択と候補生成を共同で行う単一エージェントLSMアプローチは,認知的過負荷に悩まされ,不安定な探索ダイナミクスと早期収束をもたらすことを示す。この制限に対処するために、探索・探索制御を戦略的な政策仲介と戦術的候補生成に分解するマルチエージェントフレームワークを提案する。戦略エージェントは、解釈可能な重みを複数の探索基準に割り当て、生成エージェントは、重みとして定義された結果の探索ポリシーに条件付けられた候補を生成する。この分解により、探索・探索の決定は明確で、観測可能で、調整可能である。各種連続最適化ベンチマークにおける実験結果から, 候補生成からの戦略的制御の分離は, LLMによる探索の有効性を著しく向上させることが示された。

論文の概要: Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

関連論文リスト