Fugu-MT 論文翻訳(概要): Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization

論文の概要: Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization

arxiv url: http://arxiv.org/abs/2606.01730v1
Date: Mon, 01 Jun 2026 05:50:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:31.400841
Title: Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization
Title（参考訳）: Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization
Authors: Jiangyu Chen, Banyi,
Abstract要約: 大規模言語モデル (LLM) はブラックボックス最適化のアドバイザとしてますます使われているが、その提案と自己報告された自信は、必ずしも下流の客観的値に調整されるとは限らない。離散多目的ベイズ最適化において, LLM 生成した専門家を盲目的に信頼せずに利用する方法について検討した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) are increasingly used as heuristic advisors for black-box optimization, yet their suggestions and self-reported confidence are not necessarily calibrated to downstream objective values. This issue becomes more pronounced in multi-objective Bayesian optimization, where different objectives may require different expert knowledge and where an LLM expert can be useful for one objective but misleading for another. We study how to use LLM-generated expert priors in discrete multi-objective Bayesian optimization without blindly trusting them. We propose an objective-wise reputation-market mechanism that treats each expert-objective pair as a falsifiable prior source. Expert weights are updated online from observed objective feedback, discounted over time, and gated by market-level trust. We then introduce a decoupled counterfactual gate that can use the LLM prior without confidence, use it with confidence, or abstain from the LLM prior entirely. Across controlled synthetic stress tests and three molecule optimization benchmarks with \qwenflash{}-generated expert priors, we find that dynamic objective-wise calibration improves robustness over fixed LLM priors. However, raw LLM confidence is not reliably beneficial: on ESOL, confidence is positively correlated with prediction error; on FreeSolv, confidence can help; and on Lipophilicity, ignoring confidence remains strongest. Our fixed three-arm counterfactual gate improves over the first counterfactual variant on ESOL and FreeSolv, while an attempted margin portfolio exposes a useful negative result: margin selection should be acquisition-aware rather than based only on one-step prior error.
Abstract（参考訳）: 大規模言語モデル (LLM) はブラックボックス最適化のヒューリスティックアドバイザとしてますます使われているが、その提案と自己報告された自信は、必ずしも下流の客観的値に調整されるとは限らない。この問題は多目的ベイズ最適化においてより顕著になり、異なる目的が異なる専門家の知識を必要とする場合と、LLMの専門家が一つの目的に有用であるが別の目的に誤解をもたらす場合である。離散多目的ベイズ最適化において, LLM 生成した専門家を盲目的に信頼せずに利用する方法について検討した。本稿では、各専門家と客観的なペアを偽造可能な事前情報源として扱う客観的評価市場機構を提案する。専門家の重みは、観察された客観的フィードバックからオンラインで更新され、時間の経過とともに割引され、市場レベルの信頼によって強制される。次に,LLM を信頼せずに,信頼を持って使用したり,信頼を持って使用したり,あるいは LLM を完全に排除したりできる非結合の対物ゲートを導入する。制御された合成応力試験と,\qwenflash{} 生成したエキスパート前駆体を用いた3分子最適化ベンチマークにより,動的主観的キャリブレーションが固定LDM前駆体よりも堅牢性を向上させることが判明した。 ESOLでは、信頼は予測エラーと肯定的に相関し、FreeSolvでは、信頼は役立つ。固定された3腕対物ゲートはESOLとFreeSolvの最初の対物変種よりも改善され、一方、試用されたマージンポートフォリオは有用なネガティブな結果を示している。

論文の概要: Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization

関連論文リスト