Fugu-MT 論文翻訳(概要): LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs

論文の概要: LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs

arxiv url: http://arxiv.org/abs/2605.06915v1
Date: Thu, 07 May 2026 20:25:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.604549
Title: LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs
Title（参考訳）: LLM は(一貫して)ベイズ的ではない: LLM の確率論的信念の内部的(内的)矛盾を定量化する
Authors: Chacha Chen, Matthew Jörke, Adam Goliński, Masha Fedzechkina, Guillermo Sapiro, Sinead Williamson, Nicholas Foti,
Abstract要約: 本稿では,情報処理規則としてLLMを研究する新しい手法を紹介する。我々は、情報処理ギャップを利用して、LCMが証拠から確率的信念を更新する方法の内部(内部)の整合性を研究する。
参考スコア（独自算出の注目度）: 13.649992636657347
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Modern AI systems are being deployed in complex domains such as medicine, science, and law, where it is important that they not only produce correct answers, but also represent and update uncertain beliefs about the world as new evidence arrives. We introduce the novel technique of studying LLMs as information processing rules and utilize the information processing gap to study the internal (in)consistencies of how LLMs update their probabilistic beliefs from evidence. Our extensive experiments evaluate multiple approaches in which LLMs can incorporate evidence into their beliefs. Some of these approaches produce (nearly) Bayesian updates; others seem to use a learned heuristic. Surprisingly, the non-Bayesian heuristic updates often outperform exact Bayesian computation in terms of downstream task performance -- indicating the LLMs' probabilistic models of the world are misspecified. Lastly, we show how our measure can provide diagnostics to identify issues with LLM-powered inferential systems.
Abstract（参考訳）: 現代のAIシステムは、医学、科学、法といった複雑な領域に展開されており、正しい答えを生み出すだけでなく、新たな証拠が到来するにつれて、世界に関する不確実な信念を表現し、更新することが重要である。本稿では,LLMを情報処理規則として研究する新しい手法を紹介し,その情報処理ギャップを利用して,LCMが証拠から確率的信念を更新する方法の内的(内的)整合性について検討する。 LLMが証拠を彼らの信念に組み込むための複数のアプローチを評価する。これらのアプローチのいくつかは(ほぼ)ベイズ的更新を生み出している。驚くべきことに、非ベイズ的ヒューリスティックな更新は、ダウンストリームタスクのパフォーマンスの観点から、正確なベイズ計算よりも優れていることが多い。最後に,LLMを用いた推論システムにおける問題を特定するための診断手法について述べる。

論文の概要: LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs

関連論文リスト