Fugu-MT 論文翻訳(概要): Incoherent Beliefs & Inconsistent Actions in Large Language Models

論文の概要: Incoherent Beliefs & Inconsistent Actions in Large Language Models

arxiv url: http://arxiv.org/abs/2511.13240v1
Date: Mon, 17 Nov 2025 11:04:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-18 14:36:25.142428
Title: Incoherent Beliefs & Inconsistent Actions in Large Language Models
Title（参考訳）: 大規模言語モデルにおける一貫性のない信念と一貫性のない行動
Authors: Arka Pal, Teo Kitanovski, Arthur Liang, Akilesh Potti, Micah Goldblum,
Abstract要約: 現実世界のタスクや環境は、大きな言語モデル(LLM)が一般的に評価される静的データセットとは異なる。 LLMのパフォーマンスの2つの重要な要素について検討する: LLMが彼らの信念を一貫性を持って更新する能力と、彼らが取っている行動がそれらの信念と整合している範囲である。本結果は,複雑な実世界の環境下でのLCMの挙動を予測することの難しさを浮き彫りにする。
参考スコア（独自算出の注目度）: 33.54139088666698
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Real-world tasks and environments exhibit differences from the static datasets that large language models (LLMs) are typically evaluated on. Such tasks can involve sequential interaction, requiring coherent updating of beliefs in light of new evidence, and making appropriate decisions based on those beliefs. Predicting how LLMs will perform in such dynamic environments is important, but can be tricky to determine from measurements in static settings. In this work, we examine two critical components of LLM performance: the ability of LLMs to coherently update their beliefs, and the extent to which the actions they take are consistent with those beliefs. First, we find that LLMs are largely inconsistent in how they update their beliefs; models can exhibit up to a 30% average difference between the directly elicited posterior, and the correct update of their prior. Second, we find that LLMs also often take actions which are inconsistent with the beliefs they hold. On a betting market, for example, LLMs often do not even bet in the same direction as their internally held beliefs over the underlying outcomes. We also find they have moderate self-inconsistency in how they respond to challenges by users to given answers. Finally, we show that the above properties hold even for strong models that obtain high accuracy or that are well-calibrated on the tasks at hand. Our results highlight the difficulties of predicting LLM behavior in complex real-world settings.
Abstract（参考訳）: 現実世界のタスクや環境は、大きな言語モデル(LLM)が一般的に評価される静的データセットとは異なる。このようなタスクは、シーケンシャルな相互作用を伴い、新しい証拠に照らして信念を一貫性を持って更新し、それらの信念に基づいて適切な決定を行う必要がある。 LLMがこのような動的環境でどのように機能するかを予測することは重要であるが、静的な設定での測定から判断するのは難しい。本研究では,LLMのパフォーマンスの2つの重要な要素について考察する。まず、LSMは、彼らの信念をどう更新するかにほとんど矛盾しておらず、モデルは直接引き起こされた後部とそれ以前の正しい更新の間に最大30%の平均的な差を示すことができる。第二に、LLMは、自分の持つ信念と矛盾する行動をとることもしばしばある。例えば賭け市場では、LSMは、基礎となる結果に対する内部的な信念と同じ方向に賭けることさえないことが多い。また、ユーザによる回答の課題への対応方法にも、適度な自己整合性があることに気付きました。最後に, 上記の特性が, 高い精度が得られる, あるいは手前のタスクに対して十分に校正された強いモデルであっても成り立つことを示す。本結果は,複雑な実世界の環境下でのLCMの挙動を予測することの難しさを浮き彫りにする。

論文の概要: Incoherent Beliefs & Inconsistent Actions in Large Language Models

関連論文リスト