Fugu-MT 論文翻訳(概要): Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

論文の概要: Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

arxiv url: http://arxiv.org/abs/2506.10268v1
Date: Thu, 12 Jun 2025 01:23:22 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-13 15:37:22.511498
Title: Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models
Title（参考訳）: 言語モデルにベイズ脳は存在するか? 大規模言語モデルにおける確率的・決定論的決定パターンの解消
Authors: Andrea Yaoyun Cui, Pengfei Yu,
Abstract要約: 特定の条件下では、言語モデルがほぼ決定論的意思決定を示すことが示される。このことは、サンプリング仮定に挑戦し、人間のような先入観を抽出する以前の手法を損なう。本稿では,ギブズサンプリングにおける決定論的決定パターンの識別方法を提案する。
参考スコア（独自算出の注目度）: 2.5812117322021644
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language models are essentially probability distributions over token sequences. Auto-regressive models generate sentences by iteratively computing and sampling from the distribution of the next token. This iterative sampling introduces stochasticity, leading to the assumption that language models make probabilistic decisions, similar to sampling from unknown distributions. Building on this assumption, prior research has used simulated Gibbs sampling, inspired by experiments designed to elicit human priors, to infer the priors of language models. In this paper, we revisit a critical question: Do language models possess Bayesian brains? Our findings show that under certain conditions, language models can exhibit near-deterministic decision-making, such as producing maximum likelihood estimations, even with a non-zero sampling temperature. This challenges the sampling assumption and undermines previous methods for eliciting human-like priors. Furthermore, we demonstrate that without proper scrutiny, a system with deterministic behavior undergoing simulated Gibbs sampling can converge to a "false prior." To address this, we propose a straightforward approach to distinguish between stochastic and deterministic decision patterns in Gibbs sampling, helping to prevent the inference of misleading language model priors. We experiment on a variety of large language models to identify their decision patterns under various circumstances. Our results provide key insights in understanding decision making of large language models.
Abstract（参考訳）: 言語モデルは基本的にトークン列上の確率分布である。自己回帰モデルは、次のトークンの分布から反復的に計算しサンプリングすることで文を生成する。この反復的なサンプリングは確率性を導入し、未知の分布からのサンプリングと同様、言語モデルが確率的決定を行うという仮定に繋がる。この仮定に基づいて、先行研究では、人間の先行性を引き出すために設計された実験にインスパイアされた模擬ギブスサンプリングを使用して、言語モデルの先行性を推測している。本稿では、言語モデルがベイズ脳を持っているかという重要な疑問を再考する。その結果, ある条件下では, 非ゼロサンプリング温度であっても, 最大推定値の生成など, 言語モデルがほぼ決定的な決定を下せることがわかった。このことは、サンプリング仮定に挑戦し、人間のような先入観を抽出する以前の手法を損なう。さらに, 適切な精査がなければ, 模擬ギブズサンプリングを行う決定論的挙動を持つシステムが「偽先行」に収束することを示した。これを解決するために,ギブズサンプリングにおける確率的および決定論的決定パターンを区別する簡単な手法を提案する。様々な状況下で,様々な言語モデルを用いて意思決定パターンを識別する実験を行った。この結果から,大規模言語モデルの意思決定を理解する上で重要な知見が得られた。

論文の概要: Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

関連論文リスト