Fugu-MT Paper Translation (Abstract): Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition

Paper overview: Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition

arxiv url: http://arxiv.org/abs/2601.07239v1
Date: Mon, 12 Jan 2026 06:19:09 GMT
Status: Translation complete
Last updated in system: 2026-01-13 19:08:01.241169
Title: Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition
Authors: Tanmay Joshi, Shourya Aggarwal, Anusa Saha, Aadi Pandey, Shreyash Dhoot, Vighnesh Rai, Raxit Goswami, Aman Chadha, Vinija Jain, Amitava Das
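Abstract summary: We argue that, for LLMs, deterministic inference kills. It kills the ability to model uncertainty, suppresses emergent abilities, collapses reasoning into a single brittle path, and weakens safety alignment by hiding tail risks.
Reference score (independently computed attention score): 14.945980804235885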
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deterministic inference is a comforting ideal in classical software: the same program on the same input should always produce the same output. As large language models move into real-world deployment, this ideal has been imported wholesale into inference stacks. Recent work from the Thinking Machines Lab has presented a detailed analysis of nondeterminism in LLM inference, showing how batch-invariant kernels and deterministic attention can enforce bitwise-identical outputs, positioning deterministic inference as a prerequisite for reproducibility and enterprise reliability. In this paper, we take the opposite stance. We argue that, for LLMs, deterministic inference kills. It kills the ability to model uncertainty, suppresses emergent abilities, collapses reasoning into a single brittle path, and weakens safety alignment by hiding tail risks. LLMs implement conditional distributions over outputs, not fixed functions. Collapsing these distributions to a single canonical completion may appear reassuring, but it systematically conceals properties central to artificial cognition. We instead advocate Stochastic CHAOS, treating distributional variability as a signal to be measured and controlled. Empirically, we show that deterministic inference is systematically misleading. Single-sample deterministic evaluation underestimates both capability and fragility, masking failure probability under paraphrases and noise. Phase-like transitions associated with emergent abilities disappear under greedy decoding. Multi-path reasoning degrades when forced onto deterministic backbones, reducing accuracy and diagnostic insight. Finally, deterministic evaluation underestimates safety risk by hiding rare but dangerous behaviors that appear only under multi-sample evaluation.
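The abstract's claim that single-sample deterministic evaluation can hide rare behaviors that multi-sample evaluation reveals can be illustrated with a toy sketch. This is not the paper's method or data: the completion labels, their probabilities, and the function names `greedy_decode`, `sample_decode`, and `estimate_rate` are all hypothetical, chosen only to show the difference between taking the most probable output and estimating an outcome's frequency by sampling.

```python
import random
from collections import Counter

# Hypothetical toy "model": a fixed conditional distribution over three
# completions for one prompt. Labels and probabilities are invented.
COMPLETIONS = {"safe_answer": 0.90, "hedged_answer": 0.08, "unsafe_answer": 0.02}


def greedy_decode(dist):
    """Deterministic inference: always return the most probable completion."""
    return max(dist, key=dist.get)


def sample_decode(dist, rng):
    """Stochastic inference: draw one completion from the distribution."""
    labels = list(dist)
    weights = [dist[label] for label in labels]
    return rng.choices(labels, weights=weights, k=1)[0]


def estimate_rate(dist, target, n_samples, seed=0):
    """Monte Carlo estimate of how often `target` appears across samples."""
    rng = random.Random(seed)
    counts = Counter(sample_decode(dist, rng) for _ in range(n_samples))
    return counts[target] / n_samples


if __name__ == "__main__":
    # Greedy decoding returns the same completion on every call, so the
    # low-probability outcome is never observed.
    print("greedy:", greedy_decode(COMPLETIONS))
    # Repeated sampling yields an estimate of the rare outcome's frequency.
    print("estimated unsafe rate:", estimate_rate(COMPLETIONS, "unsafe_answer", 10_000))
```

In this sketch the greedy path reports only the modal completion, while the sampled estimate exposes the low-probability outcome. A real evaluation would replace the fixed dictionary with calls to an actual model and a task-specific check for the target behavior.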
