Fugu-MT paper translation (abstract): Words as Difference Makers: How Large Language Models Determine Causal Structure in Text

Paper overview: Words as Difference Makers: How Large Language Models Determine Causal Structure in Text

arxiv url: http://arxiv.org/abs/2606.22430v1
Date: Sun, 21 Jun 2026 10:40:47 GMT
Status: Translation complete
Last updated in system: 2026-06-25 18:19:46.97028
Title: Words as Difference Makers: How Large Language Models Determine Causal Structure in Text
Title (reference translation): Words as Difference Makers: How Large Language Models Determine the Causal Structure of Text
Authors: Wolfgang Pietsch
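Abstract summary: I argue that large language models (LLMs) employ a specific inductive approach based on a difference-making logic. I analyze specific architectural features of LLMs to determine their roles in variational induction.
Reference score (independently calculated attention level): 0.0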
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Because large language models (LLMs) are impressively successful in predicting text, it appears that they must have access to a 'world model' representing causal and definitional structure. However, the dominant formalisms of modern causal inference -- Judea Pearl's interventionist approach and the Neyman-Rubin potential outcomes framework -- struggle to illuminate how LLMs learn causal structure. I resolve this puzzle by arguing that LLMs employ a specific inductive approach based on a difference-making logic -- sometimes called variational induction. I demonstrate how central aspects of this logic are realized during training, where LLMs require enormous amounts of text data from a wide range of contexts to identify difference- and indifference-makers within word sequences. Furthermore, I analyze specific architectural features of LLMs -- such as token embeddings and self-attention -- to determine their roles in variational induction. The difference-making logic of LLMs fundamentally parallels the experimental method, where causal relations are derived by systematically varying individual circumstances to determine their influence on a phenomenon.
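Abstract (reference translation): Because large language models (LLMs) predict text with remarkable success, it seems that they must have access to a 'world model' representing causal and definitional structure. However, the dominant formalisms of modern causal inference (Judea Pearl's interventionist approach and the Neyman-Rubin potential outcomes framework) struggle to explain how LLMs learn causal structure. I resolve this puzzle by arguing that LLMs employ a specific inductive approach based on a difference-making logic, sometimes called variational induction. I show how central aspects of this logic are realized during training, where LLMs need enormous amounts of text data from a wide range of contexts to identify difference-makers and indifference-makers within word sequences. I further analyze specific architectural features of LLMs, such as token embeddings and self-attention, to determine their roles in variational induction. The difference-making logic of LLMs fundamentally parallels the experimental method, in which causal relations are derived by systematically varying individual circumstances to determine their influence on a phenomenon.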
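The difference-making logic described in the abstract can be illustrated with a toy sketch. This is not the author's formalism and not how an LLM is trained; it only shows the basic comparison the abstract refers to: vary one circumstance while holding the others fixed, then check whether the phenomenon changes. The function name `difference_makers`, the data layout, and the lamp example are all hypothetical and chosen for illustration.

```python
from itertools import combinations


def difference_makers(observations):
    """Classify circumstances by pairwise comparison (illustrative only).

    observations: list of (circumstances, phenomenon) pairs, where
    circumstances is a dict mapping a name to a bool and every dict is
    assumed to use the same keys.

    Returns (difference, indifference): two sets of circumstance names.
    A name lands in `difference` if changing it alone changed the
    phenomenon in at least one pair, and in `indifference` if changing
    it alone left the phenomenon unchanged in at least one pair.
    """
    difference, indifference = set(), set()
    for (ctx_a, out_a), (ctx_b, out_b) in combinations(observations, 2):
        changed = [name for name in ctx_a if ctx_a[name] != ctx_b[name]]
        if len(changed) != 1:
            continue  # zero or several circumstances varied: no clean comparison
        name = changed[0]
        if out_a != out_b:
            difference.add(name)
        else:
            indifference.add(name)
    return difference, indifference


# Hypothetical data: does the lamp light up?
OBSERVATIONS = [
    ({"switch_on": True, "window_open": False}, True),
    ({"switch_on": False, "window_open": False}, False),
    ({"switch_on": True, "window_open": True}, True),
    ({"switch_on": False, "window_open": True}, False),
]

if __name__ == "__main__":
    makers, non_makers = difference_makers(OBSERVATIONS)
    print("difference makers:", makers)
    print("indifference makers:", non_makers)
```

In this toy data the switch comes out as a difference maker and the window as an indifference maker. Both verdicts hold only relative to the background circumstances that were actually observed, which is consistent with the abstract's point that data from a wide range of contexts is needed.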

Related papers