Fugu-MT 論文翻訳(概要): Uncovering Emergent Physics Representations Learned In-Context by Large Language Models

論文の概要: Uncovering Emergent Physics Representations Learned In-Context by Large Language Models

arxiv url: http://arxiv.org/abs/2508.12448v1
Date: Sun, 17 Aug 2025 17:49:17 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:10.775811
Title: Uncovering Emergent Physics Representations Learned In-Context by Large Language Models
Title（参考訳）: 大規模言語モデルによる文脈学習による創発的物理表現の発見
Authors: Yeongwoo Song, Jaeyong Bae, Dong-Kyum Kim, Hawoong Jeong,
Abstract要約: 大規模言語モデル(LLM)は、テキストプロンプトだけで幅広いタスクを解くことができる、印象的なインコンテキスト学習(ICL)能力を示す。本稿では LLM の ICL 能力について考察する。物理系における動的予測タスクをプロキシとして使用し、LLMが文脈で物理を学習できるかどうかを評価する。
参考スコア（独自算出の注目度）: 1.8749305679160366
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) exhibit impressive in-context learning (ICL) abilities, enabling them to solve wide range of tasks via textual prompts alone. As these capabilities advance, the range of applicable domains continues to expand significantly. However, identifying the precise mechanisms or internal structures within LLMs that allow successful ICL across diverse, distinct classes of tasks remains elusive. Physics-based tasks offer a promising testbed for probing this challenge. Unlike synthetic sequences such as basic arithmetic or symbolic equations, physical systems provide experimentally controllable, real-world data based on structured dynamics grounded in fundamental principles. This makes them particularly suitable for studying the emergent reasoning behaviors of LLMs in a realistic yet tractable setting. Here, we mechanistically investigate the ICL ability of LLMs, especially focusing on their ability to reason about physics. Using a dynamics forecasting task in physical systems as a proxy, we evaluate whether LLMs can learn physics in context. We first show that the performance of dynamics forecasting in context improves with longer input contexts. To uncover how such capability emerges in LLMs, we analyze the model's residual stream activations using sparse autoencoders (SAEs). Our experiments reveal that the features captured by SAEs correlate with key physical variables, such as energy. These findings demonstrate that meaningful physical concepts are encoded within LLMs during in-context learning. In sum, our work provides a novel case study that broadens our understanding of how LLMs learn in context.
Abstract（参考訳）: 大規模言語モデル(LLM)は、テキストプロンプトだけで幅広いタスクを解くことができる、印象的なインコンテキスト学習(ICL)能力を示す。これらの能力が進歩するにつれて、適用範囲は拡大し続けている。しかし、多種多様なタスクのクラスにまたがってICCを成功させることができるLSM内の正確なメカニズムや内部構造を特定することは、いまだ解明されていない。物理ベースのタスクは、この課題を調査するための有望なテストベッドを提供する。基本的な算術や記号方程式のような合成列とは異なり、物理系は基本原理に基づく構造化力学に基づく実験的に制御可能な実世界のデータを提供する。これにより, LLMの創発的推論行動について, 現実的かつトラクタブルな環境下での研究に特に適している。本稿では LLM の ICL 能力,特に物理学的推論能力について考察する。物理系における動的予測タスクをプロキシとして使用し、LLMが文脈で物理を学習できるかどうかを評価する。まず、コンテキストにおける動的予測の性能が、より長い入力コンテキストで改善されることを示す。 LLMにおいてそのような機能がどのように現れるかを明らかにするために、スパースオートエンコーダ(SAE)を用いてモデルの残ストリームアクティベーションを分析する。実験の結果,SAEが捉えた特徴はエネルギーなどの物理変数と相関していることがわかった。これらの結果は、文脈内学習において、意味のある物理概念がLLM内にエンコードされていることを示している。まとめると、我々の研究は、LLMが文脈でどのように学習するかについての理解を広げる新しいケーススタディを提供する。

論文の概要: Uncovering Emergent Physics Representations Learned In-Context by Large Language Models

関連論文リスト