Fugu-MT 論文翻訳(概要): Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

論文の概要: Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

arxiv url: http://arxiv.org/abs/2510.22860v1
Date: Sun, 26 Oct 2025 22:46:26 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 19:54:32.568189
Title: Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement
Title（参考訳）: 浅部から遠ざかる:残差による脳波予測推論
Authors: Linyang He, Tianjun Zhong, Richard Antonello, Gavin Mischler, Micah Goldblum, Nima Mesgarani,
Abstract要約: 現代の大規模言語モデル(LLM)は、言語に対する神経反応をモデル化するためにますます使われている。彼らの内部表現は極めて「絡み合っている」ものであり、語彙、構文、意味、推論に関する情報を混ぜている。この絡み合いは、言語学的に浅い特徴に対する従来の脳のコード解析に偏っている。
参考スコア（独自算出の注目度）: 43.96899536703126
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Understanding how the human brain progresses from processing simple linguistic inputs to performing high-level reasoning is a fundamental challenge in neuroscience. While modern large language models (LLMs) are increasingly used to model neural responses to language, their internal representations are highly "entangled," mixing information about lexicon, syntax, meaning, and reasoning. This entanglement biases conventional brain encoding analyses toward linguistically shallow features (e.g., lexicon and syntax), making it difficult to isolate the neural substrates of cognitively deeper processes. Here, we introduce a residual disentanglement method that computationally isolates these components. By first probing an LM to identify feature-specific layers, our method iteratively regresses out lower-level representations to produce four nearly orthogonal embeddings for lexicon, syntax, meaning, and, critically, reasoning. We used these disentangled embeddings to model intracranial (ECoG) brain recordings from neurosurgical patients listening to natural speech. We show that: 1) This isolated reasoning embedding exhibits unique predictive power, accounting for variance in neural activity not explained by other linguistic features and even extending to the recruitment of visual regions beyond classical language areas. 2) The neural signature for reasoning is temporally distinct, peaking later (~350-400ms) than signals related to lexicon, syntax, and meaning, consistent with its position atop a processing hierarchy. 3) Standard, non-disentangled LLM embeddings can be misleading, as their predictive success is primarily attributable to linguistically shallow features, masking the more subtle contributions of deeper cognitive processing.
Abstract（参考訳）: 単純な言語入力の処理から高いレベルの推論に至るまで、人間の脳がどのように進歩するかを理解することは、神経科学における根本的な課題である。現代の大規模言語モデル(LLM)は、言語に対する神経反応をモデル化するためにますます使われてきているが、その内部表現は、辞書、構文、意味、推論に関する情報を混在させ、非常に「絡み合っている」。この絡み合いは、言語的に浅い特徴(例えば、レキシコン、構文)に対する従来の脳のコード解析に偏りがあり、認知的に深いプロセスの神経基質を分離することは困難である。本稿では,これらの成分を計算的に分離する残差分散法を提案する。まず,特徴特化層を特定するためにLMを探索することにより,低レベルの表現を反復的に退避させ,語彙,構文,意味,批判的に4つのほぼ直交的な埋め込みを生成する。自然音声を聴く神経外科患者の頭蓋内脳波(ECoG)をモデル化するために,これらのアンタングル埋め込みを用いた。以下に示す。 1) この分離された推論埋め込みは,他の言語的特徴によって説明されない神経活動のばらつきを考慮し,古典的な言語領域を超えて視覚領域の採用にまで拡張する,独自の予測力を示す。 2) 推論のためのニューラルシグネチャは,レキシコン,構文,意味に関連する信号よりも後(約350～400ms)で,処理階層上の位置と一致している。 3) 言語学的に浅い特徴に起因し, より深い認知処理のより微妙な寄与を隠蔽するため, 標準的な非異方性LLM埋め込みは誤解を招く可能性がある。

論文の概要: Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

関連論文リスト