Fugu-MT 論文翻訳(概要): A mathematical theory for understanding when abstract representations emerge in neural networks

論文の概要: A mathematical theory for understanding when abstract representations emerge in neural networks

arxiv url: http://arxiv.org/abs/2510.09816v1
Date: Fri, 10 Oct 2025 19:30:57 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:29.636
Title: A mathematical theory for understanding when abstract representations emerge in neural networks
Title（参考訳）: 抽象表現がニューラルネットワークに現れるときの理解のための数学的理論
Authors: Bin Wang, W. Jeffrey Johnston, Stefano Fusi,
Abstract要約: フィードフォワード非線形ネットワークの最後に隠された層に潜伏変数の抽象表現が現れることが保証されていることを示す。これらの表現は、所望の出力の構造や入力刺激の意味を反映している。
参考スコア（独自算出の注目度）: 4.415536082342714
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Recent experiments reveal that task-relevant variables are often encoded in approximately orthogonal subspaces of the neural activity space. These disentangled low-dimensional representations are observed in multiple brain areas and across different species, and are typically the result of a process of abstraction that supports simple forms of out-of-distribution generalization. The mechanisms by which such geometries emerge remain poorly understood, and the mechanisms that have been investigated are typically unsupervised (e.g., based on variational auto-encoders). Here, we show mathematically that abstract representations of latent variables are guaranteed to appear in the last hidden layer of feedforward nonlinear networks when they are trained on tasks that depend directly on these latent variables. These abstract representations reflect the structure of the desired outputs or the semantics of the input stimuli. To investigate the neural representations that emerge in these networks, we develop an analytical framework that maps the optimization over the network weights into a mean-field problem over the distribution of neural preactivations. Applying this framework to a finite-width ReLU network, we find that its hidden layer exhibits an abstract representation at all global minima of the task objective. We further extend these analyses to two broad families of activation functions and deep feedforward architectures, demonstrating that abstract representations naturally arise in all these scenarios. Together, these results provide an explanation for the widely observed abstract representations in both the brain and artificial neural networks, as well as a mathematically tractable toolkit for understanding the emergence of different kinds of representations in task-optimized, feature-learning network models.
Abstract（参考訳）: 近年の研究では、タスク関連変数が神経活動空間のほぼ直交部分空間に符号化されていることが示されている。これらの非交叉低次元の表現は、複数の脳領域や異なる種で観察され、通常、単純なアウト・オブ・ディストリビューションの一般化の形式をサポートする抽象過程の結果である。このような測地が現れるメカニズムはいまだよく理解されておらず、調査されたメカニズムは一般に教師なし(例えば、変分オートエンコーダに基づく)である。ここでは、潜伏変数の抽象表現が、これらの潜伏変数に直接依存するタスクでトレーニングされた際に、フィードフォワード非線形ネットワークの最後の隠れた層に現れることを数学的に示す。これらの抽象表現は、所望の出力の構造や入力刺激の意味を反映している。これらのネットワークに現れるニューラル表現を調べるために,ニューラルネットワークの重みに対する最適化を,ニューラルプレアクティベーションの分布に関する平均場問題にマッピングする分析フレームワークを開発した。このフレームワークを有限幅ReLUネットワークに適用すると、その隠蔽層がタスク目的のすべての大域的ミニマに抽象表現を示すことが分かる。さらに、これらの分析を2種類の活性化関数とディープフィードフォワードアーキテクチャに拡張し、これらすべてのシナリオにおいて抽象表現が自然に発生することを示す。これらの結果は、脳と人工ニューラルネットワークの双方で広く観察されている抽象表現を説明するとともに、タスク最適化された特徴学習ネットワークモデルにおける様々な種類の表現の出現を理解する数学的に抽出可能なツールキットを提供する。

論文の概要: A mathematical theory for understanding when abstract representations emerge in neural networks

関連論文リスト