Fugu-MT 論文翻訳(概要): Category learning in deep neural networks: Information content and geometry of internal representations

論文の概要: Category learning in deep neural networks: Information content and geometry of internal representations

arxiv url: http://arxiv.org/abs/2510.19021v1
Date: Tue, 21 Oct 2025 19:02:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:14.52483
Title: Category learning in deep neural networks: Information content and geometry of internal representations
Title（参考訳）: ディープニューラルネットワークにおけるカテゴリー学習:内部表現の情報内容と幾何学
Authors: Laurent Bonnasse-Gahot, Jean-Pierre Nadal,
Abstract要約: 動物では、カテゴリー学習はカテゴリ境界に近い刺激の識別を促進する。この現象は分類知覚と呼ばれ、分類タスクで訓練された人工ニューラルネットワークでも経験的に観察された。ベイズコストの最小化(クロスエントロピー損失)は,カテゴリの集合と,決定層に先行する神経活動の相互情報の最大化を意味することを示す。
参考スコア（独自算出の注目度）: 2.1485350418225244
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In animals, category learning enhances discrimination between stimuli close to the category boundary. This phenomenon, called categorical perception, was also empirically observed in artificial neural networks trained on classification tasks. In previous modeling works based on neuroscience data, we show that this expansion/compression is a necessary outcome of efficient learning. Here we extend our theoretical framework to artificial networks. We show that minimizing the Bayes cost (mean of the cross-entropy loss) implies maximizing the mutual information between the set of categories and the neural activities prior to the decision layer. Considering structured data with an underlying feature space of small dimension, we show that maximizing the mutual information implies (i) finding an appropriate projection space, and, (ii) building a neural representation with the appropriate metric. The latter is based on a Fisher information matrix measuring the sensitivity of the neural activity to changes in the projection space. Optimal learning makes this neural Fisher information follow a category-specific Fisher information, measuring the sensitivity of the category membership. Category learning thus induces an expansion of neural space near decision boundaries. We characterize the properties of the categorical Fisher information, showing that its eigenvectors give the most discriminant directions at each point of the projection space. We find that, unexpectedly, its maxima are in general not exactly at, but near, the class boundaries. Considering toy models and the MNIST dataset, we numerically illustrate how after learning the two Fisher information matrices match, and essentially align with the category boundaries. Finally, we relate our approach to the Information Bottleneck one, and we exhibit a bias-variance decomposition of the Bayes cost, of interest on its own.
Abstract（参考訳）: 動物では、カテゴリー学習はカテゴリ境界に近い刺激の識別を促進する。この現象は分類知覚と呼ばれ、分類タスクで訓練された人工ニューラルネットワークでも経験的に観察された。これまでの神経科学データに基づくモデリングでは、この拡張/圧縮が効率的な学習に必要な結果であることが示されている。ここでは、理論的枠組みを人工ネットワークに拡張する。ベイズコストの最小化(クロスエントロピー損失)は,カテゴリの集合と,決定層に先行する神経活動の相互情報の最大化を意味することを示す。小次元の特徴空間を持つ構造化データを考えると、相互情報の最大化は意味があることが分かる。一適切な射影空間を見つけること、及び (ii)適切な計量を用いて神経表現を構築すること。後者は、射影空間の変化に対する神経活動の感度を測定するフィッシャー情報行列に基づいている。最適な学習により、このニューラルフィッシャー情報はカテゴリ固有のフィッシャー情報に従い、カテゴリメンバーシップの感度を測定する。カテゴリー学習は、決定境界付近の神経空間の拡大を誘導する。分類的フィッシャー情報の特徴を特徴付け、その固有ベクトルが射影空間の各点において最も識別可能な方向を与えることを示す。意外なことに、その極大性は一般に、正確には、しかし、クラス境界に近いものではない。玩具モデルとMNISTデータセットを考慮すると、2つのフィッシャー情報行列がどのように一致しているかを数値的に説明し、本質的にカテゴリ境界と整合する。最後に、我々のアプローチをインフォメーション・ボトルネック(Information Bottleneck)に関連付け、ベイズコストのバイアス分散分解を示す。

論文の概要: Category learning in deep neural networks: Information content and geometry of internal representations

関連論文リスト