Fugu-MT 論文翻訳(概要): Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia

論文の概要: Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia

arxiv url: http://arxiv.org/abs/2603.12316v1
Date: Thu, 12 Mar 2026 18:00:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-16 17:38:11.698875
Title: Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia
Title（参考訳）: 完全連結ニューラルネットワークにおけるプルーニング誘起相:umentia、dementia、amentia
Authors: Haining Pan, Nakul Aggarwal, J. H. Pixley,
Abstract要約: 我々は、MNISTでトレーニングされた完全接続ニューラルネットワークについて検討し、トレーニング段階と評価段階の両方で独立にドロップアウト率を変化させた。 eumentia(ネットワークが学習する)、dementia(ネットワークが忘れる)、amentia(ネットワークが学習できない)の3つの相を識別する。以上の結果から, 投棄によって引き起こされるプルーニングは, 統計力学のレンズを通してニューラルネットワークの動作を理解する具体的環境を提供することが示された。
参考スコア（独自算出の注目度）: 0.18206461789819073
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Modern neural networks are heavily overparameterized, and pruning, which removes redundant neurons or connections, has emerged as a key approach to compressing them without sacrificing performance. However, while practical pruning methods are well developed, whether pruning induces sharp phase transitions in the neural networks and, if so, to what universality class they belong, remain open questions. To address this, we study fully-connected neural networks trained on MNIST, independently varying the dropout (i.e., removing neurons) rate at both the training and evaluation stages to map the phase diagram. We identify three distinct phases: eumentia (the network learns), dementia (the network has forgotten), and amentia (the network cannot learn), sharply distinguished by the power-law scaling of the cross-entropy loss with the training dataset size. {In the eumentia phase, the algebraic decay of the loss, as documented in the machine learning literature as neural scaling laws, is from the perspective of statistical mechanics the hallmark of quasi-long-range order.} We demonstrate that the transition between the eumentia and dementia phases is accompanied by scale invariance, with a diverging length scale that exhibits hallmarks of a Berezinskii-Kosterlitz-Thouless-like transition; the phase structure is robust across different network widths and depths. Our results establish that dropout-induced pruning provides a concrete setting in which neural network behavior can be understood through the lens of statistical mechanics.
Abstract（参考訳）: 現代のニューラルネットワークは過度にパラメータ化されており、冗長なニューロンや接続を取り除くプルーニングは、パフォーマンスを犠牲にすることなくそれらを圧縮するための重要なアプローチとして現れている。しかし、実用的な刈り取り法はよく開発されているが、刈り込みがニューラルネットワークの急激な位相遷移を誘発するかどうか、そしてもしそうであれば、彼らが属する普遍性クラスはオープンな問題のままである。そこで本研究では,MNISTでトレーニングされた完全連結ニューラルネットワークについて検討し,学習段階と評価段階の両方で,それぞれ独立にドロップアウト率(ニューロン除去)を変化させ,位相図をマッピングした。ネットワークが学習する)umentia、(ネットワークが学習する)dementia、(ネットワークが学習できない)amentiaの3つの異なるフェーズを特定し、トレーニングデータセットサイズによるクロスエントロピー損失のパワー-ロケーションによって明確に区別する。 {umentia 相において、損失の代数的崩壊は、機械学習の文献にニューラルスケーリング法則として記述されているように、統計力学の観点からは準長距離秩序の指標である。 The transition between the eumentia and dementia phases is accompanied with scale invariance, with a diverging length scale that showmarks of a Berezinskii-Kosterlitz-Thouless-like transition; the phase structure are robust across different network widths and depths。以上の結果から, 投棄によって引き起こされるプルーニングは, 統計力学のレンズを通してニューラルネットワークの動作を理解する具体的環境を提供することが示された。

論文の概要: Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia

関連論文リスト