Fugu-MT 論文翻訳(概要): On the Loss Landscape Geometry of Regularized Deep Matrix Factorization: Uniqueness and Sharpness

論文の概要: On the Loss Landscape Geometry of Regularized Deep Matrix Factorization: Uniqueness and Sharpness

arxiv url: http://arxiv.org/abs/2603.27072v1
Date: Sat, 28 Mar 2026 01:22:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:44.768927
Title: On the Loss Landscape Geometry of Regularized Deep Matrix Factorization: Uniqueness and Sharpness
Title（参考訳）: 正規化深度行列係数の損失景観幾何学:特異性とシャープ性
Authors: Anil Kamber, Rahul Parhi,
Abstract要約: ヘッセンスペクトルは正則化深部スカラー分解問題の最小値に一定であり、二乗誤差損失を持つ。対象行列がルベーグ測度ゼロ集合に属さないなら、各層のフロベニウスノルムはすべての最小化子に対して一定である。
参考スコア（独自算出の注目度）: 5.736588561666141
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Weight decay is ubiquitous in training deep neural network architectures. Its empirical success is often attributed to capacity control; nonetheless, our theoretical understanding of its effect on the loss landscape and the set of minimizers remains limited. In this paper, we show that $\ell^2$-regularized deep matrix factorization/deep linear network training problems with squared-error loss admit a unique end-to-end minimizer for all target matrices subject to factorization, except for a set of Lebesgue measure zero formed by the depth and the regularization parameter. This observation reveals fundamental properties of the loss landscape of regularized deep matrix factorization problems: the Hessian spectrum is constant across all minimizers of the regularized deep scalar factorization problem with squared-error loss. Moreover, we show that, in regularized deep matrix factorization problems with squared-error loss, if the target matrix does not belong to the Lebesgue measure-zero set, then the Frobenius norm of each layer is constant across all minimizers. This, in turn, yields a global lower bound on the trace of the Hessian evaluated at any minimizer of the regularized deep matrix factorization problem. Furthermore, we establish a critical threshold for the regularization parameter above which the unique end-to-end minimizer collapses to zero.
Abstract（参考訳）: 重崩壊は、ディープニューラルネットワークアーキテクチャのトレーニングにおいてユビキタスである。その実証的な成功は、しばしばキャパシティコントロールによるものであるが、しかしながら、損失ランドスケープと最小限の集合に対する我々の理論的理解は限定的である。本稿では,二乗誤差損失を伴い,$\ell^2$-regularized Deep matrix factorization/deep linear network training problems have a unique end-to-end minimalr for all target matrixs subject to factorization, except for a set of a set of Lebesgue measure zero formed by the depth and the regularization parameters。この観測は、正則化深層行列因数分解問題の損失ランドスケープの基本的な性質を明らかにする: ヘッセンスペクトルは正則化深部スカラー因数分解問題のすべての最小値に一定である。さらに、二乗誤差損失を伴う正規化深度行列分解問題において、ターゲット行列がルベーグ測度ゼロ集合に属さない場合、各層のフロベニウスノルムはすべての最小化子に対して一定であることを示す。これは、正則化された深行列分解問題の任意の最小値で評価されたヘッセンのトレース上の大域的な下界を与える。さらに、上述の正規化パラメータに対して、一意の終端最小値が0に崩壊する臨界しきい値を確立する。

論文の概要: On the Loss Landscape Geometry of Regularized Deep Matrix Factorization: Uniqueness and Sharpness

関連論文リスト