Fugu-MT 論文翻訳(概要): Generalization in Representation Models via Random Matrix Theory: Application to Recurrent Networks

論文の概要: Generalization in Representation Models via Random Matrix Theory: Application to Recurrent Networks

arxiv url: http://arxiv.org/abs/2511.02401v1
Date: Tue, 04 Nov 2025 09:30:31 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-10 21:00:44.525824
Title: Generalization in Representation Models via Random Matrix Theory: Application to Recurrent Networks
Title（参考訳）: ランダム行列理論による表現モデルの一般化:リカレントネットワークへの応用
Authors: Yessin Moakher, Malik Tiomoko, Cosme Louart, Zhenyu Liao,
Abstract要約: まず,固定された特徴表現(凍結中間層)と学習可能な読み出し層を用いたモデルの一般化誤差について検討する。一般化誤差に対する閉形式式を導出するためにランダム行列理論を適用する。次に、この解析を繰り返し表現に適用し、その性能を特徴付ける簡潔な公式を得る。
参考スコア（独自算出の注目度）: 7.721672385781673
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We first study the generalization error of models that use a fixed feature representation (frozen intermediate layers) followed by a trainable readout layer. This setting encompasses a range of architectures, from deep random-feature models to echo-state networks (ESNs) with recurrent dynamics. Working in the high-dimensional regime, we apply Random Matrix Theory to derive a closed-form expression for the asymptotic generalization error. We then apply this analysis to recurrent representations and obtain concise formula that characterize their performance. Surprisingly, we show that a linear ESN is equivalent to ridge regression with an exponentially time-weighted (''memory'') input covariance, revealing a clear inductive bias toward recent inputs. Experiments match predictions: ESNs win in low-sample, short-memory regimes, while ridge prevails with more data or long-range dependencies. Our methodology provides a general framework for analyzing overparameterized models and offers insights into the behavior of deep learning networks.
Abstract（参考訳）: まず,固定された特徴表現(凍結中間層)と学習可能な読み出し層を用いたモデルの一般化誤差について検討する。この設定は、深いランダム機能モデルから、リカレントダイナミクスを備えたエコー状態ネットワーク(ESN)まで、様々なアーキテクチャを含んでいる。高次元状態において、漸近的一般化誤差に対する閉形式式を導出するためにランダム行列理論を適用する。次に、この解析を繰り返し表現に適用し、それらの性能を特徴付ける簡潔な公式を得る。意外なことに、線形ESNは指数的に時間重み付き('Memory'')入力共分散を持つリッジ回帰と等価であり、最近の入力に対する明らかな帰納バイアスが明らかである。実験は予測にマッチする: ESNは低サンプリング、短メモリのレシエーションで勝利し、リッジはより多くのデータや長距離依存関係で優位である。提案手法は,過パラメータモデル解析のための一般的なフレームワークを提供し,ディープラーニングネットワークの動作に関する洞察を提供する。

論文の概要: Generalization in Representation Models via Random Matrix Theory: Application to Recurrent Networks

関連論文リスト