Fugu-MT 論文翻訳(概要): When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling

論文の概要: When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling

arxiv url: http://arxiv.org/abs/2512.03325v1
Date: Wed, 03 Dec 2025 00:23:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-04 20:02:55.051037
Title: When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling
Title（参考訳）: ガウス同値性はいつ失敗し、それをどのように修正するか:2次スケーリングを伴うランダムな特徴の非普遍的挙動
Authors: Garrett G. Wen, Hong Hu, Yue M. Lu, Zhou Fan, Theodor Misiakiewicz,
Abstract要約: ガウス同値理論 (GET) は、高次元の複素特徴の挙動をガウス級数によって捉えることができると述べている。しかし、数値実験により、この同値性は一般的なスケーリング体制下での単純な埋め込みでも失敗する可能性があることが示されている。我々は、低次元のガウス成分を高次元ガウスモデルに付加すると考えられる条件等価(CGE)モデルを導入する。
参考スコア（独自算出の注目度）: 15.148577493784051
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: A major effort in modern high-dimensional statistics has been devoted to the analysis of linear predictors trained on nonlinear feature embeddings via empirical risk minimization (ERM). Gaussian equivalence theory (GET) has emerged as a powerful universality principle in this context: it states that the behavior of high-dimensional, complex features can be captured by Gaussian surrogates, which are more amenable to analysis. Despite its remarkable successes, numerical experiments show that this equivalence can fail even for simple embeddings -- such as polynomial maps -- under general scaling regimes. We investigate this breakdown in the setting of random feature (RF) models in the quadratic scaling regime, where both the number of features and the sample size grow quadratically with the data dimension. We show that when the target function depends on a low-dimensional projection of the data, such as generalized linear models, GET yields incorrect predictions. To capture the correct asymptotics, we introduce a Conditional Gaussian Equivalent (CGE) model, which can be viewed as appending a low-dimensional non-Gaussian component to an otherwise high-dimensional Gaussian model. This hybrid model retains the tractability of the Gaussian framework and accurately describes RF models in the quadratic scaling regime. We derive sharp asymptotics for the training and test errors in this setting, which continue to agree with numerical simulations even when GET fails. Our analysis combines general results on CLT for Wiener chaos expansions and a careful two-phase Lindeberg swapping argument. Beyond RF models and quadratic scaling, our work hints at a rich landscape of universality phenomena in high-dimensional ERM.
Abstract（参考訳）: 現代の高次元統計学における主要な取り組みは、経験的リスク最小化(ERM)による非線形特徴埋め込みを訓練した線形予測器の分析に費やされている。ガウス同値理論(GET)は、この文脈において強力な普遍性原理として現れており、高次元で複雑な特徴の振る舞いはガウスの代理によって捉えられるが、解析にはより適している。その顕著な成功にもかかわらず、数値実験により、この同値性は一般的なスケーリング体制の下で単純な埋め込み(多項式写像など)でも失敗することを示した。本研究では,2次スケーリング方式におけるランダム特徴量(RF)モデルの設定における,特徴数とサンプルサイズの両方がデータ次元と2次的に増加する場合の分解について検討する。対象関数が一般化線形モデルなどのデータの低次元投影に依存する場合、GETは誤った予測を行う。正しい漸近を捉えるために、条件付きガウス等価(CGE)モデルを導入し、これは低次元のガウス成分を高次元ガウスモデルに付加すると見なすことができる。このハイブリッドモデルはガウスのフレームワークのトラクタビリティを保持し、二次スケーリングシステムにおけるRFモデルを正確に記述する。この設定では, GET が故障しても数値シミュレーションに一致し続け, トレーニングとテストの誤りに対して, 鋭い漸近を導出する。 Weenerカオス展開に対するCLTの一般的な結果と、慎重な2相リンデバーグスワップ引数を組み合わせた分析を行った。 RFモデルと二次スケーリング以外にも、高次元EMMにおける普遍性現象の豊かな風景を示唆している。

論文の概要: When does Gaussian equivalence fail and how to fix it: Non-universal behavior of random features with quadratic scaling

関連論文リスト