Fugu-MT 論文翻訳(概要): Precise asymptotic analysis of Sobolev training for random feature models

論文の概要: Precise asymptotic analysis of Sobolev training for random feature models

arxiv url: http://arxiv.org/abs/2511.03050v1
Date: Tue, 04 Nov 2025 22:49:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-06 18:19:32.265944
Title: Precise asymptotic analysis of Sobolev training for random feature models
Title（参考訳）: ランダム特徴モデルのためのソボレフ訓練の高精度漸近解析
Authors: Katharine E Fisher, Matthew TC Li, Youssef Marzouk, Timo Schorlepp,
Abstract要約: 本研究では,ソボレフトレーニングが高次元の予測モデルの一般化誤差に及ぼす影響について検討した。本研究は,雑音関数と勾配データを補間することにより,モデルが最適に機能する設定を同定する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Gradient information is widely useful and available in applications, and is therefore natural to include in the training of neural networks. Yet little is known theoretically about the impact of Sobolev training -- regression with both function and gradient data -- on the generalization error of highly overparameterized predictive models in high dimensions. In this paper, we obtain a precise characterization of this training modality for random feature (RF) models in the limit where the number of trainable parameters, input dimensions, and training data tend proportionally to infinity. Our model for Sobolev training reflects practical implementations by sketching gradient data onto finite dimensional subspaces. By combining the replica method from statistical physics with linearizations in operator-valued free probability theory, we derive a closed-form description for the generalization errors of the trained RF models. For target functions described by single-index models, we demonstrate that supplementing function data with additional gradient data does not universally improve predictive performance. Rather, the degree of overparameterization should inform the choice of training method. More broadly, our results identify settings where models perform optimally by interpolating noisy function and gradient data.
Abstract（参考訳）: グラディエント情報はアプリケーションで広く利用でき、ニューラルネットワークのトレーニングに含めるのは自然である。しかし、ソボレフトレーニング(関数と勾配データの回帰)が高次元の高度に過度にパラメータ化された予測モデルの一般化誤差に与える影響について理論的にはほとんど知られていない。本稿では、トレーニング可能なパラメータ数、入力次元、トレーニングデータが無限大に比例する極限における、ランダム特徴量(RF)モデルのこのトレーニングモータリティを正確に評価する。ソボレフトレーニングのモデルは、勾配データを有限次元部分空間にスケッチすることで実践的な実装を反映する。統計物理学からのレプリカ法と演算子値自由確率理論の線形化を組み合わせることにより、訓練されたRFモデルの一般化誤差に対する閉形式記述を導出する。単一インデックスモデルにより記述された対象関数に対して、追加の勾配データによる補足関数データが予測性能を普遍的に改善しないことを示す。むしろ、過パラメータ化の度合いは、訓練方法の選択を知らせるべきである。より広範に、ノイズ関数と勾配データを補間することにより、モデルが最適に機能する設定を同定する。

論文の概要: Precise asymptotic analysis of Sobolev training for random feature models

関連論文リスト