Fugu-MT paper translation (abstract): Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

Paper overview: Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

arxiv url: http://arxiv.org/abs/2605.15082v1
Date: Thu, 14 May 2026 17:05:30 GMT
Status: translation complete
Last updated in system: 2026-05-15 21:45:34.967039
Title: Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models
Title (reference translation): The average gradient outer product in kernel regression reliably recovers the central subspace of multi-index models
Authors: Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy, Maryam Fazel
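Abstract summary: We study a situation in which a learned predictor can discover useful low-dimensional structure in data while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial $f^*(x)=h(Ux)$, with $U\in\mathbb{R}^{r\times d}$ and $r\ll d$, from finitely many data/label pairs. We show that if a low-degree-$p$ component of $f^*$ already carries all relevant directions for prediction, subspace recovery occurs in the much lower sample regime $n\asymp d^{p+\delta}$.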
Reference score (independently computed attention score): 21.46621487100042
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We study a prototypical situation when a learned predictor can discover useful low-dimensional structure in data, while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial $f^*(x)=h(Ux)$, with $U\in\mathbb{R}^{r\times d}$ and $r\ll d$, from finitely many data/label pairs. Importantly, the target function depends on input $x$ only through the projection onto an unknown $r$-dimensional central subspace. The algorithm we analyze is appealingly simple: fit kernel ridge regression (KRR) to the data and compute the Average Gradient Outer Product (AGOP) from the fitted predictor. Our main results show that under reasonable assumptions the top $r$-dimensional eigenspace of AGOP provably recovers the central subspace, even in regimes when the prediction error remains large. Specifically, if the target function $f^*$ has degree $p^*$, it is known that $n\asymp d^{p^*}$ samples are necessary for KRR to achieve accurate prediction. In contrast, we show that if a low degree $p$ component of $f^*$ already carries all relevant directions for prediction, subspace recovery occurs in the much lower sample regime $n\asymp d^{p+\delta}$ for any $\delta\in(0,1)$. Our results thus demonstrate a separation between prediction and representation, and provide an explanation for why iterative kernel methods such as Recursive Feature Machines (RFM) can be sample-efficient in practice.
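Abstract (reference translation): We study a prototypical situation in which a learned predictor can discover useful low-dimensional structure in data while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial $f^*(x)=h(Ux)$, with $U\in\mathbb{R}^{r\times d}$ and $r\ll d$, from finitely many data/label pairs. Importantly, the target function depends on the input $x$ only through its projection onto an unknown $r$-dimensional central subspace. The algorithm analyzed is simple: fit kernel ridge regression (KRR) to the data and compute the Average Gradient Outer Product (AGOP) from the fitted predictor. The main results show that, under reasonable assumptions, the top $r$-dimensional eigenspace of the AGOP provably recovers the central subspace, even in regimes where the prediction error remains large. Specifically, if the target function $f^*$ has degree $p^*$, it is known that $n\asymp d^{p^*}$ samples are necessary for KRR to achieve accurate prediction. In contrast, if a low-degree-$p$ component of $f^*$ already carries all relevant directions for prediction, subspace recovery occurs in the much lower sample regime $n\asymp d^{p+\delta}$ for any $\delta\in(0,1)$. The results thus demonstrate a separation between prediction and representation, and help explain why iterative kernel methods such as Recursive Feature Machines (RFM) can be sample-efficient in practice.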
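
The three-step procedure named in the abstract (fit KRR, form the AGOP of the fitted predictor, take its top $r$-dimensional eigenspace) can be sketched in NumPy as below. This is a minimal illustration, not the authors' implementation. The Gaussian kernel, the bandwidth, the ridge value, the toy target $h(z)=z+z^2$, and all function names are assumptions made here for illustration; the abstract does not specify them. The AGOP is taken in its usual empirical form, the average of $\nabla\hat f(x_i)\nabla\hat f(x_i)^\top$ over the training points. The toy problem is low-dimensional and only shows the mechanics; it does not reproduce the high-dimensional sample regimes discussed in the paper.

```python
import numpy as np

def gaussian_kernel(A, B, bandwidth):
    """Gaussian kernel matrix k(a, b) = exp(-||a - b||^2 / (2 * bandwidth^2))."""
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def fit_krr(X, y, bandwidth, ridge):
    """Solve (K + ridge * I) alpha = y for the KRR coefficients alpha."""
    K = gaussian_kernel(X, X, bandwidth)
    return np.linalg.solve(K + ridge * np.eye(len(X)), y)

def krr_gradients(X_eval, X_train, alpha, bandwidth):
    """Gradients of f(x) = sum_j alpha_j k(x, x_j) at each row of X_eval."""
    K = gaussian_kernel(X_eval, X_train, bandwidth)   # shape (m, n)
    W = K * alpha[None, :]                            # W_ij = alpha_j k(x_i, x_j)
    # grad f(x_i) = sum_j W_ij (x_j - x_i) / bandwidth^2
    return (W @ X_train - W.sum(1, keepdims=True) * X_eval) / bandwidth**2

def agop(grads):
    """Empirical AGOP: average of the outer products of the gradient rows."""
    return grads.T @ grads / grads.shape[0]

def top_eigenspace(M, r):
    """Orthonormal basis (d x r) of the top-r eigenspace of a symmetric matrix."""
    _, eigvecs = np.linalg.eigh(M)        # eigenvalues come back in ascending order
    return eigvecs[:, ::-1][:, :r]

# Toy multi-index problem (assumed for illustration): f*(x) = h(Ux), r = 1.
rng = np.random.default_rng(0)
d, r, n = 6, 1, 400
U = np.zeros((r, d))
U[0, 0] = 1.0                             # hidden direction is the first axis
X = rng.standard_normal((n, d))
z = X @ U.T
y = z[:, 0] + z[:, 0] ** 2                # h(z) = z + z^2

bandwidth = np.sqrt(d)                    # assumed heuristic, not from the paper
alpha = fit_krr(X, y, bandwidth, ridge=1e-3)
G = krr_gradients(X, X, alpha, bandwidth)
V = top_eigenspace(agop(G), r)

# Alignment in [0, 1]: 1 means span(V) equals the row space of U.
alignment = np.linalg.norm(U @ V) / np.sqrt(r)
print("subspace alignment:", alignment)
```

An alignment close to 1 indicates that the top eigenspace of the AGOP has picked out the hidden direction. Methods such as RFM, mentioned in the abstract, iterate this kind of step; the sketch above performs it only once.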


Related papers