Fugu-MT 論文翻訳(概要): Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models

論文の概要: Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models

arxiv url: http://arxiv.org/abs/2505.21475v1
Date: Tue, 27 May 2025 17:47:26 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-28 17:05:58.842257
Title: Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models
Title（参考訳）: 実数値マルチインデックスモデルのロバスト学習のためのアルゴリズムとSQ下界
Authors: Ilias Diakonikolas, Giannis Iakovidis, Daniel M. Kane, Lisheng Ren,
Abstract要約: ガウス分布に基づく実数値マルチインデックスモデル(MIM)の学習の複雑さについて検討する。 K$-MIM は関数 $f:mathbbRdto mathbbR$ であり、入力の$K$-次元部分空間への射影のみに依存する。逆ラベルノイズが存在する場合でも, 正方形損失に対して幅広いMIMを学習するための一般アルゴリズムを提案する。
参考スコア（独自算出の注目度）: 34.196233651364615
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We study the complexity of learning real-valued Multi-Index Models (MIMs) under the Gaussian distribution. A $K$-MIM is a function $f:\mathbb{R}^d\to \mathbb{R}$ that depends only on the projection of its input onto a $K$-dimensional subspace. We give a general algorithm for PAC learning a broad class of MIMs with respect to the square loss, even in the presence of adversarial label noise. Moreover, we establish a nearly matching Statistical Query (SQ) lower bound, providing evidence that the complexity of our algorithm is qualitatively optimal as a function of the dimension. Specifically, we consider the class of bounded variation MIMs with the property that degree at most $m$ distinguishing moments exist with respect to projections onto any subspace. In the presence of adversarial label noise, the complexity of our learning algorithm is $d^{O(m)}2^{\mathrm{poly}(K/\epsilon)}$. For the realizable and independent noise settings, our algorithm incurs complexity $d^{O(m)}2^{\mathrm{poly}(K)}(1/\epsilon)^{O(K)}$. To complement our upper bound, we show that if for some subspace degree-$m$ distinguishing moments do not exist, then any SQ learner for the corresponding class of MIMs requires complexity $d^{\Omega(m)}$. As an application, we give the first efficient learner for the class of positive-homogeneous $L$-Lipschitz $K$-MIMs. The resulting algorithm has complexity $\mathrm{poly}(d) 2^{\mathrm{poly}(KL/\epsilon)}$. This gives a new PAC learning algorithm for Lipschitz homogeneous ReLU networks with complexity independent of the network size, removing the exponential dependence incurred in prior work.
Abstract（参考訳）: ガウス分布に基づく実数値マルチインデックスモデル(MIM)の学習の複雑さについて検討する。 K$-MIM は函数 $f:\mathbb{R}^d\to \mathbb{R}$ であり、$K$-次元部分空間への入力の射影にのみ依存する。逆ラベルノイズが存在する場合でも, 正方形損失に対して幅広いMIMを学習するための一般アルゴリズムを提案する。さらに、ほぼ一致する統計的クエリ(SQ)の下限を確立し、このアルゴリズムの複雑さが次元の関数として質的に最適であることを示す。具体的には、任意の部分空間への射影に関して、少なくとも$m$の微分モーメントを持つような性質を持つ有界変分MIMのクラスを考える。逆ラベルノイズが存在する場合、学習アルゴリズムの複雑さは$d^{O(m)}2^{\mathrm{poly}(K/\epsilon)}$である。実現可能かつ独立なノイズ設定に対して、我々のアルゴリズムは複雑さを$d^{O(m)}2^{\mathrm{poly}(K)}(1/\epsilon)^{O(K)}$に導く。上界を補完するために、ある部分空間次数-$m$ のモーメントが存在しない場合、対応する MIM のクラスに対する任意の SQ 学習者は、複雑性$d^{\Omega(m)}$ を必要とする。アプリケーションとして、正ホモジニアス$L$-Lipschitz$K$-MIMのクラスに対して、最初の効率的な学習者を与える。結果として得られるアルゴリズムは複雑さ$\mathrm{poly}(d) 2^{\mathrm{poly}(KL/\epsilon)}$である。これにより、ネットワークサイズに依存しない複雑性を持つリプシッツ均質ReLUネットワークに対して、新しいPAC学習アルゴリズムが提供される。

論文の概要: Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models

関連論文リスト