Fugu-MT 論文翻訳(概要): The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

論文の概要: The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

arxiv url: http://arxiv.org/abs/2511.03554v1
Date: Wed, 05 Nov 2025 15:35:46 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-06 18:19:32.466861
Title: The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits
Title（参考訳）: クロスバリデーション誤差の構造:安定性、共分散およびミニマックス限界
Authors: Ido Nachum, Rüdiger Urbanke, Thomas Weinberger,
Abstract要約: アルゴリズム分布対の性質が$k$-foldクロスバリデーションにおける折りたたみ数の選択にどのように影響するかを示す。また、CVが$n$の検証セットによって1/n$達成可能なオーダーの最適値を得ることができないことも証明する。
参考スコア（独自算出の注目度）: 3.3008315224941978
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Despite ongoing theoretical research on cross-validation (CV), many theoretical questions about CV remain widely open. This motivates our investigation into how properties of algorithm-distribution pairs can affect the choice for the number of folds in $k$-fold cross-validation. Our results consist of a novel decomposition of the mean-squared error of cross-validation for risk estimation, which explicitly captures the correlations of error estimates across overlapping folds and includes a novel algorithmic stability notion, squared loss stability, that is considerably weaker than the typically required hypothesis stability in other comparable works. Furthermore, we prove: 1. For every learning algorithm that minimizes empirical error, a minimax lower bound on the mean-squared error of $k$-fold CV estimating the population risk $L_\mathcal{D}$: \[ \min_{k \mid n}\; \max_{\mathcal{D}}\; \mathbb{E}\!\left[\big(\widehat{L}_{\mathrm{CV}}^{(k)} - L_{\mathcal{D}}\big)^{2}\right] \;=\; \Omega\!\big(\sqrt{k}/n\big), \] where $n$ is the sample size and $k$ the number of folds. This shows that even under idealized conditions, for large values of $k$, CV cannot attain the optimum of order $1/n$ achievable by a validation set of size $n$, reflecting an inherent penalty caused by dependence between folds. 2. Complementing this, we exhibit learning rules for which \[ \max_{\mathcal{D}}\; \mathbb{E}\!\left[\big(\widehat{L}_{\mathrm{CV}}^{(k)} - L_{\mathcal{D}}\big)^{2}\right] \;=\; \Omega(k/n), \] matching (up to constants) the accuracy of a hold-out estimator of a single fold of size $n/k$. Together these results delineate the fundamental trade-off in resampling-based risk estimation: CV cannot fully exploit all $n$ samples for unbiased risk evaluation, and its minimax performance is pinned between the $k/n$ and $\sqrt{k}/n$ regimes.
Abstract（参考訳）: CV (cross-validation) に関する理論的研究が進行中であるにもかかわらず、CVに関する多くの理論的疑問が広く開かれている。このことは、アルゴリズム分布対の性質が$k$-foldクロスバリデーションにおける折りたたみ数の選択にどのように影響するかを調査する動機となっている。この結果は、重なり合う折り畳みの誤差推定の相関を明示的に捉え、新しいアルゴリズム的安定性の概念である2乗損失安定性を含む、リスク推定における平均二乗誤差の新たな分解から成り立っている。さらに, 経験的誤差を最小限に抑える学習アルゴリズムについて, 平均二乗誤差の最小値が$k$-fold CVで, 集団リスクを推定する最小値が$L_\mathcal{D}$: \[ \min_{k \mid n}\; \max_{\mathcal{D}}\; \mathbb{E}\! \left[\big(\widehat{L}_{\mathrm{CV}}^{(k)} - L_{\mathcal{D}}\big)^{2}\right] \;=\; \Omega\! \big(\sqrt{k}/n\big), \] ここで$n$はサンプルサイズ、$k$は折りたたみ数である。これは、理想化された条件の下でも、大きな値が$k$であっても、CVは、折り畳み間の依存によって引き起こされる固有のペナルティを反映して、$n$の検証セットによって1/n$の最適値を得ることができないことを示している。 2.これを補完して, \[ \max_{\mathcal{D}}\; \mathbb{E}\! \left[\big(\widehat{L}_{\mathrm{CV}}^{(k)} - L_{\mathcal{D}}\big)^{2}\right] \;=\; \Omega(k/n), \] マッチング(定数まで) サイズ$n/k$のホールドアウト推定子の精度。 CVは非バイアスリスク評価のためにすべての$n$サンプルをフルに活用することができず、そのミニマックス性能は$k/n$と$\sqrt{k}/n$レジームの間で固定される。

論文の概要: The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

関連論文リスト