Fugu-MT 論文翻訳(概要): Machine Learning. The Science of Selection under Uncertainty

論文の概要: Machine Learning. The Science of Selection under Uncertainty

arxiv url: http://arxiv.org/abs/2509.21547v1
Date: Thu, 25 Sep 2025 20:33:13 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:53.989617
Title: Machine Learning. The Science of Selection under Uncertainty
Title（参考訳）: 不確かさ下における選択の科学
Authors: Yevgeny Seldin,
Abstract要約: この本は、不確実性の下での選択結果に関する理論的保証を得るための統計ツールを提供する。この本は、マルコフ、チェビシェフ、ホーフディング、バーンスタイン、経験的バーンスタイン、kl、スプリット-klなど、幅広い不平等をカバーしている。本稿では, 環境フィードバック, 環境抵抗, 構造複雑性を特徴とするオンライン学習問題の空間について述べる。
参考スコア（独自算出の注目度）: 9.594432031144718
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Learning, whether natural or artificial, is a process of selection. It starts with a set of candidate options and selects the more successful ones. In the case of machine learning the selection is done based on empirical estimates of prediction accuracy of candidate prediction rules on some data. Due to randomness of data sampling the empirical estimates are inherently noisy, leading to selection under uncertainty. The book provides statistical tools to obtain theoretical guarantees on the outcome of selection under uncertainty. We start with concentration of measure inequalities, which are the main statistical instrument for controlling how much an empirical estimate of expectation of a function deviates from the true expectation. The book covers a broad range of inequalities, including Markov's, Chebyshev's, Hoeffding's, Bernstein's, Empirical Bernstein's, Unexpected Bernstein's, kl, and split-kl. We then study the classical (offline) supervised learning and provide a range of tools for deriving generalization bounds, including Occam's razor, Vapnik-Chervonenkis analysis, and PAC-Bayesian analysis. The latter is further applied to derive generalization guarantees for weighted majority votes. After covering the offline setting, we turn our attention to online learning. We present the space of online learning problems characterized by environmental feedback, environmental resistance, and structural complexity. A common performance measure in online learning is regret, which compares performance of an algorithm to performance of the best prediction rule in hindsight, out of a restricted set of prediction rules. We present tools for deriving regret bounds in stochastic and adversarial environments, and under full information and bandit feedback.
Abstract（参考訳）: 自然であれ人工であれ、学習は選択の過程である。一連の候補オプションから始まり、より成功したオプションを選択する。機械学習の場合、いくつかのデータ上で候補予測規則の予測精度の実証的な推定に基づいて選択を行う。データサンプリングのランダム性のため、経験的推定は本質的にノイズがあり、不確実性の下での選択につながる。この本は、不確実性の下での選択結果に関する理論的保証を得るための統計ツールを提供する。まず、関数の期待値が真の期待値からどれだけ逸脱するかを測る主要な統計指標である測度不等式(英語版)の集中から始める。この本は、マルコフ、チェビシェフ、ホーフディング、ベルンシュタイン、経験的ベルンシュタイン、未発見ベルンシュタイン、kl、スプリット-klを含む幅広い不等式をカバーしている。次に、古典的(オフライン)教師付き学習を研究し、Occam's razor、Vapnik-Chervonenkis分析、PAC-Bayesian解析など、一般化境界を導出するための様々なツールを提供する。後者は、重み付けされた多数決に対する一般化の保証を導出するためにさらに適用される。オフライン設定をカバーした後、オンライン学習に注意を向けます。本稿では, 環境フィードバック, 環境抵抗, 構造複雑性を特徴とするオンライン学習問題の空間について述べる。オンライン学習における一般的なパフォーマンス指標は後悔であり、アルゴリズムのパフォーマンスを、制約された予測規則のセットから、後見の最良の予測規則のパフォーマンスと比較する。本稿では, 確率的, 敵対的環境における後悔境界の導出と, 完全な情報と包帯的フィードバックの両面から, 後悔境界を導出するためのツールを提案する。

論文の概要: Machine Learning. The Science of Selection under Uncertainty

関連論文リスト