Fugu-MT 論文翻訳(概要): Optimal Rates for Regularized Conditional Mean Embedding Learning

論文の概要: Optimal Rates for Regularized Conditional Mean Embedding Learning

arxiv url: http://arxiv.org/abs/2208.01711v3
Date: Tue, 12 Dec 2023 10:06:41 GMT
ステータス: 翻訳完了
システム内更新日: 2023-12-13 21:03:06.001029
Title: Optimal Rates for Regularized Conditional Mean Embedding Learning
Title（参考訳）: 正規化条件付き平均埋め込み学習のための最適レート
Authors: Zhu Li, Dimitri Meunier, Mattes Mollenhauer, Arthur Gretton
Abstract要約: 経験的CME推定器に対して,不特定条件下での新しい適応的統計的学習率を導出する。我々の解析は、$mathcalH_Y$を有限次元と仮定することなく、最適な$O(log n / n)$レートと一致することを示した。
参考スコア（独自算出の注目度）: 32.870965795423366
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We address the consistency of a kernel ridge regression estimate of the conditional mean embedding (CME), which is an embedding of the conditional distribution of $Y$ given $X$ into a target reproducing kernel Hilbert space $\mathcal{H}_Y$. The CME allows us to take conditional expectations of target RKHS functions, and has been employed in nonparametric causal and Bayesian inference. We address the misspecified setting, where the target CME is in the space of Hilbert-Schmidt operators acting from an input interpolation space between $\mathcal{H}_X$ and $L_2$, to $\mathcal{H}_Y$. This space of operators is shown to be isomorphic to a newly defined vector-valued interpolation space. Using this isomorphism, we derive a novel and adaptive statistical learning rate for the empirical CME estimator under the misspecified setting. Our analysis reveals that our rates match the optimal $O(\log n / n)$ rates without assuming $\mathcal{H}_Y$ to be finite dimensional. We further establish a lower bound on the learning rate, which shows that the obtained upper bound is optimal.
Abstract（参考訳）: 条件付き平均埋め込み (conditional mean embedded, cme) のカーネルリッジ回帰推定(kernel ridge regression estimation)の一貫性について論じる。これは、目標再生成カーネル hilbert space $\mathcal{h}_y$ への$y$ の条件付き分布の埋め込みである。 CMEにより、ターゲットRKHS関数の条件付き期待値が得られ、非パラメトリック因果推定やベイズ推定に採用されている。ここでは、ターゲット CME が入力補間空間から $\mathcal{H}_X$ と $L_2$ を $\mathcal{H}_Y$ に作用するヒルベルト・シュミット作用素の空間にあるような不特定設定に対処する。この作用素空間は新しく定義されたベクトル値補間空間に同型であることが示されている。この同型性を用いて、不特定条件下での経験的CME推定器の新しい適応的統計的学習率を導出する。我々の解析は、$\mathcal{H}_Y$を有限次元と仮定することなく、最適な$O(\log n / n)$レートと一致することを示した。さらに,学習速度の低い上限を確立し,得られた上限値が最適であることを示す。

関連論文リスト

Near-Optimal Clustering in Mixture of Markov Chains [74.3828414695655]
我々は、長さ$H$の軌跡を、大きさ$S$の有限状態空間上の未知のエルゴードマルコフ鎖の1つによって生成される、$T$ trajectories of length $H$の問題を研究する。我々は、連鎖の遷移核間の重み付きKL分散によって支配されるクラスタリングエラー率に基づいて、インスタンス依存で高い確率の低い境界を導出する。次に,新しい2段階クラスタリングアルゴリズムを提案する。
論文参考訳（メタデータ） (2025-06-02T05:10:40Z)
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path [80.60592344361073]
線形混合遷移カーネルを用いた最短経路(SSP)問題について検討する。エージェントは繰り返し環境と対話し、累積コストを最小化しながら特定の目標状態に到達する。既存の作業は、イテレーションコスト関数の厳密な下限や、最適ポリシーに対する期待長の上限を仮定することが多い。
論文参考訳（メタデータ） (2024-02-14T07:52:00Z)
Semidefinite programming relaxations and debiasing for MAXCUT-based clustering [1.9761774213809036]
2つのガウス分布を$mathbbRp$で混合して引き出す小さなデータサンプルを$n$で分割する問題を考察する。グラフ上の最大カットを求めるように定式化された整数二次プログラムの半定値プログラミング緩和を用いる。
論文参考訳（メタデータ） (2024-01-16T03:14:24Z)
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications [2.0584253077707477]
目的関数 $J(cdot)$ の定常点を求めるグラディエント・Descent (SGD) 法の収束特性について検討した。この結果は、すべての定常点が大域最小値である性質を持つ invex' 関数のクラスに適用できる。
論文参考訳（メタデータ） (2023-12-05T15:22:39Z)
Rates of Convergence in Certain Native Spaces of Approximations used in Reinforcement Learning [0.0]
本稿では、再生カーネルヒルベルト空間(RKHS)$H(Omega)$の集合に生じる値関数近似の収束率について研究する。値関数およびコントローラ近似における誤差の上限の明示は、有限次元近似の空間に対して$mathcalP_H,N$で導かれる。
論文参考訳（メタデータ） (2023-09-14T02:02:08Z)
Statistical Learning under Heterogeneous Distribution Shift [71.8393170225794]
ground-truth predictor is additive $mathbbE[mathbfz mid mathbfx,mathbfy] = f_star(mathbfx) +g_star(mathbfy)$.
論文参考訳（メタデータ） (2023-02-27T16:34:21Z)
Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency [53.90687548731265]
本研究では,観測データに基づいて線形関数を推定するための最適手順について検討する。任意の凸および対称函数クラス $mathcalF$ に対して、平均二乗誤差で有界な非漸近局所ミニマックスを導出する。
論文参考訳（メタデータ） (2023-01-16T02:57:37Z)
Optimal policy evaluation using kernel-based temporal difference methods [78.83926562536791]
カーネルヒルベルト空間を用いて、無限水平割引マルコフ報酬過程の値関数を推定する。我々は、関連するカーネル演算子の固有値に明示的に依存した誤差の非漸近上界を導出する。 MRP のサブクラスに対する minimax の下位境界を証明する。
論文参考訳（メタデータ） (2021-09-24T14:48:20Z)
Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally [58.463668865380946]
状態空間 $mathcalS$ を用いたエピソードマルコフ決定過程 (MDPs) における模擬学習の統計的限界について検討する。 rajaraman et al (2020) におけるmdアルゴリズムを用いた準最適性に対する上限 $o(|mathcals|h3/2/n)$ を定式化する。 Omega(H3/2/N)$ $mathcalS|geq 3$ であるのに対して、未知の遷移条件はよりシャープレートに悩まされる。
論文参考訳（メタデータ） (2021-02-25T15:50:19Z)
Nonparametric approximation of conditional expectation operators [0.3655021726150368]
最小の仮定の下で、$[Pf](x) := mathbbE[f(Y) mid X = x ]$ で定義される$L2$-operatorの近似について検討する。我々は、再生されたカーネル空間上で作用するヒルベルト・シュミット作用素により、作用素ノルムにおいて$P$が任意に適切に近似できることを証明した。
論文参考訳（メタデータ） (2020-12-23T19:06:12Z)
Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction [63.41789556777387]
非同期Q-ラーニングはマルコフ決定過程(MDP)の最適行動値関数(またはQ-関数)を学習することを目的としている。 Q-関数の入出力$varepsilon$-正確な推定に必要なサンプルの数は、少なくとも$frac1mu_min (1-gamma)5varepsilon2+ fract_mixmu_min (1-gamma)$の順である。
論文参考訳（メタデータ） (2020-06-04T17:51:00Z)
Learning the Hypotheses Space from data Part II: Convergence and Feasibility [0.0]
学習空間に基づくモデル選択フレームワークの一貫性を示す。データから仮説空間を学習することは可能であることを示す。
論文参考訳（メタデータ） (2020-01-30T21:48:37Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。