Fugu-MT 論文翻訳(概要): Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling

論文の概要: Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling

arxiv url: http://arxiv.org/abs/2212.00570v2
Date: Sun, 14 Apr 2024 19:57:21 GMT
ステータス: 翻訳完了
システム内更新日: 2024-04-17 00:36:54.884022
Title: Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling
Title（参考訳）: 制約サンプリングのためのLangevin Monte Carloアルゴリズム
Authors: Mert Gürbüzbalaban, Yuanhan Hu, Lingjiong Zhu,
Abstract要約: 目的が目標分布である$pi(x)prop ef(x)$から$x$が制約されたときにサンプリングする制約付きサンプリング問題を考える。ペナルティ法によって動機付けられた制約付き問題を,制約違反に対するペナルティ関数を導入することにより,非制約サンプリング問題に変換する。 PSGLD と PSGULMC の場合、$tildemathcalO(d/varepsilon18)$ が強凸で滑らかであるとき、$tildemathcalO(d/varepsilon) を得る。
参考スコア（独自算出の注目度）: 17.832449046193933
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We consider the constrained sampling problem where the goal is to sample from a target distribution $\pi(x)\propto e^{-f(x)}$ when $x$ is constrained to lie on a convex body $\mathcal{C}$. Motivated by penalty methods from continuous optimization, we propose penalized Langevin Dynamics (PLD) and penalized underdamped Langevin Monte Carlo (PULMC) methods that convert the constrained sampling problem into an unconstrained sampling problem by introducing a penalty function for constraint violations. When $f$ is smooth and gradients are available, we get $\tilde{\mathcal{O}}(d/\varepsilon^{10})$ iteration complexity for PLD to sample the target up to an $\varepsilon$-error where the error is measured in the TV distance and $\tilde{\mathcal{O}}(\cdot)$ hides logarithmic factors. For PULMC, we improve the result to $\tilde{\mathcal{O}}(\sqrt{d}/\varepsilon^{7})$ when the Hessian of $f$ is Lipschitz and the boundary of $\mathcal{C}$ is sufficiently smooth. To our knowledge, these are the first convergence results for underdamped Langevin Monte Carlo methods in the constrained sampling that handle non-convex $f$ and provide guarantees with the best dimension dependency among existing methods with deterministic gradient. If unbiased stochastic estimates of the gradient of $f$ are available, we propose PSGLD and PSGULMC methods that can handle stochastic gradients and are scaleable to large datasets without requiring Metropolis-Hasting correction steps. For PSGLD and PSGULMC, when $f$ is strongly convex and smooth, we obtain $\tilde{\mathcal{O}}(d/\varepsilon^{18})$ and $\tilde{\mathcal{O}}(d\sqrt{d}/\varepsilon^{39})$ iteration complexity in W2 distance. When $f$ is smooth and can be non-convex, we provide finite-time performance bounds and iteration complexity results. Finally, we illustrate the performance on Bayesian LASSO regression and Bayesian constrained deep learning problems.
Abstract（参考訳）: 対象分布 $\pi(x)\propto e^{-f(x)}$ が凸体 $\mathcal{C}$ 上にあるとき、目的が対象分布 $\pi(x)\propto e^{-f(x)} からサンプリングすることであるような制約付きサンプリング問題を考える。ペナルティ法を連続最適化から動機付け,制約違反に対するペナルティ関数を導入して,制約サンプリング問題を非制約サンプリング問題に変換する,ペナルティ付きランゲヴィン・ダイナミクス(PLD)およびペナルティ付きアンダーダム型ランゲヴィン・モンテカルロ(PULMC)手法を提案する。 f$がスムーズでグラデーションが利用できる場合、PDDがターゲットを最大で$\varepsilon$-errorまでサンプリングするのに、$\tilde{\mathcal{O}}(d/\varepsilon^{10})$イテレーションの複雑さがあり、テレビ距離でエラーが測定され、$\tilde{\mathcal{O}}(\cdot)$が対数要素を隠す。 PULMC に対して、$f の Hessian が Lipschitz であり、$\mathcal{C}$ の境界が十分に滑らかであるとき、 $\tilde{\mathcal{O}}(\sqrt{d}/\varepsilon^{7})$ に改善する。我々の知る限り、これらは非凸$f$を処理し、決定論的勾配を持つ既存の方法の中で最高の次元依存性を持つ保証を与える制約付きサンプリングにおいて、アンダーダムされたランゲヴィン・モンテカルロ法に対する最初の収束結果である。もし、$f$の勾配のバイアスのない確率的推定が利用可能であれば、確率的勾配を扱えるPSGLDおよびPSGULMC法を提案し、メトロポリス・ハスティング補正ステップを必要とせずに大規模データセットに拡張可能である。 PSGLD と PSGULMC に対して、$f$ が強凸かつ滑らかであるとき、W2 距離における反復複雑性$ $\tilde{\mathcal{O}}(d/\varepsilon^{18})$ と $\tilde{\mathcal{O}}(d\sqrt{d}/\varepsilon^{39}) を得る。 f$ が滑らかで非凸であれば、有限時間の性能境界とイテレーションの複雑さの結果を提供する。最後に,ベイジアンLASSO回帰とベイジアン制約によるディープラーニング問題の性能について述べる。

論文の概要: Penalized Overdamped and Underdamped Langevin Monte Carlo Algorithms for Constrained Sampling

関連論文リスト