Fugu-MT Paper Translation (Abstract): Pseudonorm Approachability and Applications to Regret Minimization

Paper overview: Pseudonorm Approachability and Applications to Regret Minimization

arxiv url: http://arxiv.org/abs/2302.01517v1
Date: Fri, 3 Feb 2023 03:19:14 GMT
Status: Translation complete
Last updated in system: 2023-02-06 17:41:35.991254
Title: Pseudonorm Approachability and Applications to Regret Minimization
Title (reference translation): Pseudonorm Approachability and Applications to Regret Minimization
Authors: Christoph Dann, Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan
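Abstract summary: We convert high-dimensional $\ell_\infty$-approachability problems into low-dimensional pseudonorm approachability problems. We develop an algorithmic theory of pseudonorm approachability, analogous to previous work on approachability for $\ell_2$ and other norms.
Reference score (independently computed attention score): 73.54127663296906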
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Blackwell's celebrated approachability theory provides a general framework for a variety of learning problems, including regret minimization. However, Blackwell's proof and implicit algorithm measure approachability using the $\ell_2$ (Euclidean) distance. We argue that in many applications such as regret minimization, it is more useful to study approachability under other distance metrics, most commonly the $\ell_\infty$-metric. But, the time and space complexity of the algorithms designed for $\ell_\infty$-approachability depend on the dimension of the space of the vectorial payoffs, which is often prohibitively large. Thus, we present a framework for converting high-dimensional $\ell_\infty$-approachability problems to low-dimensional pseudonorm approachability problems, thereby resolving such issues. We first show that the $\ell_\infty$-distance between the average payoff and the approachability set can be equivalently defined as a pseudodistance between a lower-dimensional average vector payoff and a new convex set we define. Next, we develop an algorithmic theory of pseudonorm approachability, analogous to previous work on approachability for $\ell_2$ and other norms, showing that it can be achieved via online linear optimization (OLO) over a convex set given by the Fenchel dual of the unit pseudonorm ball. We then use that to show, modulo mild normalization assumptions, that there exists an $\ell_\infty$-approachability algorithm whose convergence is independent of the dimension of the original vectorial payoff. We further show that that algorithm admits a polynomial-time complexity, assuming that the original $\ell_\infty$-distance can be computed efficiently. We also give an $\ell_\infty$-approachability algorithm whose convergence is logarithmic in that dimension using an FTRL algorithm with a maximum-entropy regularizer.
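Abstract (reference translation): Blackwell's celebrated approachability theory provides a general framework for a variety of learning problems, including regret minimization. However, Blackwell's proof and implicit algorithm measure approachability using the $\ell_2$ (Euclidean) distance. The authors argue that in many applications, such as regret minimization, it is more useful to study approachability under other distance metrics, most commonly the $\ell_\infty$-metric. The time and space complexity of algorithms designed for $\ell_\infty$-approachability, however, depend on the dimension of the space of vectorial payoffs, which is often prohibitively large. The paper therefore proposes a framework for converting high-dimensional $\ell_\infty$-approachability problems into low-dimensional pseudonorm approachability problems. It first shows that the $\ell_\infty$-distance between the average payoff and the approachability set can be equivalently defined as a pseudodistance between a lower-dimensional average vector payoff and a newly defined convex set. It then develops an algorithmic theory of pseudonorm approachability, analogous to previous work on approachability for $\ell_2$ and other norms, and shows that it can be achieved via online linear optimization (OLO) over a convex set given by the Fenchel dual of the unit pseudonorm ball. Using this, and modulo mild normalization assumptions, it shows that there exists an $\ell_\infty$-approachability algorithm whose convergence is independent of the dimension of the original vectorial payoff. It further shows that this algorithm has polynomial-time complexity, assuming that the original $\ell_\infty$-distance can be computed efficiently. Finally, it gives an $\ell_\infty$-approachability algorithm whose convergence is logarithmic in that dimension, using an FTRL algorithm with a maximum-entropy regularizer.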
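The abstract's last sentence refers to FTRL with a maximum-entropy regularizer. As a rough illustration only, and not the paper's algorithm, the sketch below runs the textbook version of that technique on the probability simplex, where the FTRL iterate has the closed form of exponential weights. The function names (`ftrl_entropy_weights`, `run_ftrl`), the loss sequence, and the step size are assumptions chosen for the example; the paper applies the idea to a different convex set that is not reproduced here. With losses in [0, 1], this textbook variant has regret at most sqrt(T ln(d) / 2), which is one standard way a logarithmic dependence on the dimension d can arise.

```python
import math


def ftrl_entropy_weights(cumulative_losses, eta):
    """FTRL iterate on the simplex with a negative-entropy regularizer.

    Solves argmin_w <w, L> + (1/eta) * sum_i w_i log w_i over the simplex,
    whose closed form is w_i proportional to exp(-eta * L_i).
    """
    shift = min(cumulative_losses)  # subtract the minimum for numerical stability
    unnormalized = [math.exp(-eta * (loss - shift)) for loss in cumulative_losses]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


def run_ftrl(loss_vectors, eta):
    """Play FTRL against a fixed sequence of loss vectors (hypothetical helper).

    Returns the regret against the best fixed coordinate and the final weights.
    """
    d = len(loss_vectors[0])
    cumulative = [0.0] * d
    learner_loss = 0.0
    for loss in loss_vectors:
        # Weights depend only on losses observed in earlier rounds.
        w = ftrl_entropy_weights(cumulative, eta)
        learner_loss += sum(wi * li for wi, li in zip(w, loss))
        cumulative = [c + l for c, l in zip(cumulative, loss)]
    regret = learner_loss - min(cumulative)
    return regret, ftrl_entropy_weights(cumulative, eta)


if __name__ == "__main__":
    T, d = 200, 4
    # Arbitrary deterministic losses in {0, 0.5, 1}, used only for illustration.
    losses = [[((t * (i + 1)) % 3) / 2.0 for i in range(d)] for t in range(T)]
    eta = math.sqrt(8 * math.log(d) / T)
    regret, weights = run_ftrl(losses, eta)
    print("regret:", regret, "bound:", math.sqrt(T * math.log(d) / 2))
```

The step size `eta = sqrt(8 ln(d) / T)` is the usual tuning for this bound when the horizon T is known in advance; other choices trade off the two terms of the bound differently.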

Related papers