Fugu-MT 論文翻訳(概要): The Complexity of Finding Local Optima in Contrastive Learning

論文の概要: The Complexity of Finding Local Optima in Contrastive Learning

arxiv url: http://arxiv.org/abs/2509.16898v1
Date: Sun, 21 Sep 2025 03:21:04 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-23 18:58:16.027657
Title: The Complexity of Finding Local Optima in Contrastive Learning
Title（参考訳）: コントラスト学習における局所最適点探索の複雑さ
Authors: Jingming Yan, Yiyuan Luo, Vaggos Chatziafratis, Ioannis Panageas, Parnian Shahkar, Stelios Stavroulakis,
Abstract要約: 個別設定で$mathsfPLS$-hardness、連続設定で$mathsfPLS$-hardnessを証明します。この結果から,アルゴリズムが様々なコントラスト学習問題の局所的最適解を見つけることは不可能であることが示唆された。
参考スコア（独自算出の注目度）: 18.910128965812124
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Contrastive learning is a powerful technique for discovering meaningful data representations by optimizing objectives based on $\textit{contrastive information}$, often given as a set of weighted triplets $\{(x_i, y_i^+, z_{i}^-)\}_{i = 1}^m$ indicating that an "anchor" $x_i$ is more similar to a "positive" example $y_i$ than to a "negative" example $z_i$. The goal is to find representations (e.g., embeddings in $\mathbb{R}^d$ or a tree metric) where anchors are placed closer to positive than to negative examples. While finding $\textit{global}$ optima of contrastive objectives is $\mathsf{NP}$-hard, the complexity of finding $\textit{local}$ optima -- representations that do not improve by local search algorithms such as gradient-based methods -- remains open. Our work settles the complexity of finding local optima in various contrastive learning problems by proving $\mathsf{PLS}$-hardness in discrete settings (e.g., maximize satisfied triplets) and $\mathsf{CLS}$-hardness in continuous settings (e.g., minimize Triplet Loss), where $\mathsf{PLS}$ (Polynomial Local Search) and $\mathsf{CLS}$ (Continuous Local Search) are well-studied complexity classes capturing local search dynamics in discrete and continuous optimization, respectively. Our results imply that no polynomial time algorithm (local search or otherwise) can find a local optimum for various contrastive learning problems, unless $\mathsf{PLS}\subseteq\mathsf{P}$ (or $\mathsf{CLS}\subseteq \mathsf{P}$ for continuous problems). Even in the unlikely scenario that $\mathsf{PLS}\subseteq\mathsf{P}$ (or $\mathsf{CLS}\subseteq \mathsf{P}$), our reductions imply that there exist instances where local search algorithms need exponential time to reach a local optimum, even for $d=1$ (embeddings on a line).
Abstract（参考訳）: コントラスト学習(Contrastive learning)は、$\textit{contrastive information}$に基づいて目的を最適化することで有意義なデータ表現を発見する強力なテクニックであり、しばしば重み付き三重項の集合として与えられる$\{(x_i, y_i^+, z_{i}^-)\}_{i = 1}^m$は、"正"の例よりも"負"の例である$y_i$に近いことを示す。目的は、表現(例えば $\mathbb{R}^d$ または木計量への埋め込み)を見つけることであり、アンカーは負の例よりも正に近い位置に置かれる。対照的に、$\textit{global}$ optima は $\mathsf{NP}$-hard であるが、$\textit{local}$ optima を見つける複雑さは、勾配ベースのメソッドのような局所的な検索アルゴリズムによって改善されない。我々の研究は、離散的な設定(例えば、満足度の高い三重項)における$-hardnessと連続的な設定(例えば、トリプレット損失を最小化する)における$\mathsf{PLS}と$\mathsf{CLS}と$\mathsf{CLS}は、それぞれ離散的および連続的な最適化における局所的な探索ダイナミクスをよく研究した複雑性クラスである。我々の結果は, 多項式時間アルゴリズム(局所探索等)が, 連続問題に対して$\mathsf{PLS}\subseteq\mathsf{P}$ (または$\mathsf{CLS}\subseteq \mathsf{P}$) でない限り, 様々なコントラスト学習問題に対して局所的な最適解を求めることができないことを示唆している。不可能なシナリオでは、$\mathsf{PLS}\subseteq\mathsf{P}$ (または$\mathsf{CLS}\subseteq \mathsf{P}$) であっても、我々の減少は、局所探索アルゴリズムが局所最適点に到達するのに指数時間を必要とするインスタンスが存在することを意味する。

論文の概要: The Complexity of Finding Local Optima in Contrastive Learning

関連論文リスト