Fugu-MT 論文翻訳(概要): Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-Supervised Learning

論文の概要: Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-Supervised Learning

arxiv url: http://arxiv.org/abs/2605.07557v2
Date: Mon, 11 May 2026 02:23:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 19:24:01.356321
Title: Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-Supervised Learning
Title（参考訳）: 分布推定を超えて:Universal Semi-Supervised Learningに向けたSimplex Anchored構造推論
Authors: Yaxin Hou, Jun Ma, Hanyang Li, Bo Han, Jie Yu, Yuheng Jia,
Abstract要約: 我々は、Universal Semi-supervised Learning (UniSSL)として批判的だが未探索のパラダイムを定式化する。既存の手法は、通常擬似ラベルでラベル付けされていないデータを利用する。我々は、表現レベルの構造推論からバイパス分布推定へ焦点を移す。
参考スコア（独自算出の注目度）: 47.92504362652887
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Semi-supervised learning faces significant challenges in realistic scenarios where labeled data is scarce and unlabeled data follows unknown, arbitrary distributions. We formalize this critical yet under-explored paradigm as Universal Semi-supervised Learning (UniSSL). Existing methods typically leverage unlabeled data via pseudo-labeling. However, they often rely on the idealized assumption of a uniform unlabeled data distribution or require sufficient labeled data to estimate it. In the UniSSL setting, such dependencies lead to numerous erroneous pseudo-labels, thereby triggering representation confusion. Fortunately, we observe that inter-sample relations captured by representations are more reliable than pseudo-labels. Leveraging this insight, we shift our focus to representation-level structural inference to bypass distribution estimation. Accordingly, we propose Simplex Anchored Graph-state Equipartition (SAGE), which captures high-order inter-sample dependencies to establish structural consensus for guiding representation learning. Meanwhile, to mitigate representation confusion, we employ vectors that satisfy a simplex equiangular tight frame to serve as a coordinate frame for guiding inter-class representation separation. Finally, we introduce a weighting strategy based on distribution-agnostic metrics to prioritize reliable pseudo-labels and an auxiliary branch to isolate potentially erroneous pseudo-labels. Evaluations on five standard benchmarks show that SAGE consistently outperforms state-of-the-art methods, with an average accuracy gain of $\textbf{8.52\%}$.
Abstract（参考訳）: 半教師付き学習は、ラベル付きデータが不足し、ラベルなしデータが未知の任意の分布に従う現実的なシナリオにおいて重大な課題に直面している。我々はこの批判的かつ未探索のパラダイムをUniSSL(Universal Semi-supervised Learning)として定式化する。既存の手法は、通常擬似ラベルでラベル付けされていないデータを利用する。しかし、彼らはしばしば、統一されたラベルのないデータ分布の理想的な仮定に頼るか、それを見積もるために十分なラベル付きデータを必要とする。 UniSSL設定では、そのような依存関係は多くの誤った擬似ラベルをもたらし、表現の混乱を引き起こす。幸いなことに、表現によってキャプチャされたサンプル間の関係は擬似ラベルよりも信頼性が高い。この知見を生かして、我々は表現レベルの構造推論からバイパス分布推定へと焦点を移した。そこで我々は,SAGE (Simplex Anchored Graph-state Equipartition) を提案する。一方、表現の混乱を軽減するために、クラス間表現分離を導くための座標フレームとして、単純な等角的タイトフレームを満たすベクトルを用いる。最後に,分布に依存しない指標に基づく重み付け手法を導入し,信頼性の高い擬似ラベルを優先し,誤用する可能性のある擬似ラベルを分離する補助的ブランチを導入する。 5つの標準ベンチマークの評価によると、SAGEは最先端の手法を一貫して上回り、平均精度は$\textbf{8.52\%}$である。

論文の概要: Beyond Distribution Estimation: Simplex Anchored Structural Inference Towards Universal Semi-Supervised Learning

関連論文リスト