Statistical and Algorithmic Insights for Semi-supervised Learning with
Self-training
- URL: http://arxiv.org/abs/2006.11006v1
- Date: Fri, 19 Jun 2020 08:09:07 GMT
- Title: Statistical and Algorithmic Insights for Semi-supervised Learning with
Self-training
- Authors: Samet Oymak, Talha Cihad Gulcu
- Abstract summary: Self-training is a classical approach in semi-supervised learning.
We show that self-training iterations gracefully improve the model accuracy even if they do get stuck in sub-optimal fixed points.
We then establish a connection between self-training based semi-supervision and the more general problem of learning with heterogenous data.
- Score: 30.866440916522826
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Self-training is a classical approach in semi-supervised learning which is
successfully applied to a variety of machine learning problems. Self-training
algorithm generates pseudo-labels for the unlabeled examples and progressively
refines these pseudo-labels which hopefully coincides with the actual labels.
This work provides theoretical insights into self-training algorithm with a
focus on linear classifiers. We first investigate Gaussian mixture models and
provide a sharp non-asymptotic finite-sample characterization of the
self-training iterations. Our analysis reveals the provable benefits of
rejecting samples with low confidence and demonstrates that self-training
iterations gracefully improve the model accuracy even if they do get stuck in
sub-optimal fixed points. We then demonstrate that regularization and class
margin (i.e. separation) is provably important for the success and lack of
regularization may prevent self-training from identifying the core features in
the data. Finally, we discuss statistical aspects of empirical risk
minimization with self-training for general distributions. We show how a purely
unsupervised notion of generalization based on self-training based clustering
can be formalized based on cluster margin. We then establish a connection
between self-training based semi-supervision and the more general problem of
learning with heterogenous data and weak supervision.
Related papers
- Incremental Self-training for Semi-supervised Learning [56.57057576885672]
IST is simple yet effective and fits existing self-training-based semi-supervised learning methods.
We verify the proposed IST on five datasets and two types of backbone, effectively improving the recognition accuracy and learning speed.
arXiv Detail & Related papers (2024-04-14T05:02:00Z) - Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation [63.180725016463974]
Cross-modal retrieval relies on well-matched large-scale datasets that are laborious in practice.
We introduce a novel noisy correspondence learning framework, namely textbfSelf-textbfReinforcing textbfErrors textbfMitigation (SREM)
arXiv Detail & Related papers (2023-12-27T09:03:43Z) - Self-Training: A Survey [5.772546394254112]
Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations.
Among the existing techniques, self-training methods have undoubtedly attracted greater attention in recent years.
We present self-training methods for binary and multi-class classification; as well as their variants and two related approaches.
arXiv Detail & Related papers (2022-02-24T11:40:44Z) - How does unlabeled data improve generalization in self-training? A
one-hidden-layer theoretical analysis [93.37576644429578]
This work establishes the first theoretical analysis for the known iterative self-training paradigm.
We prove the benefits of unlabeled data in both training convergence and generalization ability.
Experiments from shallow neural networks to deep neural networks are also provided to justify the correctness of our established theoretical insights on self-training.
arXiv Detail & Related papers (2022-01-21T02:16:52Z) - Contrastive Regularization for Semi-Supervised Learning [46.020125061295886]
We propose contrastive regularization to improve both efficiency and accuracy of the consistency regularization by well-clustered features of unlabeled data.
Our method also shows robust performance on open-set semi-supervised learning where unlabeled data includes out-of-distribution samples.
arXiv Detail & Related papers (2022-01-17T07:20:11Z) - Hybrid Dynamic Contrast and Probability Distillation for Unsupervised
Person Re-Id [109.1730454118532]
Unsupervised person re-identification (Re-Id) has attracted increasing attention due to its practical application in the read-world video surveillance system.
We present the hybrid dynamic cluster contrast and probability distillation algorithm.
It formulates the unsupervised Re-Id problem into an unified local-to-global dynamic contrastive learning and self-supervised probability distillation framework.
arXiv Detail & Related papers (2021-09-29T02:56:45Z) - Theoretical Analysis of Self-Training with Deep Networks on Unlabeled
Data [48.4779912667317]
Self-training algorithms have been very successful for learning with unlabeled data using neural networks.
This work provides a unified theoretical analysis of self-training with deep networks for semi-supervised learning, unsupervised domain adaptation, and unsupervised learning.
arXiv Detail & Related papers (2020-10-07T19:43:55Z) - Uncertainty-aware Self-training for Text Classification with Few Labels [54.13279574908808]
We study self-training as one of the earliest semi-supervised learning approaches to reduce the annotation bottleneck.
We propose an approach to improve self-training by incorporating uncertainty estimates of the underlying neural network.
We show our methods leveraging only 20-30 labeled samples per class for each task for training and for validation can perform within 3% of fully supervised pre-trained language models.
arXiv Detail & Related papers (2020-06-27T08:13:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.