Frequency-Aware Self-Supervised Long-Tailed Learning
- URL: http://arxiv.org/abs/2309.04723v2
- Date: Fri, 15 Sep 2023 12:46:57 GMT
- Title: Frequency-Aware Self-Supervised Long-Tailed Learning
- Authors: Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang
- Abstract summary: We propose Frequency-Aware Self-Supervised Learning (FASSL) for learning from unlabeled data with inherent long-tailed distributions.
We first learn frequency-aware prototypes, reflecting the associated long-tailed distribution. Particularly focusing on rare-class samples, the relationships between image data and the derived prototypes are exploited.
- Score: 36.00672675332761
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data collected from the real world typically exhibit long-tailed
distributions, where frequent classes contain abundant data while rare ones
have only a limited number of samples. While existing supervised learning
approaches have been proposed to tackle such data imbalance, the requirement of
label supervision would limit their applicability to real-world scenarios in
which label annotation might not be available. Without the access to class
labels nor the associated class frequencies, we propose Frequency-Aware
Self-Supervised Learning (FASSL) in this paper. Targeting at learning from
unlabeled data with inherent long-tailed distributions, the goal of FASSL is to
produce discriminative feature representations for downstream classification
tasks. In FASSL, we first learn frequency-aware prototypes, reflecting the
associated long-tailed distribution. Particularly focusing on rare-class
samples, the relationships between image data and the derived prototypes are
further exploited with the introduced self-supervised learning scheme.
Experiments on long-tailed image datasets quantitatively and qualitatively
verify the effectiveness of our learning scheme.
Related papers
- Integrating Distribution Matching into Semi-Supervised Contrastive Learning for Labeled and Unlabeled Data [1.0312968200748116]
Semi-supervised contrastive learning (SSL) is highly relevant in scenarios where a small amount of labeled data coexists with a large volume of unlabeled data.<n>This study aims to enhance pseudo-label-based SSL by incorporating distribution matching between labeled and unlabeled feature embeddings.
arXiv Detail & Related papers (2026-01-08T02:32:12Z) - Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition [50.61991746981703]
Current state-of-the-art LTSSL approaches rely on high-quality pseudo-labels for large-scale unlabeled data.
This paper introduces a novel probabilistic framework that unifies various recent proposals in long-tail learning.
We introduce a continuous contrastive learning method, CCL, extending our framework to unlabeled data using reliable and smoothed pseudo-labels.
arXiv Detail & Related papers (2024-10-08T15:06:10Z) - A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification [3.5092955099876266]
This paper proposes a fully self-supervised pipeline for demographically fair facial attribute classification.
We leverage completely unlabeled data pseudolabeled via pre-trained encoders, diverse data curation techniques, and meta-learning-based weighted contrastive learning.
arXiv Detail & Related papers (2024-07-14T07:11:57Z) - Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation [87.17768598044427]
Traditional semi-supervised learning assumes that the feature distributions of labeled and unlabeled data are consistent.
We propose Self-Supervised Feature Adaptation (SSFA), a generic framework for improving SSL performance when labeled and unlabeled data come from different distributions.
Our proposed SSFA is applicable to various pseudo-label-based SSL learners and significantly improves performance in labeled, unlabeled, and even unseen distributions.
arXiv Detail & Related papers (2024-05-31T03:13:45Z) - Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint
Localization [88.74813798138466]
Localizing keypoints of an object is a basic visual problem.
Supervised learning of a keypoint localization network often requires a large amount of data.
We propose to automatically select reliable pseudo-labeled samples with a series of dynamic thresholds.
arXiv Detail & Related papers (2022-01-21T09:51:58Z) - Self-supervised Learning is More Robust to Dataset Imbalance [65.84339596595383]
We investigate self-supervised learning under dataset imbalance.
Off-the-shelf self-supervised representations are already more robust to class imbalance than supervised representations.
We devise a re-weighted regularization technique that consistently improves the SSL representation quality on imbalanced datasets.
arXiv Detail & Related papers (2021-10-11T06:29:56Z) - Self-Tuning for Data-Efficient Deep Learning [75.34320911480008]
Self-Tuning is a novel approach to enable data-efficient deep learning.
It unifies the exploration of labeled and unlabeled data and the transfer of a pre-trained model.
It outperforms its SSL and TL counterparts on five tasks by sharp margins.
arXiv Detail & Related papers (2021-02-25T14:56:19Z) - ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for
Semi-supervised Continual Learning [52.831894583501395]
Continual learning assumes the incoming data are fully labeled, which might not be applicable in real applications.
We propose deep Online Replay with Discriminator Consistency (ORDisCo) to interdependently learn a classifier with a conditional generative adversarial network (GAN)
We show ORDisCo achieves significant performance improvement on various semi-supervised learning benchmark datasets for SSCL.
arXiv Detail & Related papers (2021-01-02T09:04:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.