Data Augmentation Imbalance For Imbalanced Attribute Classification
- URL: http://arxiv.org/abs/2004.13628v3
- Date: Thu, 21 May 2020 07:36:49 GMT
- Title: Data Augmentation Imbalance For Imbalanced Attribute Classification
- Authors: Yang Hu, Xiaying Bai, Pan Zhou, Fanhua Shang, Shengmei Shen
- Abstract summary: We propose a new re-sampling algorithm called: data augmentation imbalance (DAI) to explicitly enhance the ability to discriminate the fewer attributes.
Our DAI algorithm achieves state-of-the-art results, based on pedestrian attribute datasets.
- Score: 60.71438625139922
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Pedestrian attribute recognition is an important multi-label classification
problem. Although the convolutional neural networks are prominent in learning
discriminative features from images, the data imbalance in multi-label setting
for fine-grained tasks remains an open problem. In this paper, we propose a new
re-sampling algorithm called: data augmentation imbalance (DAI) to explicitly
enhance the ability to discriminate the fewer attributes via increasing the
proportion of labels accounting for a small part. Fundamentally, by applying
over-sampling and under-sampling on the multi-label dataset at the same time,
the thought of robbing the rich attributes and helping the poor makes a
significant contribution to DAI. Extensive empirical evidence shows that our
DAI algorithm achieves state-of-the-art results, based on pedestrian attribute
datasets, i.e. standard PA-100K and PETA datasets.
Related papers
- Preserving Empirical Probabilities in BERT for Small-sample Clinical Entity Recognition [5.466034990848432]
This paper explores the effects of unbalanced entity labels of the BERT-based pre-trained model.
We analyze the different mechanisms of loss calculation and loss propagation for the task of token classification on randomized datasets.
arXiv Detail & Related papers (2024-09-05T04:38:49Z) - Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering [0.5735035463793009]
We introduce a framework to enhance the SMOTE algorithm using Variational Autoencoders (VAE)
Our approach systematically quantifies the density of data points in a low-dimensional latent space using the VAE, simultaneously incorporating information on class labels and classification difficulty.
Empirical studies on several imbalanced datasets represent that this simple process innovatively improves the conventional SMOTE algorithm over the deep learning models.
arXiv Detail & Related papers (2024-05-30T07:06:02Z) - Pedestrian Attribute Recognition as Label-balanced Multi-label Learning [12.605514698358165]
We propose a novel framework that successfully decouples label-balanced data re-sampling from the curse of attributes co-occurrence.
Our work achieves best accuracy on various popular benchmarks, and importantly, with minimal computational budget.
arXiv Detail & Related papers (2024-05-08T07:27:15Z) - Improving a Named Entity Recognizer Trained on Noisy Data with a Few
Clean Instances [55.37242480995541]
We propose to denoise noisy NER data with guidance from a small set of clean instances.
Along with the main NER model we train a discriminator model and use its outputs to recalibrate the sample weights.
Results on public crowdsourcing and distant supervision datasets show that the proposed method can consistently improve performance with a small guidance set.
arXiv Detail & Related papers (2023-10-25T17:23:37Z) - Tackling Diverse Minorities in Imbalanced Classification [80.78227787608714]
Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers.
We propose generating synthetic samples iteratively by mixing data samples from both minority and majority classes.
We demonstrate the effectiveness of our proposed framework through extensive experiments conducted on seven publicly available benchmark datasets.
arXiv Detail & Related papers (2023-08-28T18:48:34Z) - Fairness via Adversarial Attribute Neighbourhood Robust Learning [49.93775302674591]
We propose a principled underlineRobust underlineAdversarial underlineAttribute underlineNeighbourhood (RAAN) loss to debias the classification head.
arXiv Detail & Related papers (2022-10-12T23:39:28Z) - Assessing the Quality of the Datasets by Identifying Mislabeled Samples [14.881597737762316]
We propose a novel statistic -- noise score -- as a measure for the quality of each data point to identify mislabeled samples.
In our work, we use the representations derived by the inference network of data quality supervised variational autoencoder (AQUAVS)
We validate our proposed statistic through experimentation by corrupting MNIST, FashionMNIST, and CIFAR10/100 datasets.
arXiv Detail & Related papers (2021-09-10T17:14:09Z) - Semi-supervised Long-tailed Recognition using Alternate Sampling [95.93760490301395]
Main challenges in long-tailed recognition come from the imbalanced data distribution and sample scarcity in its tail classes.
We propose a new recognition setting, namely semi-supervised long-tailed recognition.
We demonstrate significant accuracy improvements over other competitive methods on two datasets.
arXiv Detail & Related papers (2021-05-01T00:43:38Z) - Granular conditional entropy-based attribute reduction for partially
labeled data with proxy labels [12.755874217721054]
We propose a rough sets-based semi-supervised attribute reduction method for partially labeled data.
A novel conditional entropy measure is proposed, and its monotonicity is proved in theory.
Experiments conducted on UCI data sets demonstrate that the proposed semi-supervised attribute reduction method is promising.
arXiv Detail & Related papers (2021-01-23T12:50:09Z) - Exploiting Sample Uncertainty for Domain Adaptive Person
Re-Identification [137.9939571408506]
We estimate and exploit the credibility of the assigned pseudo-label of each sample to alleviate the influence of noisy labels.
Our uncertainty-guided optimization brings significant improvement and achieves the state-of-the-art performance on benchmark datasets.
arXiv Detail & Related papers (2020-12-16T04:09:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.