The Semi-Supervised iNaturalist-Aves Challenge at FGVC7 Workshop
- URL: http://arxiv.org/abs/2103.06937v1
- Date: Thu, 11 Mar 2021 20:21:16 GMT
- Title: The Semi-Supervised iNaturalist-Aves Challenge at FGVC7 Workshop
- Authors: Jong-Chyi Su and Subhransu Maji
- Abstract summary: This document describes the details and the motivation behind a new dataset we collected for the semi-supervised recognition challengecitesemi-aves at the FGVC7 workshop at CVPR 2020.
The dataset contains 1000 species of birds sampled from the iNat-2018 dataset for a total of nearly 150k images.
- Score: 42.02670649470055
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This document describes the details and the motivation behind a new dataset
we collected for the semi-supervised recognition challenge~\cite{semi-aves} at
the FGVC7 workshop at CVPR 2020. The dataset contains 1000 species of birds
sampled from the iNat-2018 dataset for a total of nearly 150k images. From this
collection, we sample a subset of classes and their labels, while adding the
images from the remaining classes to the unlabeled set of images. The presence
of out-of-domain data (novel classes), high class-imbalance, and fine-grained
similarity between classes poses significant challenges for existing
semi-supervised recognition techniques in the literature. The dataset is
available here: \url{https://github.com/cvl-umass/semi-inat-2020}
Related papers
- Distributionally Robust Classification on a Data Budget [26.69877485937123]
We show that standard ResNet-50 trained with the cross-entropy loss on 2.4 million image samples can attain comparable robustness to a CLIP ResNet-50 trained on 400 million samples.
This is the first result showing (near) state-of-the-art distributional robustness on limited data budgets.
arXiv Detail & Related papers (2023-08-07T15:30:02Z) - Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image
Classification [73.87160347728314]
We investigate how natural background colors play a role as spurious features by annotating the test sets of CIFAR10 and CIFAR100 into subgroups based on the background color of each image.
We find that overall human-level accuracy does not guarantee consistent subgroup performances, and the phenomenon remains even on models pre-trained on ImageNet or after data augmentation (DA)
Experimental results show that FlowAug achieves more consistent subgroup results than other types of DA methods on CIFAR10/100 and on CIFAR10/100-C.
arXiv Detail & Related papers (2022-12-16T18:51:10Z) - Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine
Learning Algorithms [57.29464116557734]
We introduce the Oracle-MNIST dataset, comprising of 28$times $28 grayscale images of 30,222 ancient characters.
The training set totally consists of 27,222 images, and the test set contains 300 images per class.
arXiv Detail & Related papers (2022-05-19T09:57:45Z) - Generalized Category Discovery [148.32255950504182]
We consider a highly general image recognition setting wherein, given a labelled and unlabelled set of images, the task is to categorize all images in the unlabelled set.
Here, the unlabelled images may come from labelled classes or from novel ones.
We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task.
We then introduce a simple yet effective semi-supervised $k$-means method to cluster the unlabelled data into seen and unseen classes.
arXiv Detail & Related papers (2022-01-07T18:58:35Z) - Novel Class Discovery in Semantic Segmentation [104.30729847367104]
We introduce a new setting of Novel Class Discovery in Semantic (NCDSS)
It aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes.
In NCDSS, we need to distinguish the objects and background, and to handle the existence of multiple classes within an image.
We propose the Entropy-based Uncertainty Modeling and Self-training (EUMS) framework to overcome noisy pseudo-labels.
arXiv Detail & Related papers (2021-12-03T13:31:59Z) - A Strong Baseline for the VIPriors Data-Efficient Image Classification
Challenge [9.017660524497389]
We present a strong baseline for data-efficient image classification on the VIPriors challenge dataset.
Our baseline achieves 69.7% accuracy and outperforms 50% of submissions to the VIPriors 2021 challenge.
arXiv Detail & Related papers (2021-09-28T08:45:15Z) - The Semi-Supervised iNaturalist Challenge at the FGVC8 Workshop [42.02670649470055]
Semi-iNat is a challenging dataset for semi-supervised classification with a long-tailed distribution of classes, fine-grained categories, and domain shifts between labeled and unlabeled data.
This dataset is behind the second iteration of the semi-supervised recognition challenge to be held at the FGVC8 workshop at CVPR 2021.
arXiv Detail & Related papers (2021-06-02T17:59:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.