ObjectNet Dataset: Reanalysis and Correction
- URL: http://arxiv.org/abs/2004.02042v1
- Date: Sat, 4 Apr 2020 22:45:57 GMT
- Title: ObjectNet Dataset: Reanalysis and Correction
- Authors: Ali Borji
- Abstract summary: Recently, Barbu et al introduced a dataset called ObjectNet which includes objects in daily life situations.
They showed a dramatic performance drop of the state of the art object recognition models on this dataset.
We highlight a major problem with their work which is applying object recognizers to the scenes containing multiple objects rather than isolated objects.
- Score: 47.64219291655723
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, Barbu et al introduced a dataset called ObjectNet which includes
objects in daily life situations. They showed a dramatic performance drop of
the state of the art object recognition models on this dataset. Due to the
importance and implications of their results regarding generalization ability
of deep models, we take a second look at their findings. We highlight a major
problem with their work which is applying object recognizers to the scenes
containing multiple objects rather than isolated objects. The latter results in
around 20-30% performance gain using our code. Compared with the results
reported in the ObjectNet paper, we observe that around 10-15 % of the
performance loss can be recovered, without any test time data augmentation. In
accordance with Barbu et al.'s conclusions, however, we also conclude that deep
models suffer drastically on this dataset. Thus, we believe that ObjectNet
remains a challenging dataset for testing the generalization power of models
beyond datasets on which they have been trained.
Related papers
- Exploring the Effectiveness of Dataset Synthesis: An application of
Apple Detection in Orchards [68.95806641664713]
We explore the usability of Stable Diffusion 2.1-base for generating synthetic datasets of apple trees for object detection.
We train a YOLOv5m object detection model to predict apples in a real-world apple detection dataset.
Results demonstrate that the model trained on generated data is slightly underperforming compared to a baseline model trained on real-world images.
arXiv Detail & Related papers (2023-06-20T09:46:01Z) - Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset [14.44921491933053]
Spoken ObjectNet is designed to remove some of these biases and provide a way to better evaluate how effectively models will perform in real-world scenarios.
This dataset expands upon ObjectNet, which is a bias-controlled image dataset that features similar image classes to those present in ImageNet.
Results show that models trained on other datasets and then evaluated on Spoken ObjectNet tend to perform poorly due to biases in other datasets that the models have learned.
arXiv Detail & Related papers (2021-10-14T17:38:20Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - REGRAD: A Large-Scale Relational Grasp Dataset for Safe and
Object-Specific Robotic Grasping in Clutter [52.117388513480435]
We present a new dataset named regrad to sustain the modeling of relationships among objects and grasps.
Our dataset is collected in both forms of 2D images and 3D point clouds.
Users are free to import their own object models for the generation of as many data as they want.
arXiv Detail & Related papers (2021-04-29T05:31:21Z) - Unsupervised Learning of 3D Object Categories from Videos in the Wild [75.09720013151247]
We focus on learning a model from multiple views of a large collection of object instances.
We propose a new neural network design, called warp-conditioned ray embedding (WCR), which significantly improves reconstruction.
Our evaluation demonstrates performance improvements over several deep monocular reconstruction baselines on existing benchmarks.
arXiv Detail & Related papers (2021-03-30T17:57:01Z) - Contemplating real-world object classification [53.10151901863263]
We reanalyze the ObjectNet dataset recently proposed by Barbu et al. containing objects in daily life situations.
We find that applying deep models to the isolated objects, rather than the entire scene as is done in the original paper, results in around 20-30% performance improvement.
arXiv Detail & Related papers (2021-03-08T23:29:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.