Contemplating real-world object classification
- URL: http://arxiv.org/abs/2103.05137v1
- Date: Mon, 8 Mar 2021 23:29:59 GMT
- Title: Contemplating real-world object classification
- Authors: Ali Borji
- Abstract summary: We reanalyze the ObjectNet dataset recently proposed by Barbu et al. containing objects in daily life situations.
We find that applying deep models to the isolated objects, rather than the entire scene as is done in the original paper, results in around 20-30% performance improvement.
- Score: 53.10151901863263
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Deep object recognition models have been very successful over benchmark
datasets such as ImageNet. How accurate and robust are they to distribution
shifts arising from natural and synthetic variations in datasets? Prior
research on this problem has primarily focused on ImageNet variations (e.g.,
ImageNetV2, ImageNet-A). To avoid potential inherited biases in these studies,
we take a different approach. Specifically, we reanalyze the ObjectNet dataset
recently proposed by Barbu et al. containing objects in daily life situations.
They showed a dramatic performance drop of state-of-the-art object
recognition models on this dataset. Due to the importance and implications of
their results regarding the generalization ability of deep models, we take a
second look at their analysis. We find that applying deep models to the
isolated objects, rather than the entire scene as is done in the original
paper, results in around 20-30% performance improvement. Relative to the
numbers reported in Barbu et al., around 10-15% of the performance loss is
recovered, without any test time data augmentation. Despite this gain, however,
we conclude that deep models still suffer drastically on the ObjectNet dataset.
We also investigate the robustness of models against synthetic image
perturbations such as geometric transformations (e.g., scale, rotation,
translation), natural image distortions (e.g., impulse noise, blur) as well as
adversarial attacks (e.g., FGSM and PGD-5). Our results indicate that limiting
the object area as much as possible (i.e., from the entire image to the
bounding box to the segmentation mask) leads to consistent improvement in
accuracy and robustness.
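
The three input variants compared in the abstract (entire image, bounding box, segmentation mask) can be illustrated with a minimal NumPy sketch. This is not the paper's pipeline: the helper names, the toy image, and the choice to fill background pixels with zeros are all assumptions made for illustration, and the classifier itself is left out.

```python
import numpy as np

def mask_to_box(mask):
    """Tightest box (x_min, y_min, x_max, y_max) around True pixels; max is exclusive."""
    ys, xs = np.nonzero(mask)
    return int(xs.min()), int(ys.min()), int(xs.max()) + 1, int(ys.max()) + 1

def crop_to_box(image, box):
    """Return the part of an H x W x C image that lies inside the box."""
    x0, y0, x1, y1 = box
    return image[y0:y1, x0:x1]

def apply_mask(image, mask, fill=0):
    """Keep object pixels and overwrite the background with `fill` (an assumed choice)."""
    out = np.full_like(image, fill)
    out[mask] = image[mask]
    return out

# Toy 6 x 8 RGB image and a hypothetical object mask.
image = np.arange(6 * 8 * 3, dtype=np.uint8).reshape(6, 8, 3)
mask = np.zeros((6, 8), dtype=bool)
mask[2:4, 3:6] = True
mask[4, 4] = True

box = mask_to_box(mask)

full_input = image                                           # entire scene
box_input = crop_to_box(image, box)                          # bounding box only
seg_input = crop_to_box(apply_mask(image, mask), box)        # object pixels only
```

Each variant would then be resized and passed to the same classifier, so that any accuracy difference comes from how much non-object area the model sees.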
 
      
Related papers
- Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models [17.331413720045898]
 Backgrounds in images play a major role in contributing to spurious correlations among different data points.
In this paper, we show that these biases can impact how much a model relies on spurious features in the background to make its predictions.
 arXiv  Detail & Related papers  (2025-05-06T14:27:01Z)
- ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object [78.58860252442045]
 We introduce generative models as a data source for hard images that benchmark deep models' robustness.
We generate images with more diversified backgrounds, textures, and materials than any prior work, and term this benchmark ImageNet-D.
Our work suggests that diffusion models can be an effective source to test vision models.
 arXiv  Detail & Related papers  (2024-03-27T17:23:39Z)
- DVMNet++: Rethinking Relative Pose Estimation for Unseen Objects [59.51874686414509]
 Existing approaches typically predict 3D translation utilizing the ground-truth object bounding box and approximate 3D rotation with a large number of discrete hypotheses.
We present a Deep Voxel Matching Network (DVMNet++) that computes the relative object pose in a single pass.
Our approach delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
 arXiv  Detail & Related papers  (2024-03-20T15:41:32Z)
- Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for Advanced Object Detection [55.2480439325792]
 We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
 arXiv  Detail & Related papers  (2023-11-21T19:49:13Z)
- Uncertainty in AI: Evaluating Deep Neural Networks on Out-of-Distribution Images [0.0]
 This paper investigates the uncertainty of various deep neural networks, including ResNet-50, VGG16, DenseNet121, AlexNet, and GoogleNet, when dealing with perturbed data.
While ResNet-50 was the most accurate single model for OOD images, the ensemble performed even better, correctly classifying all images.
 arXiv  Detail & Related papers  (2023-09-04T22:46:59Z)
- ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing [45.14977000707886]
 Higher accuracy on ImageNet usually leads to better robustness against different corruptions.
We create a toolkit for object editing with controls of backgrounds, sizes, positions, and directions.
We evaluate the performance of current deep learning models, including both convolutional neural networks and vision transformers.
 arXiv  Detail & Related papers  (2023-03-30T02:02:32Z)
- Salient Objects in Clutter [130.63976772770368]
 This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
 arXiv  Detail & Related papers  (2021-05-07T03:49:26Z)
- Rethinking Natural Adversarial Examples for Classification Models [43.87819913022369]
 ImageNet-A is a famous dataset of natural adversarial examples.
We validated the hypothesis by reducing the background influence in ImageNet-A examples with object detection techniques.
Experiments showed that the object detection models with various classification models as backbones obtained much higher accuracy than their corresponding classification models.
 arXiv  Detail & Related papers  (2021-02-23T14:46:48Z)
- Secrets of 3D Implicit Object Shape Reconstruction in the Wild [92.5554695397653]
 Reconstructing high-fidelity 3D objects from sparse, partial observation is crucial for various applications in computer vision, robotics, and graphics.
Recent neural implicit modeling methods show promising results on synthetic or dense datasets.
However, they perform poorly on real-world data that is sparse and noisy.
This paper analyzes the root cause of such deficient performance of a popular neural implicit model.
 arXiv  Detail & Related papers  (2021-01-18T03:24:48Z)
- ObjectNet Dataset: Reanalysis and Correction [47.64219291655723]
 Recently, Barbu et al. introduced a dataset called ObjectNet, which includes objects in daily life situations.
They showed a dramatic performance drop of state-of-the-art object recognition models on this dataset.
We highlight a major problem with their work: applying object recognizers to scenes containing multiple objects rather than to isolated objects.
 arXiv  Detail & Related papers  (2020-04-04T22:45:57Z)
       
     