Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose
Estimation
- URL: http://arxiv.org/abs/2304.08230v1
- Date: Mon, 17 Apr 2023 12:54:20 GMT
- Title: Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose
Estimation
- Authors: Elena Govi, Davide Sapienza, Carmelo Scribano, Tobia Poppi, Giorgia
Franchini, Paola Ard\`on, Micaela Verucchi and Marko Bertogna
- Abstract summary: In recent years, there has been a growing trend of using data-driven methods in industrial settings.
It becomes critical to understand how the manipulation of video and images can impact the effectiveness of a machine learning method.
Our case study aims precisely to analyze the Linemod dataset, considered the state of the art in 6D pose estimation context.
- Score: 5.30320006562872
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, there has been a growing trend of using data-driven methods
in industrial settings. These kinds of methods often process video images or
parts, therefore the integrity of such images is crucial. Sometimes datasets,
e.g. consisting of images, can be sophisticated for various reasons. It becomes
critical to understand how the manipulation of video and images can impact the
effectiveness of a machine learning method. Our case study aims precisely to
analyze the Linemod dataset, considered the state of the art in 6D pose
estimation context. That dataset presents images accompanied by ArUco markers;
it is evident that such markers will not be available in real-world contexts.
We analyze how the presence of the markers affects the pose estimation
accuracy, and how this bias may be mitigated through data augmentation and
other methods. Our work aims to show how the presence of these markers goes to
modify, in the testing phase, the effectiveness of the deep learning method
used. In particular, we will demonstrate, through the tool of saliency maps,
how the focus of the neural network is captured in part by these ArUco markers.
Finally, a new dataset, obtained by applying geometric tools to Linemod, will
be proposed in order to demonstrate our hypothesis and uncovering the bias. Our
results demonstrate the potential for bias in 6DOF pose estimation networks,
and suggest methods for reducing this bias when training with markers.
Related papers
- Mitigating Bias Using Model-Agnostic Data Attribution [2.9868610316099335]
Mitigating bias in machine learning models is a critical endeavor for ensuring fairness and equity.
We propose a novel approach to address bias by leveraging pixel image attributions to identify and regularize regions of images containing bias attributes.
arXiv Detail & Related papers (2024-05-08T13:00:56Z) - Image and AIS Data Fusion Technique for Maritime Computer Vision
Applications [1.482087972733629]
We develop a technique that fuses Automatic Identification System (AIS) data with vessels detected in images to create datasets.
Our approach associates detected ships to their corresponding AIS messages by estimating distance and azimuth.
This technique is useful for creating datasets for waterway traffic management, encounter detection, and surveillance.
arXiv Detail & Related papers (2023-12-07T20:54:49Z) - Exploiting the Distortion-Semantic Interaction in Fisheye Data [12.633032175875865]
Fisheye data has the wider field of view advantage over other types of cameras, but this comes at the expense of high distortion.
objects further from the center exhibit deformations that make it difficult for a model to identify their semantic context.
We introduce an approach to exploit this relationship by first extracting distortion class labels based on an object's distance from the center of the image.
We then shape a backbone's representation space with a weighted contrastive loss that constrains objects of the same semantic class and distortion class to be close to each other.
arXiv Detail & Related papers (2023-04-28T20:23:38Z) - Improving Fairness in Image Classification via Sketching [14.154930352612926]
Deep neural networks (DNNs) tend to make unfair predictions when the training data are collected from different sub-populations.
We propose to use sketching to handle this phenomenon.
We evaluate our method through extensive experiments on both general scene dataset and medical scene dataset.
arXiv Detail & Related papers (2022-10-31T22:26:32Z) - Semantic keypoint-based pose estimation from single RGB frames [64.80395521735463]
We present an approach to estimating the continuous 6-DoF pose of an object from a single RGB image.
The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model.
We show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios.
arXiv Detail & Related papers (2022-04-12T15:03:51Z) - Object-aware Contrastive Learning for Debiased Scene Representation [74.30741492814327]
We develop a novel object-aware contrastive learning framework that localizes objects in a self-supervised manner.
We also introduce two data augmentations based on ContraCAM, object-aware random crop and background mixup, which reduce contextual and background biases during contrastive self-supervised learning.
arXiv Detail & Related papers (2021-07-30T19:24:07Z) - Data Augmentation for Object Detection via Differentiable Neural
Rendering [71.00447761415388]
It is challenging to train a robust object detector when annotated data is scarce.
Existing approaches to tackle this problem include semi-supervised learning that interpolates labeled data from unlabeled data.
We introduce an offline data augmentation method for object detection, which semantically interpolates the training data with novel views.
arXiv Detail & Related papers (2021-03-04T06:31:06Z) - Spatial Attention Improves Iterative 6D Object Pose Estimation [52.365075652976735]
We propose a new method for 6D pose estimation refinement from RGB images.
Our main insight is that after the initial pose estimate, it is important to pay attention to distinct spatial features of the object.
We experimentally show that this approach learns to attend to salient spatial features and learns to ignore occluded parts of the object, leading to better pose estimation across datasets.
arXiv Detail & Related papers (2021-01-05T17:18:52Z) - Graph Embedding with Data Uncertainty [113.39838145450007]
spectral-based subspace learning is a common data preprocessing step in many machine learning pipelines.
Most subspace learning methods do not take into consideration possible measurement inaccuracies or artifacts that can lead to data with high uncertainty.
arXiv Detail & Related papers (2020-09-01T15:08:23Z) - Distilling Localization for Self-Supervised Representation Learning [82.79808902674282]
Contrastive learning has revolutionized unsupervised representation learning.
Current contrastive models are ineffective at localizing the foreground object.
We propose a data-driven approach for learning in variance to backgrounds.
arXiv Detail & Related papers (2020-04-14T16:29:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.