Probabilistic Debiasing of Scene Graphs
- URL: http://arxiv.org/abs/2211.06444v1
- Date: Fri, 11 Nov 2022 19:06:49 GMT
- Title: Probabilistic Debiasing of Scene Graphs
- Authors: Bashirul Azam Biswas and Qiang Ji
- Abstract summary: The quality of scene graphs generated by the state-of-the-art (SOTA) models is compromised due to the long-tail nature of the relationships and their parent object pairs.
We propose virtual evidence incorporated within-triplet Bayesian Network (BN) to preserve the object-conditional distribution of the relationship label and to eradicate the bias created by the marginal probability of the relationships.
- Score: 43.111851503211724
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The quality of scene graphs generated by the state-of-the-art (SOTA) models
is compromised due to the long-tail nature of the relationships and their
parent object pairs. Training of the scene graphs is dominated by the majority
relationships of the majority pairs and, therefore, the object-conditional
distributions of relationship in the minority pairs are not preserved after the
training is converged. Consequently, the biased model performs well on more
frequent relationships in the marginal distribution of relationships such as
`on' and `wearing', and performs poorly on the less frequent relationships such
as `eating' or `hanging from'. In this work, we propose virtual evidence
incorporated within-triplet Bayesian Network (BN) to preserve the
object-conditional distribution of the relationship label and to eradicate the
bias created by the marginal probability of the relationships. The insufficient
number of relationships in the minority classes poses a significant problem in
learning the within-triplet Bayesian network. We address this insufficiency by
embedding-based augmentation of triplets where we borrow samples of the
minority triplet classes from its neighborhood triplets in the semantic space.
We perform experiments on two different datasets and achieve a significant
improvement in the mean recall of the relationships. We also achieve better
balance between recall and mean recall performance compared to the SOTA
de-biasing techniques of scene graph models.
Related papers
- RelVAE: Generative Pretraining for few-shot Visual Relationship
Detection [2.2230760534775915]
We present the first pretraining method for few-shot predicate classification that does not require any annotated relations.
We construct few-shot training splits and show quantitative experiments on VG200 and VRD datasets.
arXiv Detail & Related papers (2023-11-27T19:08:08Z) - Document-level Relation Extraction with Relation Correlations [15.997345900917058]
Document-level relation extraction faces two overlooked challenges: long-tail problem and multi-label problem.
We analyze the co-occurrence correlation of relations, and introduce it into DocRE task for the first time.
arXiv Detail & Related papers (2022-12-20T11:17:52Z) - Learning Relation Prototype from Unlabeled Texts for Long-tail Relation
Extraction [84.64435075778988]
We propose a general approach to learn relation prototypes from unlabeled texts.
We learn relation prototypes as an implicit factor between entities.
We conduct experiments on two publicly available datasets: New York Times and Google Distant Supervision.
arXiv Detail & Related papers (2020-11-27T06:21:12Z) - One-shot Learning for Temporal Knowledge Graphs [49.41854171118697]
We propose a one-shot learning framework for link prediction in temporal knowledge graphs.
Our proposed method employs a self-attention mechanism to effectively encode temporal interactions between entities.
Our experiments show that the proposed algorithm outperforms the state of the art baselines for two well-studied benchmarks.
arXiv Detail & Related papers (2020-10-23T03:24:44Z) - Learning to Decouple Relations: Few-Shot Relation Classification with
Entity-Guided Attention and Confusion-Aware Training [49.9995628166064]
We propose CTEG, a model equipped with two mechanisms to learn to decouple easily-confused relations.
On the one hand, an EGA mechanism is introduced to guide the attention to filter out information causing confusion.
On the other hand, a Confusion-Aware Training (CAT) method is proposed to explicitly learn to distinguish relations.
arXiv Detail & Related papers (2020-10-21T11:07:53Z) - Improving Long-Tail Relation Extraction with Collaborating
Relation-Augmented Attention [63.26288066935098]
We propose a novel neural network, Collaborating Relation-augmented Attention (CoRA), to handle both the wrong labeling and long-tail relations.
In the experiments on the popular benchmark dataset NYT, the proposed CoRA improves the prior state-of-the-art performance by a large margin.
arXiv Detail & Related papers (2020-10-08T05:34:43Z) - Addressing Class Imbalance in Scene Graph Parsing by Learning to
Contrast and Score [65.18522219013786]
Scene graph parsing aims to detect objects in an image scene and recognize their relations.
Recent approaches have achieved high average scores on some popular benchmarks, but fail in detecting rare relations.
This paper introduces a novel integrated framework of classification and ranking to resolve the class imbalance problem.
arXiv Detail & Related papers (2020-09-28T13:57:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.