Unbiased Scene Graph Generation via Two-stage Causal Modeling
- URL: http://arxiv.org/abs/2307.05276v1
- Date: Tue, 11 Jul 2023 14:11:24 GMT
- Title: Unbiased Scene Graph Generation via Two-stage Causal Modeling
- Authors: Shuzhou Sun, Shuaifeng Zhi, Qing Liao, Janne Heikkil\"a, Li Liu
- Abstract summary: We propose Two-stage Causal Modeling (TsCM) for the Scene Graph Generation (SGG) task.
TsCM takes the long-tailed distribution and semantic confusion as confounders to the Structural Causal Model (SCM) and then decouples the causal intervention into two stages.
Our method can achieve state-of-the-art performance in terms of mean recall rate.
- Score: 11.390135312161044
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Despite the impressive performance of recent unbiased Scene Graph Generation
(SGG) methods, the current debiasing literature mainly focuses on the
long-tailed distribution problem, whereas it overlooks another source of bias,
i.e., semantic confusion, which makes the SGG model prone to yield false
predictions for similar relationships. In this paper, we explore a debiasing
procedure for the SGG task leveraging causal inference. Our central insight is
that the Sparse Mechanism Shift (SMS) in causality allows independent
intervention on multiple biases, thereby potentially preserving head category
performance while pursuing the prediction of high-informative tail
relationships. However, the noisy datasets lead to unobserved confounders for
the SGG task, and thus the constructed causal models are always
causal-insufficient to benefit from SMS. To remedy this, we propose Two-stage
Causal Modeling (TsCM) for the SGG task, which takes the long-tailed
distribution and semantic confusion as confounders to the Structural Causal
Model (SCM) and then decouples the causal intervention into two stages. The
first stage is causal representation learning, where we use a novel Population
Loss (P-Loss) to intervene in the semantic confusion confounder. The second
stage introduces the Adaptive Logit Adjustment (AL-Adjustment) to eliminate the
long-tailed distribution confounder to complete causal calibration learning.
These two stages are model agnostic and thus can be used in any SGG model that
seeks unbiased predictions. Comprehensive experiments conducted on the popular
SGG backbones and benchmarks show that our TsCM can achieve state-of-the-art
performance in terms of mean recall rate. Furthermore, TsCM can maintain a
higher recall rate than other debiasing methods, which indicates that our
method can achieve a better tradeoff between head and tail relationships.
Related papers
- Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning [2.7813683000222653]
We propose the Causally Calibrated Robust ( CCR) to reduce models' reliance on spurious correlations.
CCR integrates a causal feature selection method based on counterfactual reasoning, along with an inverse propensity weighting (IPW) loss function.
We show that CCR state-of-the-art performance among methods without group labels, and in some cases, it can compete with the models that utilize group labels.
arXiv Detail & Related papers (2024-11-01T21:29:07Z) - Delving into Identify-Emphasize Paradigm for Combating Unknown Bias [52.76758938921129]
We propose an effective bias-conflicting scoring method (ECS) to boost the identification accuracy.
We also propose gradient alignment (GA) to balance the contributions of the mined bias-aligned and bias-conflicting samples.
Experiments are conducted on multiple datasets in various settings, demonstrating that the proposed solution can mitigate the impact of unknown biases.
arXiv Detail & Related papers (2023-02-22T14:50:24Z) - Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation [87.13847750383778]
We propose a Dual-branch Hybrid Learning network (DHL) to take care of both head predicates and tail ones for Scene Graph Generation (SGG)
We show that our approach achieves a new state-of-the-art performance on VG and GQA datasets.
arXiv Detail & Related papers (2022-07-16T11:53:50Z) - Adaptive Fine-Grained Predicates Learning for Scene Graph Generation [122.4588401267544]
General Scene Graph Generation (SGG) models tend to predict head predicates and re-balancing strategies prefer tail categories.
We propose an Adaptive Fine-Grained Predicates Learning (FGPL-A) which aims at differentiating hard-to-distinguish predicates for SGG.
Our proposed model-agnostic strategy significantly boosts performance of benchmark models on VG-SGG and GQA-SGG datasets by up to 175% and 76% on Mean Recall@100, achieving new state-of-the-art performance.
arXiv Detail & Related papers (2022-07-11T03:37:57Z) - Diffusion Causal Models for Counterfactual Estimation [18.438307666925425]
We consider the task of counterfactual estimation from observational imaging data given a known causal structure.
We propose Diff-SCM, a deep structural causal model that builds on recent advances of generative energy-based models.
We find that Diff-SCM produces more realistic and minimal counterfactuals than baselines on MNIST data and can also be applied to ImageNet data.
arXiv Detail & Related papers (2022-02-21T12:23:01Z) - General Greedy De-bias Learning [163.65789778416172]
We propose a General Greedy De-bias learning framework (GGD), which greedily trains the biased models and the base model like gradient descent in functional space.
GGD can learn a more robust base model under the settings of both task-specific biased models with prior knowledge and self-ensemble biased model without prior knowledge.
arXiv Detail & Related papers (2021-12-20T14:47:32Z) - BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery [97.79015388276483]
A structural equation model (SEM) is an effective framework to reason over causal relationships represented via a directed acyclic graph (DAG)
Recent advances enabled effective maximum-likelihood point estimation of DAGs from observational data.
We propose BCD Nets, a variational framework for estimating a distribution over DAGs characterizing a linear-Gaussian SEM.
arXiv Detail & Related papers (2021-12-06T03:35:21Z) - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph
Generation [58.98802062945709]
We propose a novel Predicate-Correlation Perception Learning scheme to adaptively seek out appropriate loss weights.
Our PCPL framework is further equipped with a graph encoder module to better extract context features.
arXiv Detail & Related papers (2020-09-02T08:30:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.