Importance Weighted Structure Learning for Scene Graph Generation
- URL: http://arxiv.org/abs/2205.07017v1
- Date: Sat, 14 May 2022 09:25:14 GMT
- Title: Importance Weighted Structure Learning for Scene Graph Generation
- Authors: Daqi Liu, Miroslaw Bober, Josef Kittler
- Abstract summary: We propose a novel importance weighted structure learning method for scene graph generation.
A generic entropic mirror descent algorithm is applied to solve the resulting constrained variational inference task.
The proposed method achieves the state-of-the-art performance on various popular scene graph generation benchmarks.
- Score: 40.46394569128303
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Scene graph generation is a structured prediction task aiming to explicitly
model objects and their relationships via constructing a visually-grounded
scene graph for an input image. Currently, the message passing neural network
based mean field variational Bayesian methodology is the ubiquitous solution
for such a task, in which the variational inference objective is often assumed
to be the classical evidence lower bound. However, the variational
approximation inferred from such loose objective generally underestimates the
underlying posterior, which often leads to inferior generation performance. In
this paper, we propose a novel importance weighted structure learning method
aiming to approximate the underlying log-partition function with a tighter
importance weighted lower bound, which is computed from multiple samples drawn
from a reparameterizable Gumbel-Softmax sampler. A generic entropic mirror
descent algorithm is applied to solve the resulting constrained variational
inference task. The proposed method achieves the state-of-the-art performance
on various popular scene graph generation benchmarks.
Related papers
- Doubly Reparameterized Importance Weighted Structure Learning for Scene
Graph Generation [40.46394569128303]
Scene graph generation, given an input image, aims to explicitly model objects and their relationships by constructing a visually-grounded scene graph.
We propose a novel doubly re parameterized importance weighted structure learning method, which employs a tighter importance weighted lower bound as the variational inference objective.
The proposed method achieves the state-of-the-art performance on various popular scene graph generation benchmarks.
arXiv Detail & Related papers (2022-06-22T20:00:25Z) - Constrained Structure Learning for Scene Graph Generation [40.46394569128303]
We present a constrained structure learning method, for which an explicit constrained variational inference objective is proposed.
We validate the proposed generic model on various popular scene graph generation benchmarks and show that it outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-01-27T17:47:37Z) - Bayesian Graph Contrastive Learning [55.36652660268726]
We propose a novel perspective of graph contrastive learning methods showing random augmentations leads to encoders.
Our proposed method represents each node by a distribution in the latent space in contrast to existing techniques which embed each node to a deterministic vector.
We show a considerable improvement in performance compared to existing state-of-the-art methods on several benchmark datasets.
arXiv Detail & Related papers (2021-12-15T01:45:32Z) - Neural Belief Propagation for Scene Graph Generation [31.9682610869767]
We propose a novel neural belief propagation method to generate the resulting scene graph.
It employs a structural Bethe approximation rather than the mean field approximation to infer the associated marginals.
It achieves the state-of-the-art performance on various popular scene graph generation benchmarks.
arXiv Detail & Related papers (2021-12-10T18:30:27Z) - Instance-Level Relative Saliency Ranking with Graph Reasoning [126.09138829920627]
We present a novel unified model to segment salient instances and infer relative saliency rank order.
A novel loss function is also proposed to effectively train the saliency ranking branch.
experimental results demonstrate that our proposed model is more effective than previous methods.
arXiv Detail & Related papers (2021-07-08T13:10:42Z) - Scaling Up Graph Neural Networks Via Graph Coarsening [18.176326897605225]
Scalability of graph neural networks (GNNs) is one of the major challenges in machine learning.
In this paper, we propose to use graph coarsening for scalable training of GNNs.
We show that, simply applying off-the-shelf coarsening methods, we can reduce the number of nodes by up to a factor of ten without causing a noticeable downgrade in classification accuracy.
arXiv Detail & Related papers (2021-06-09T15:46:17Z) - Deep Magnification-Flexible Upsampling over 3D Point Clouds [103.09504572409449]
We propose a novel end-to-end learning-based framework to generate dense point clouds.
We first formulate the problem explicitly, which boils down to determining the weights and high-order approximation errors.
Then, we design a lightweight neural network to adaptively learn unified and sorted weights as well as the high-order refinements.
arXiv Detail & Related papers (2020-11-25T14:00:18Z) - Learning What Makes a Difference from Counterfactual Examples and
Gradient Supervision [57.14468881854616]
We propose an auxiliary training objective that improves the generalization capabilities of neural networks.
We use pairs of minimally-different examples with different labels, a.k.a counterfactual or contrasting examples, which provide a signal indicative of the underlying causal structure of the task.
Models trained with this technique demonstrate improved performance on out-of-distribution test sets.
arXiv Detail & Related papers (2020-04-20T02:47:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.