GPS-Net: Graph Property Sensing Network for Scene Graph Generation
- URL: http://arxiv.org/abs/2003.12962v1
- Date: Sun, 29 Mar 2020 07:22:31 GMT
- Title: GPS-Net: Graph Property Sensing Network for Scene Graph Generation
- Authors: Xin Lin, Changxing Ding, Jinquan Zeng, Dacheng Tao
- Abstract summary: Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships.
GPS-Net fully explores three properties for SGG: edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships.
GPS-Net achieves state-of-the-art performance on three popular databases: VG, OI, and VRD by significant gains under various settings and metrics.
- Score: 91.60326359082408
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Scene graph generation (SGG) aims to detect objects in an image along with
their pairwise relationships. There are three key properties of scene graph
that have been underexplored in recent works: namely, the edge direction
information, the difference in priority between nodes, and the long-tailed
distribution of relationships. Accordingly, in this paper, we propose a Graph
Property Sensing Network (GPS-Net) that fully explores these three properties
for SGG. First, we propose a novel message passing module that augments the
node feature with node-specific contextual information and encodes the edge
direction information via a tri-linear model. Second, we introduce a node
priority sensitive loss to reflect the difference in priority between nodes
during training. This is achieved by designing a mapping function that adjusts
the focusing parameter in the focal loss. Third, since the frequency of
relationships is affected by the long-tailed distribution problem, we mitigate
this issue by first softening the distribution and then enabling it to be
adjusted for each subject-object pair according to their visual appearance.
Systematic experiments demonstrate the effectiveness of the proposed
techniques. Moreover, GPS-Net achieves state-of-the-art performance on three
popular databases: VG, OI, and VRD by significant gains under various settings
and metrics. The code and models are available at
\url{https://github.com/taksau/GPS-Net}.
Related papers
- SpaGBOL: Spatial-Graph-Based Orientated Localisation [15.324623975476348]
Cross-View Geo-Localisation within urban regions is challenging in part due to the lack of geo-spatial structuring within current datasets and techniques.
We propose utilising graph representations to model sequences of local observations and the connectivity of the target location.
arXiv Detail & Related papers (2024-09-23T20:04:29Z) - Graph Transformer GANs with Graph Masked Modeling for Architectural
Layout Generation [153.92387500677023]
We present a novel graph Transformer generative adversarial network (GTGAN) to learn effective graph node relations.
The proposed graph Transformer encoder combines graph convolutions and self-attentions in a Transformer to model both local and global interactions.
We also propose a novel self-guided pre-training method for graph representation learning.
arXiv Detail & Related papers (2024-01-15T14:36:38Z) - Network Intrusion Detection with Edge-Directed Graph Multi-Head Attention Networks [13.446986347747325]
This paper proposes novel Edge-Directed Graph Multi-Head Attention Networks (EDGMAT) for network intrusion detection.
The proposed EDGMAT model introduces a multi-head attention mechanism into the intrusion detection model. Additional weight learning is realized through the combination of a multi-head attention mechanism and edge features.
arXiv Detail & Related papers (2023-10-26T12:30:11Z) - RU-Net: Regularized Unrolling Network for Scene Graph Generation [92.95032610978511]
Scene graph generation (SGG) aims to detect objects and predict the relationships between each pair of objects.
Existing SGG methods usually suffer from several issues, including 1) ambiguous object representations, and 2) low diversity in relationship predictions.
We propose a regularized unrolling network (RU-Net) to address both problems.
arXiv Detail & Related papers (2022-05-03T04:21:15Z) - Relation Regularized Scene Graph Generation [206.76762860019065]
Scene graph generation (SGG) is built on top of detected objects to predict object pairwise visual relations.
We propose a relation regularized network (R2-Net) which can predict whether there is a relationship between two objects.
Our R2-Net can effectively refine object labels and generate scene graphs.
arXiv Detail & Related papers (2022-02-22T11:36:49Z) - Graph Neural Networks with Feature and Structure Aware Random Walk [7.143879014059894]
We show that in typical heterphilous graphs, the edges may be directed, and whether to treat the edges as is or simply make them undirected greatly affects the performance of the GNN models.
We develop a model that adaptively learns the directionality of the graph, and exploits the underlying long-distance correlations between nodes.
arXiv Detail & Related papers (2021-11-19T08:54:21Z) - GraphMI: Extracting Private Graph Data from Graph Neural Networks [59.05178231559796]
We present textbfGraph textbfModel textbfInversion attack (GraphMI), which aims to extract private graph data of the training graph by inverting GNN.
Specifically, we propose a projected gradient module to tackle the discreteness of graph edges while preserving the sparsity and smoothness of graph features.
We design a graph auto-encoder module to efficiently exploit graph topology, node attributes, and target model parameters for edge inference.
arXiv Detail & Related papers (2021-06-05T07:07:52Z) - Graph Feature Gating Networks [31.20878472589719]
We propose a general graph feature gating network (GFGN) based on the graph signal denoising problem.
We also introduce three graph filters under GFGN to allow different levels of contributions from feature dimensions.
arXiv Detail & Related papers (2021-05-10T16:33:58Z) - Structural Temporal Graph Neural Networks for Anomaly Detection in
Dynamic Graphs [54.13919050090926]
We propose an end-to-end structural temporal Graph Neural Network model for detecting anomalous edges in dynamic graphs.
In particular, we first extract the $h$-hop enclosing subgraph centered on the target edge and propose the node labeling function to identify the role of each node in the subgraph.
Based on the extracted features, we utilize Gated recurrent units (GRUs) to capture the temporal information for anomaly detection.
arXiv Detail & Related papers (2020-05-15T09:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.