Saliency Prediction with External Knowledge
- URL: http://arxiv.org/abs/2007.13839v1
- Date: Mon, 27 Jul 2020 20:12:28 GMT
- Title: Saliency Prediction with External Knowledge
- Authors: Yifeng Zhang, Ming Jiang, Qi Zhao
- Abstract summary: We develop a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge.
A Spatial Graph Attention Network is then developed to update saliency features based on the learned graph.
Experiments show that the proposed model learns to predict saliency from the external knowledge and outperforms the state-of-the-art on four saliency benchmarks.
- Score: 27.75589849982756
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The last decades have seen great progress in saliency prediction, with the
success of deep neural networks that are able to encode high-level semantics.
Yet, while humans have the innate capability in leveraging their knowledge to
decide where to look (e.g. people pay more attention to familiar faces such as
celebrities), saliency prediction models have only been trained with large
eye-tracking datasets. This work proposes to bridge this gap by explicitly
incorporating external knowledge for saliency models as humans do. We develop
networks that learn to highlight regions by incorporating prior knowledge of
semantic relationships, be it general or domain-specific, depending on the task
of interest. At the core of the method is a new Graph Semantic Saliency Network
(GraSSNet) that constructs a graph that encodes semantic relationships learned
from external knowledge. A Spatial Graph Attention Network is then developed to
update saliency features based on the learned graph. Experiments show that the
proposed model learns to predict saliency from the external knowledge and
outperforms the state-of-the-art on four saliency benchmarks.
Related papers
- Synergistic Signals: Exploiting Co-Engagement and Semantic Links via
Graph Neural Networks [4.261438296177923]
We study the problem in the context recommender systems at Netflix.
We propose a novel graph-based approach called SemanticGNN.
Our key technical contributions are twofold: (1) we develop a novel relation-aware attention graph neural network (GNN) to handle the imbalanced distribution of relation types in our graph; (2) to handle web-scale graph data that has millions of nodes and billions of edges, we develop a novel distributed graph training paradigm.
arXiv Detail & Related papers (2023-12-07T06:29:26Z) - Intrinsically motivated graph exploration using network theories of
human curiosity [71.2717061477241]
We propose a novel approach for exploring graph-structured data motivated by two theories of human curiosity.
We use these proposed features as rewards for graph neural-network-based reinforcement learning.
arXiv Detail & Related papers (2023-07-11T01:52:08Z) - Deep networks for system identification: a Survey [56.34005280792013]
System identification learns mathematical descriptions of dynamic systems from input-output data.
Main aim of the identified model is to predict new data from previous observations.
We discuss architectures commonly adopted in the literature, like feedforward, convolutional, and recurrent networks.
arXiv Detail & Related papers (2023-01-30T12:38:31Z) - State of the Art and Potentialities of Graph-level Learning [54.68482109186052]
Graph-level learning has been applied to many tasks including comparison, regression, classification, and more.
Traditional approaches to learning a set of graphs rely on hand-crafted features, such as substructures.
Deep learning has helped graph-level learning adapt to the growing scale of graphs by extracting features automatically and encoding graphs into low-dimensional representations.
arXiv Detail & Related papers (2023-01-14T09:15:49Z) - A Variational Graph Autoencoder for Manipulation Action Recognition and
Prediction [1.1816942730023883]
We introduce a deep graph autoencoder to jointly learn recognition and prediction of manipulation tasks from symbolic scene graphs.
Our network has a variational autoencoder structure with two branches: one for identifying the input graph type and one for predicting the future graphs.
We benchmark our new model against different state-of-the-art methods on two different datasets, MANIAC and MSRC-9, and show that our proposed model can achieve better performance.
arXiv Detail & Related papers (2021-10-25T21:40:42Z) - Learning through structure: towards deep neuromorphic knowledge graph
embeddings [0.5906031288935515]
We propose a strategy to map deep graph learning architectures for knowledge graph reasoning to neuromorphic architectures.
Based on the insight that randomly and untrained graph neural networks are able to preserve local graph structures, we compose a frozen neural network shallow knowledge graph embedding models.
We experimentally show that already on conventional computing hardware, this leads to a significant speedup and memory reduction while maintaining a competitive performance level.
arXiv Detail & Related papers (2021-09-21T18:01:04Z) - Learning Graph Representations [0.0]
Graph Neural Networks (GNNs) are efficient ways to get insight into large dynamic graph datasets.
In this paper, we discuss the graph convolutional neural networks graph autoencoders and Social-temporal graph neural networks.
arXiv Detail & Related papers (2021-02-03T12:07:55Z) - Exploiting Heterogeneous Graph Neural Networks with Latent Worker/Task
Correlation Information for Label Aggregation in Crowdsourcing [72.34616482076572]
Crowdsourcing has attracted much attention for its convenience to collect labels from non-expert workers instead of experts.
We propose a novel framework based on graph neural networks for aggregating crowd labels.
arXiv Detail & Related papers (2020-10-25T10:12:37Z) - Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph
Link Prediction [69.1473775184952]
We introduce a realistic problem of few-shot out-of-graph link prediction.
We tackle this problem with a novel transductive meta-learning framework.
We validate our model on multiple benchmark datasets for knowledge graph completion and drug-drug interaction prediction.
arXiv Detail & Related papers (2020-06-11T17:42:46Z) - A Heterogeneous Graph with Factual, Temporal and Logical Knowledge for
Question Answering Over Dynamic Contexts [81.4757750425247]
We study question answering over a dynamic textual environment.
We develop a graph neural network over the constructed graph, and train the model in an end-to-end manner.
arXiv Detail & Related papers (2020-04-25T04:53:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.