A Survey on Visual Transfer Learning using Knowledge Graphs
- URL: http://arxiv.org/abs/2201.11794v1
- Date: Thu, 27 Jan 2022 20:19:55 GMT
- Title: A Survey on Visual Transfer Learning using Knowledge Graphs
- Authors: Sebastian Monka, Lavdim Halilaj, Achim Rettinger
- Abstract summary: This survey focuses on visual transfer learning approaches using knowledge graphs (KGs)
KGs can represent auxiliary knowledge either in an underlying graph-structured schema or in a vector-based knowledge graph embedding.
We provide a broad overview of knowledge graph embedding methods and describe several joint training objectives suitable to combine them with high dimensional visual embeddings.
- Score: 0.8701566919381223
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent approaches of computer vision utilize deep learning methods as they
perform quite well if training and testing domains follow the same underlying
data distribution. However, it has been shown that minor variations in the
images that occur when using these methods in the real world can lead to
unpredictable errors. Transfer learning is the area of machine learning that
tries to prevent these errors. Especially, approaches that augment image data
using auxiliary knowledge encoded in language embeddings or knowledge graphs
(KGs) have achieved promising results in recent years. This survey focuses on
visual transfer learning approaches using KGs. KGs can represent auxiliary
knowledge either in an underlying graph-structured schema or in a vector-based
knowledge graph embedding. Intending to enable the reader to solve visual
transfer learning problems with the help of specific KG-DL configurations we
start with a description of relevant modeling structures of a KG of various
expressions, such as directed labeled graphs, hypergraphs, and hyper-relational
graphs. We explain the notion of feature extractor, while specifically
referring to visual and semantic features. We provide a broad overview of
knowledge graph embedding methods and describe several joint training
objectives suitable to combine them with high dimensional visual embeddings.
The main section introduces four different categories on how a KG can be
combined with a DL pipeline: 1) Knowledge Graph as a Reviewer; 2) Knowledge
Graph as a Trainee; 3) Knowledge Graph as a Trainer; and 4) Knowledge Graph as
a Peer. To help researchers find evaluation benchmarks, we provide an overview
of generic KGs and a set of image processing datasets and benchmarks including
various types of auxiliary knowledge. Last, we summarize related surveys and
give an outlook about challenges and open issues for future research.
Related papers
- Knowledge Probing for Graph Representation Learning [12.960185655357495]
We propose a novel graph probing framework (GraphProbe) to investigate and interpret whether the family of graph learning methods has encoded different levels of knowledge in graph representation learning.
Based on the intrinsic properties of graphs, we design three probes to systematically investigate the graph representation learning process from different perspectives.
We construct a thorough evaluation benchmark with nine representative graph learning methods from random walk based approaches, basic graph neural networks and self-supervised graph methods, and probe them on six benchmark datasets for node classification, link prediction and graph classification.
arXiv Detail & Related papers (2024-08-07T16:27:45Z) - KGrEaT: A Framework to Evaluate Knowledge Graphs via Downstream Tasks [1.8722948221596285]
KGrEaT is a framework to estimate the quality of knowledge graphs via actual downstream tasks like classification, clustering, or recommendation.
The framework takes a knowledge graph as input, automatically maps it to the datasets to be evaluated on, and computes performance metrics for the defined tasks.
arXiv Detail & Related papers (2023-08-21T07:43:10Z) - State of the Art and Potentialities of Graph-level Learning [54.68482109186052]
Graph-level learning has been applied to many tasks including comparison, regression, classification, and more.
Traditional approaches to learning a set of graphs rely on hand-crafted features, such as substructures.
Deep learning has helped graph-level learning adapt to the growing scale of graphs by extracting features automatically and encoding graphs into low-dimensional representations.
arXiv Detail & Related papers (2023-01-14T09:15:49Z) - Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning [74.6485604326913]
We provide a new semantic enhanced knowledge graph that contains both expert knowledge and categories semantic correlation.
To propagate information on the knowledge graph, we propose a novel Residual Graph Convolutional Network (ResGCN)
Experiments conducted on the widely used large-scale ImageNet-21K dataset and AWA2 dataset show the effectiveness of our method.
arXiv Detail & Related papers (2022-12-26T13:18:36Z) - SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense
Reasoning [61.57887011165744]
multimodal Transformers have made great progress in the task of Visual Commonsense Reasoning.
We propose a Scene Graph Enhanced Image-Text Learning framework to incorporate visual scene graphs in commonsense reasoning.
arXiv Detail & Related papers (2021-12-16T03:16:30Z) - GCNBoost: Artwork Classification by Label Propagation through a
Knowledge Graph [32.129005474301735]
Contextual information is often the key to structure such real world data, and we propose to use it in form of a knowledge graph.
We propose a novel use of a knowledge graph, that is constructed on annotated data and pseudo-labeled data.
With label propagation, we boost artwork classification by training a model using a graph convolutional network.
arXiv Detail & Related papers (2021-05-25T11:50:05Z) - Graph Representation Learning by Ensemble Aggregating Subgraphs via
Mutual Information Maximization [5.419711903307341]
We introduce a self-supervised learning method to enhance the representations of graph-level learned by Graph Neural Networks.
To get a comprehensive understanding of the graph structure, we propose an ensemble-learning like subgraph method.
And to achieve efficient and effective contrasive learning, a Head-Tail contrastive samples construction method is proposed.
arXiv Detail & Related papers (2021-03-24T12:06:12Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z) - All About Knowledge Graphs for Actions [82.39684757372075]
We propose a better understanding of knowledge graphs (KGs) that can be utilized for zero-shot and few-shot action recognition.
We study three different construction mechanisms for KGs: action embeddings, action-object embeddings, visual embeddings.
We present extensive analysis of the impact of different KGs on different experimental setups.
arXiv Detail & Related papers (2020-08-28T01:44:01Z) - GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training [62.73470368851127]
Graph representation learning has emerged as a powerful technique for addressing real-world problems.
We design Graph Contrastive Coding -- a self-supervised graph neural network pre-training framework.
We conduct experiments on three graph learning tasks and ten graph datasets.
arXiv Detail & Related papers (2020-06-17T16:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.