Graph Data Augmentation for Graph Machine Learning: A Survey
- URL: http://arxiv.org/abs/2202.08871v1
- Date: Thu, 17 Feb 2022 19:14:17 GMT
- Title: Graph Data Augmentation for Graph Machine Learning: A Survey
- Authors: Tong Zhao, Gang Liu, Stephan G\"unnemann, Meng Jiang
- Abstract summary: This paper aims to clarify the landscape of existing literature in graph data augmentation and motivate additional work in this area.
We first categorize graph data augmentation operations based on the components of graph data they modify or create.
Next, we introduce recent advances in graph data augmentation, separating by their learning objectives and methodologies.
- Score: 19.372562034069084
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Data augmentation has recently seen increased interest in graph machine
learning given its ability of creating extra training data and improving model
generalization. Despite this recent upsurge, this area is still relatively
underexplored, due to the challenges brought by complex, non-Euclidean
structure of graph data, which limits the direct analogizing of traditional
augmentation operations on other types of data. In this paper, we present a
comprehensive and systematic survey of graph data augmentation that summarizes
the literature in a structured manner. We first categorize graph data
augmentation operations based on the components of graph data they modify or
create. Next, we introduce recent advances in graph data augmentation,
separating by their learning objectives and methodologies. We conclude by
outlining currently unsolved challenges as well as directions for future
research. Overall, this paper aims to clarify the landscape of existing
literature in graph data augmentation and motivate additional work in this
area. We provide a GitHub repository
(https://github.com/zhao-tong/graph-data-augmentation-papers) with a reading
list that will be continuously updated.
Related papers
- Parametric Graph Representations in the Era of Foundation Models: A Survey and Position [69.48708136448694]
Graphs have been widely used in the past decades of big data and AI to model comprehensive relational data.
Identifying meaningful graph laws can significantly enhance the effectiveness of various applications.
arXiv Detail & Related papers (2024-10-16T00:01:31Z) - Graph data augmentation with Gromow-Wasserstein Barycenters [0.0]
It has been proposed a novel augmentation strategy for graphs that operates in a non-Euclidean space.
A non-Euclidean distance, specifically the Gromow-Wasserstein distance, results in better approximations of the graphon.
This framework also provides a means to validate different graphon estimation approaches.
arXiv Detail & Related papers (2024-04-12T10:22:55Z) - A Survey of Data-Efficient Graph Learning [16.053913182723143]
We introduce a novel concept of Data-Efficient Graph Learning (DEGL) as a research frontier.
We systematically review recent advances on several key aspects, including self-supervised graph learning, semi-supervised graph learning, and few-shot graph learning.
arXiv Detail & Related papers (2024-02-01T09:28:48Z) - Graph Domain Adaptation: Challenges, Progress and Prospects [61.9048172631524]
We propose graph domain adaptation as an effective knowledge-transfer paradigm across graphs.
GDA introduces a bunch of task-related graphs as source graphs and adapts the knowledge learnt from source graphs to the target graphs.
We outline the research status and challenges, propose a taxonomy, introduce the details of representative works, and discuss the prospects.
arXiv Detail & Related papers (2024-02-01T02:44:32Z) - Towards Data-centric Graph Machine Learning: Review and Outlook [120.64417630324378]
We introduce a systematic framework, Data-centric Graph Machine Learning (DC-GML), that encompasses all stages of the graph data lifecycle.
A thorough taxonomy of each stage is presented to answer three critical graph-centric questions.
We pinpoint the future prospects of the DC-GML domain, providing insights to navigate its advancements and applications.
arXiv Detail & Related papers (2023-09-20T00:40:13Z) - A Comprehensive Survey on Graph Summarization with Graph Neural Networks [21.337505372979066]
In the past, most graph summarization techniques sought to capture the most important part of a graph statistically.
Today, the high dimensionality and complexity of modern graph data are making deep learning techniques more popular.
Our investigation includes a review of the current state-of-the-art approaches, including recurrent GNNs, convolutional GNNs, graph autoencoders, and graph attention networks.
arXiv Detail & Related papers (2023-02-13T05:43:24Z) - State of the Art and Potentialities of Graph-level Learning [54.68482109186052]
Graph-level learning has been applied to many tasks including comparison, regression, classification, and more.
Traditional approaches to learning a set of graphs rely on hand-crafted features, such as substructures.
Deep learning has helped graph-level learning adapt to the growing scale of graphs by extracting features automatically and encoding graphs into low-dimensional representations.
arXiv Detail & Related papers (2023-01-14T09:15:49Z) - Data Augmentation on Graphs: A Technical Survey [22.15025123705249]
graph representation learning has achieved remarkable success while suffering from low-quality data problems.
Data augmentation has also attracted increasing attention in graph domain.
This survey provides a comprehensive review and overview of graph data augmentation techniques.
arXiv Detail & Related papers (2022-12-20T02:51:28Z) - Graph Pooling for Graph Neural Networks: Progress, Challenges, and
Opportunities [128.55790219377315]
Graph neural networks have emerged as a leading architecture for many graph-level tasks.
graph pooling is indispensable for obtaining a holistic graph-level representation of the whole graph.
arXiv Detail & Related papers (2022-04-15T04:02:06Z) - Data Augmentation for Deep Graph Learning: A Survey [66.04015540536027]
We first propose a taxonomy for graph data augmentation and then provide a structured review by categorizing the related work based on the augmented information modalities.
Focusing on the two challenging problems in DGL (i.e., optimal graph learning and low-resource graph learning), we also discuss and review the existing learning paradigms which are based on graph data augmentation.
arXiv Detail & Related papers (2022-02-16T18:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.