GCN-Based Linkage Prediction for Face Clustering on Imbalanced Datasets:
An Empirical Study
- URL: http://arxiv.org/abs/2107.02477v2
- Date: Wed, 7 Jul 2021 08:08:15 GMT
- Title: GCN-Based Linkage Prediction for Face Clustering on Imbalanced Datasets:
An Empirical Study
- Authors: Huafeng Yang, Xingjian Chen, Fangyi Zhang, Guangyue Hei, Yunjie Wang
and Rong Du
- Abstract summary: We present a new method to alleviate the imbalanced labels and also augment graph representations using a Reverse-Imbalance Weighted Sampling strategy.
The code and a series of imbalanced benchmark datasets are available at https://github.com/espectre/GCNs_on_imbalanced_datasets.
- Score: 5.416933126354173
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, benefiting from the expressive power of Graph Convolutional
Networks (GCNs), significant breakthroughs have been made in face clustering.
However, rare attention has been paid to GCN-based clustering on imbalanced
data. Although imbalance problem has been extensively studied, the impact of
imbalanced data on GCN-based linkage prediction task is quite different, which
would cause problems in two aspects: imbalanced linkage labels and biased graph
representations. The problem of imbalanced linkage labels is similar to that in
image classification task, but the latter is a particular problem in GCN-based
clustering via linkage prediction. Significantly biased graph representations
in training can cause catastrophic overfitting of a GCN model. To tackle these
problems, we evaluate the feasibility of those existing methods for imbalanced
image classification problem on graphs with extensive experiments, and present
a new method to alleviate the imbalanced labels and also augment graph
representations using a Reverse-Imbalance Weighted Sampling (RIWS) strategy,
followed with insightful analyses and discussions. The code and a series of
imbalanced benchmark datasets synthesized from MS-Celeb-1M and DeepFashion are
available on https://github.com/espectre/GCNs_on_imbalanced_datasets.
Related papers
- HeNCler: Node Clustering in Heterophilous Graphs through Learned Asymmetric Similarity [55.27586970082595]
HeNCler is a novel approach for Heterophilous Node Clustering.
We show that HeNCler significantly enhances performance in node clustering tasks within heterophilous graph contexts.
arXiv Detail & Related papers (2024-05-27T11:04:05Z) - Deep Contrastive Graph Learning with Clustering-Oriented Guidance [61.103996105756394]
Graph Convolutional Network (GCN) has exhibited remarkable potential in improving graph-based clustering.
Models estimate an initial graph beforehand to apply GCN.
Deep Contrastive Graph Learning (DCGL) model is proposed for general data clustering.
arXiv Detail & Related papers (2024-02-25T07:03:37Z) - Learning to Reweight for Graph Neural Network [63.978102332612906]
Graph Neural Networks (GNNs) show promising results for graph tasks.
Existing GNNs' generalization ability will degrade when there exist distribution shifts between testing and training graph data.
We propose a novel nonlinear graph decorrelation method, which can substantially improve the out-of-distribution generalization ability.
arXiv Detail & Related papers (2023-12-19T12:25:10Z) - Rethinking Semi-Supervised Imbalanced Node Classification from
Bias-Variance Decomposition [18.3055496602884]
This paper introduces a new approach to address the issue of class imbalance in graph neural networks (GNNs) for learning on graph-structured data.
Our approach integrates imbalanced node classification and Bias-Variance Decomposition, establishing a theoretical framework that closely relates data imbalance to model variance.
arXiv Detail & Related papers (2023-10-28T17:28:07Z) - Heterophily-Based Graph Neural Network for Imbalanced Classification [19.51668009720269]
We introduce a unique approach that tackles imbalanced classification on graphs by considering graph heterophily.
We propose Fast Im-GBK, which integrates an imbalance classification strategy with heterophily-aware GNNs.
Our experiments on real-world graphs demonstrate our model's superiority in classification performance and efficiency for node classification tasks.
arXiv Detail & Related papers (2023-10-12T21:19:47Z) - ImGCL: Revisiting Graph Contrastive Learning on Imbalanced Node
Classification [26.0350727426613]
Graph contrastive learning (GCL) has attracted a surge of attention due to its superior performance for learning node/graph representations without labels.
In practice, the underlying class distribution of unlabeled nodes for the given graph is usually imbalanced.
We propose a principled GCL framework on Imbalanced node classification (ImGCL), which automatically and adaptively balances the representations learned from GCL without labels.
arXiv Detail & Related papers (2022-05-23T14:23:36Z) - Generalizing Graph Neural Networks on Out-Of-Distribution Graphs [51.33152272781324]
Graph Neural Networks (GNNs) are proposed without considering the distribution shifts between training and testing graphs.
In such a setting, GNNs tend to exploit subtle statistical correlations existing in the training set for predictions, even though it is a spurious correlation.
We propose a general causal representation framework, called StableGNN, to eliminate the impact of spurious correlations.
arXiv Detail & Related papers (2021-11-20T18:57:18Z) - Topology-Imbalance Learning for Semi-Supervised Node Classification [34.964665078512596]
We argue that graph data expose a unique source of imbalance from the asymmetric topological properties of the labeled nodes.
We devise an influence conflict detection -- based metric Totoro to measure the degree of graph topology imbalance.
We propose a model-agnostic method ReNode to address the topology-imbalance issue.
arXiv Detail & Related papers (2021-10-08T12:57:38Z) - AdaGCN:Adaptive Boosting Algorithm for Graph Convolutional Networks on
Imbalanced Node Classification [10.72543417177307]
We propose an ensemble model called AdaGCN, which uses a Graph Convolutional Network (GCN) as the base estimator during adaptive boosting.
Our model also improves state-of-the-art baselines on all of the challenging node classification tasks we consider.
arXiv Detail & Related papers (2021-05-25T02:43:31Z) - Graph Classification by Mixture of Diverse Experts [67.33716357951235]
We present GraphDIVE, a framework leveraging mixture of diverse experts for imbalanced graph classification.
With a divide-and-conquer principle, GraphDIVE employs a gating network to partition an imbalanced graph dataset into several subsets.
Experiments on real-world imbalanced graph datasets demonstrate the effectiveness of GraphDIVE.
arXiv Detail & Related papers (2021-03-29T14:03:03Z) - Contrastive and Generative Graph Convolutional Networks for Graph-based
Semi-Supervised Learning [64.98816284854067]
Graph-based Semi-Supervised Learning (SSL) aims to transfer the labels of a handful of labeled data to the remaining massive unlabeled data via a graph.
A novel GCN-based SSL algorithm is presented in this paper to enrich the supervision signals by utilizing both data similarities and graph structure.
arXiv Detail & Related papers (2020-09-15T13:59:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.