Multi-label Node Classification On Graph-Structured Data
- URL: http://arxiv.org/abs/2304.10398v4
- Date: Thu, 29 Feb 2024 10:23:45 GMT
- Title: Multi-label Node Classification On Graph-Structured Data
- Authors: Tianqi Zhao, Ngan Thi Dong, Alan Hanjalic, Megha Khosla
- Abstract summary: Graph Neural Networks (GNNs) have shown state-of-the-art improvements in node classification tasks on graphs.
A more general and realistic scenario in which each node could have multiple labels has so far received little attention.
We collect and release three real-world biological datasets and develop a multi-label graph generator.
- Score: 7.892731722253387
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Graph Neural Networks (GNNs) have shown state-of-the-art improvements in node
classification tasks on graphs. While these improvements have been largely
demonstrated in a multi-class classification scenario, a more general and
realistic scenario in which each node could have multiple labels has so far
received little attention. The first challenge in conducting focused studies on
multi-label node classification is the limited number of publicly available
multi-label graph datasets. Therefore, as our first contribution, we collect
and release three real-world biological datasets and develop a multi-label
graph generator to generate datasets with tunable properties. While high label
similarity (high homophily) is usually attributed to the success of GNNs, we
argue that a multi-label scenario does not follow the usual semantics of
homophily and heterophily so far defined for a multi-class scenario. As our
second contribution, we define homophily and Cross-Class Neighborhood
Similarity for the multi-label scenario and provide a thorough analyses of the
collected $9$ multi-label datasets. Finally, we perform a large-scale
comparative study with $8$ methods and $9$ datasets and analyse the
performances of the methods to assess the progress made by current state of the
art in the multi-label node classification scenario. We release our benchmark
at https://github.com/Tianqi-py/MLGNC.
Related papers
- GNN-MultiFix: Addressing the pitfalls for GNNs for multi-label node classification [1.857645719601748]
Graph neural networks (GNNs) have emerged as powerful models for learning representations of graph data.
We show that even the most expressive GNN may fail to learn in absence of node attributes and without using explicit label information as input.
We propose a straightforward approach, referred to as GNN-MultiFix, that integrates the feature, label, and positional information of a node.
arXiv Detail & Related papers (2024-11-21T12:59:39Z) - A data-centric approach for assessing progress of Graph Neural Networks [7.2249434861826325]
Graph Neural Networks (GNNs) have achieved state-of-the-art results in node classification tasks.
Most improvements are in multi-class classification, with less focus on the cases where each node could have multiple labels.
First challenge in studying multi-label node classification is the scarcity of publicly available datasets.
arXiv Detail & Related papers (2024-06-18T09:41:40Z) - Generalized Category Discovery with Clustering Assignment Consistency [56.92546133591019]
Generalized category discovery (GCD) is a recently proposed open-world task.
We propose a co-training-based framework that encourages clustering consistency.
Our method achieves state-of-the-art performance on three generic benchmarks and three fine-grained visual recognition datasets.
arXiv Detail & Related papers (2023-10-30T00:32:47Z) - KMF: Knowledge-Aware Multi-Faceted Representation Learning for Zero-Shot
Node Classification [75.95647590619929]
Zero-Shot Node Classification (ZNC) has been an emerging and crucial task in graph data analysis.
We propose a Knowledge-Aware Multi-Faceted framework (KMF) that enhances the richness of label semantics.
A novel geometric constraint is developed to alleviate the problem of prototype drift caused by node information aggregation.
arXiv Detail & Related papers (2023-08-15T02:38:08Z) - Type-supervised sequence labeling based on the heterogeneous star graph
for named entity recognition [6.25916397918329]
The representation learning of the heterogeneous star graph containing text nodes and type nodes is investigated in this paper.
The model performs the type-supervised sequence labeling after updating nodes in the graph.
Experiments on public NER datasets reveal the effectiveness of our model in extracting both flat and nested entities.
arXiv Detail & Related papers (2022-10-19T01:40:06Z) - Graph Attention Transformer Network for Multi-Label Image Classification [50.0297353509294]
We propose a general framework for multi-label image classification that can effectively mine complex inter-label relationships.
Our proposed methods can achieve state-of-the-art performance on three datasets.
arXiv Detail & Related papers (2022-03-08T12:39:05Z) - SHGNN: Structure-Aware Heterogeneous Graph Neural Network [77.78459918119536]
This paper proposes a novel Structure-Aware Heterogeneous Graph Neural Network (SHGNN) to address the above limitations.
We first utilize a feature propagation module to capture the local structure information of intermediate nodes in the meta-path.
Next, we use a tree-attention aggregator to incorporate the graph structure information into the aggregation module on the meta-path.
Finally, we leverage a meta-path aggregator to fuse the information aggregated from different meta-paths.
arXiv Detail & Related papers (2021-12-12T14:18:18Z) - Heterogeneous Graph Neural Networks for Multi-label Text Classification [5.290920289670573]
Multi-label text classification (MLTC) is an attractive and challenging task in natural language processing (NLP)
We propose a heterogeneous graph convolutional network model to solve the MLTC problem by modeling tokens and labels as nodes in a heterogeneous graph.
We evaluate our method on three real-world datasets and the experimental results show that it achieves significant improvements and outperforms state-of-the-art comparison methods.
arXiv Detail & Related papers (2021-03-26T17:33:31Z) - Generative Multi-Label Zero-Shot Learning [136.17594611722285]
Multi-label zero-shot learning strives to classify images into multiple unseen categories for which no data is available during training.
Our work is the first to tackle the problem of multi-label feature in the (generalized) zero-shot setting.
Our cross-level fusion-based generative approach outperforms the state-of-the-art on all three datasets.
arXiv Detail & Related papers (2021-01-27T18:56:46Z) - GNN-XML: Graph Neural Networks for Extreme Multi-label Text
Classification [23.79498916023468]
Extreme multi-label text classification (XMTC) aims to tag a text instance with the most relevant subset of labels from an extremely large label set.
GNN-XML is a scalable graph neural network framework tailored for XMTC problems.
arXiv Detail & Related papers (2020-12-10T18:18:34Z) - Knowledge-Guided Multi-Label Few-Shot Learning for General Image
Recognition [75.44233392355711]
KGGR framework exploits prior knowledge of statistical label correlations with deep neural networks.
It first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence.
Then, it introduces the label semantics to guide learning semantic-specific features.
It exploits a graph propagation network to explore graph node interactions.
arXiv Detail & Related papers (2020-09-20T15:05:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.