Classifying Wikipedia in a fine-grained hierarchy: what graphs can
contribute
- URL: http://arxiv.org/abs/2001.07558v2
- Date: Wed, 22 Jan 2020 08:24:59 GMT
- Title: Classifying Wikipedia in a fine-grained hierarchy: what graphs can
contribute
- Authors: Tiphaine Viard, Thomas McLachlan, Hamidreza Ghader, Satoshi Sekine
- Abstract summary: We address the task of integrating graph (i.e. structure) information to classify Wikipedia into a fine-grained named entity ontology (NE)
We conduct at-scale practical experiments, on a manually labeled subset of 22,000 pages extracted from the Japanese Wikipedia.
Our results show that integrating graph information succeeds at reducing sparsity of the input feature space, and yields classification results that are comparable or better than previous works.
- Score: 0.5530212768657543
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Wikipedia is a huge opportunity for machine learning, being the largest
semi-structured base of knowledge available. Because of this, many works
examine its contents, and focus on structuring it in order to make it usable in
learning tasks, for example by classifying it into an ontology. Beyond its
textual contents, Wikipedia also displays a typical graph structure, where
pages are linked together through citations. In this paper, we address the task
of integrating graph (i.e. structure) information to classify Wikipedia into a
fine-grained named entity ontology (NE), the Extended Named Entity hierarchy.
To address this task, we first start by assessing the relevance of the graph
structure for NE classification. We then explore two directions, one related to
feature vectors using graph descriptors commonly used in large-scale network
analysis, and one extending flat classification to a weighted model taking into
account semantic similarity. We conduct at-scale practical experiments, on a
manually labeled subset of 22,000 pages extracted from the Japanese Wikipedia.
Our results show that integrating graph information succeeds at reducing
sparsity of the input feature space, and yields classification results that are
comparable or better than previous works.
Related papers
- Coreference Graph Guidance for Mind-Map Generation [5.289044688419791]
Recently, a state-of-the-art method encodes the sentences of a document sequentially and converts them to a relation graph via sequence-to-graph.
We propose a coreference-guided mind-map generation network (CMGN) to incorporate external structure knowledge.
arXiv Detail & Related papers (2023-12-19T09:39:27Z) - Graph Few-shot Learning with Task-specific Structures [38.52226241144403]
Existing graph few-shot learning methods typically leverage Graph Neural Networks (GNNs)
We propose a novel framework that learns a task-specific structure for each meta-task.
In this way, we can learn node representations with the task-specific structure tailored for each meta-task.
arXiv Detail & Related papers (2022-10-21T17:40:21Z) - Joint Language Semantic and Structure Embedding for Knowledge Graph
Completion [66.15933600765835]
We propose to jointly embed the semantics in the natural language description of the knowledge triplets with their structure information.
Our method embeds knowledge graphs for the completion task via fine-tuning pre-trained language models.
Our experiments on a variety of knowledge graph benchmarks have demonstrated the state-of-the-art performance of our method.
arXiv Detail & Related papers (2022-09-19T02:41:02Z) - Semi-Supervised Hierarchical Graph Classification [54.25165160435073]
We study the node classification problem in the hierarchical graph where a 'node' is a graph instance.
We propose the Hierarchical Graph Mutual Information (HGMI) and present a way to compute HGMI with theoretical guarantee.
We demonstrate the effectiveness of this hierarchical graph modeling and the proposed SEAL-CI method on text and social network data.
arXiv Detail & Related papers (2022-06-11T04:05:29Z) - One-shot Scene Graph Generation [130.57405850346836]
We propose Multiple Structured Knowledge (Relational Knowledgesense Knowledge) for the one-shot scene graph generation task.
Our method significantly outperforms existing state-of-the-art methods by a large margin.
arXiv Detail & Related papers (2022-02-22T11:32:59Z) - Taxonomy Enrichment with Text and Graph Vector Representations [61.814256012166794]
We address the problem of taxonomy enrichment which aims at adding new words to the existing taxonomy.
We present a new method that allows achieving high results on this task with little effort.
We achieve state-of-the-art results across different datasets and provide an in-depth error analysis of mistakes.
arXiv Detail & Related papers (2022-01-21T09:01:12Z) - Sparse Structure Learning via Graph Neural Networks for Inductive
Document Classification [2.064612766965483]
We propose a novel GNN-based sparse structure learning model for inductive document classification.
Our model collects a set of trainable edges connecting disjoint words between sentences and employs structure learning to sparsely select edges with dynamic contextual dependencies.
Experiments on several real-world datasets demonstrate that the proposed model outperforms most state-of-the-art results.
arXiv Detail & Related papers (2021-12-13T02:36:04Z) - SHGNN: Structure-Aware Heterogeneous Graph Neural Network [77.78459918119536]
This paper proposes a novel Structure-Aware Heterogeneous Graph Neural Network (SHGNN) to address the above limitations.
We first utilize a feature propagation module to capture the local structure information of intermediate nodes in the meta-path.
Next, we use a tree-attention aggregator to incorporate the graph structure information into the aggregation module on the meta-path.
Finally, we leverage a meta-path aggregator to fuse the information aggregated from different meta-paths.
arXiv Detail & Related papers (2021-12-12T14:18:18Z) - Graph Classification Based on Skeleton and Component Features [9.681154082075698]
We propose a novel graph embedding algorithm named GraphCSC that realizes classification based on skeleton information.
Two graphs are similar if their skeletons and components are both similar, so in our model, we integrate both of them together into embeddings as graph homogeneity characterization.
We demonstrate our model on different datasets in comparison with a comprehensive list of up-to-date state-of-the-art baselines, and experiments show that our work is superior in real-world graph classification tasks.
arXiv Detail & Related papers (2021-02-02T10:52:17Z) - HOSE-Net: Higher Order Structure Embedded Network for Scene Graph
Generation [20.148175528691905]
This paper presents a novel structure-aware embedding-to-classifier(SEC) module to incorporate both local and global structural information of relationships into the output space.
We also propose a hierarchical semantic aggregation(HSA) module to reduce the number of subspaces by introducing higher order structural information.
The proposed HOSE-Net achieves the state-of-the-art performance on two popular benchmarks of Visual Genome and VRD.
arXiv Detail & Related papers (2020-08-12T07:58:13Z) - Structure-Augmented Text Representation Learning for Efficient Knowledge
Graph Completion [53.31911669146451]
Human-curated knowledge graphs provide critical supportive information to various natural language processing tasks.
These graphs are usually incomplete, urging auto-completion of them.
graph embedding approaches, e.g., TransE, learn structured knowledge via representing graph elements into dense embeddings.
textual encoding approaches, e.g., KG-BERT, resort to graph triple's text and triple-level contextualized representations.
arXiv Detail & Related papers (2020-04-30T13:50:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.