Taxonomy Enrichment with Text and Graph Vector Representations
- URL: http://arxiv.org/abs/2201.08598v1
- Date: Fri, 21 Jan 2022 09:01:12 GMT
- Title: Taxonomy Enrichment with Text and Graph Vector Representations
- Authors: Irina Nikishina, Mikhail Tikhomirov, Varvara Logacheva, Yuriy Nazarov,
Alexander Panchenko, Natalia Loukachevitch
- Abstract summary: We address the problem of taxonomy enrichment which aims at adding new words to the existing taxonomy.
We present a new method that allows achieving high results on this task with little effort.
We achieve state-of-the-art results across different datasets and provide an in-depth error analysis of mistakes.
- Score: 61.814256012166794
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Knowledge graphs such as DBpedia, Freebase or Wikidata always contain a
taxonomic backbone that allows the arrangement and structuring of various
concepts in accordance with the hypo-hypernym ("class-subclass") relationship.
With the rapid growth of lexical resources for specific domains, the problem of
automatic extension of the existing knowledge bases with new words is becoming
more and more widespread. In this paper, we address the problem of taxonomy
enrichment which aims at adding new words to the existing taxonomy.
We present a new method that allows achieving high results on this task with
little effort. It uses the resources which exist for the majority of languages,
making the method universal. We extend our method by incorporating deep
representations of graph structures like node2vec, Poincar\'e embeddings, GCN
etc. that have recently demonstrated promising results on various NLP tasks.
Furthermore, combining these representations with word embeddings allows us to
beat the state of the art.
We conduct a comprehensive study of the existing approaches to taxonomy
enrichment based on word and graph vector representations and their fusion
approaches. We also explore the ways of using deep learning architectures to
extend the taxonomic backbones of knowledge graphs. We create a number of
datasets for taxonomy extension for English and Russian. We achieve
state-of-the-art results across different datasets and provide an in-depth
error analysis of mistakes.
Related papers
- CodeTaxo: Enhancing Taxonomy Expansion with Limited Examples via Code Language Prompts [40.52605902842168]
textscCodeTaxo is a novel approach that leverages large language models through code language prompts to capture the taxonomic structure.
Experiments on five real-world benchmarks from different domains demonstrate that textscCodeTaxo consistently achieves superior performance across all evaluation metrics.
arXiv Detail & Related papers (2024-08-17T02:15:07Z) - Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning [74.6485604326913]
We provide a new semantic enhanced knowledge graph that contains both expert knowledge and categories semantic correlation.
To propagate information on the knowledge graph, we propose a novel Residual Graph Convolutional Network (ResGCN)
Experiments conducted on the widely used large-scale ImageNet-21K dataset and AWA2 dataset show the effectiveness of our method.
arXiv Detail & Related papers (2022-12-26T13:18:36Z) - Joint Language Semantic and Structure Embedding for Knowledge Graph
Completion [66.15933600765835]
We propose to jointly embed the semantics in the natural language description of the knowledge triplets with their structure information.
Our method embeds knowledge graphs for the completion task via fine-tuning pre-trained language models.
Our experiments on a variety of knowledge graph benchmarks have demonstrated the state-of-the-art performance of our method.
arXiv Detail & Related papers (2022-09-19T02:41:02Z) - Sparse Structure Learning via Graph Neural Networks for Inductive
Document Classification [2.064612766965483]
We propose a novel GNN-based sparse structure learning model for inductive document classification.
Our model collects a set of trainable edges connecting disjoint words between sentences and employs structure learning to sparsely select edges with dynamic contextual dependencies.
Experiments on several real-world datasets demonstrate that the proposed model outperforms most state-of-the-art results.
arXiv Detail & Related papers (2021-12-13T02:36:04Z) - Enhancing Taxonomy Completion with Concept Generation via Fusing
Relational Representations [41.629471011165684]
Existing taxonomy expansion or completion methods assume that new concepts have been accurately extracted and their embedding vectors learned from the text corpus.
We propose GenTaxo to enhance taxonomy by identifying positions that need new concepts and then generating appropriate concept names.
Instead of relying on the corpus for concept embeddings, GenTaxo learns the contextual embeddings from their surrounding graph-based and language-based information, and leverages the corpus for pre-training a concept name generator.
arXiv Detail & Related papers (2021-06-05T21:50:13Z) - Entity Context Graph: Learning Entity Representations
fromSemi-Structured Textual Sources on the Web [44.92858943475407]
We propose an approach that processes entity centric textual knowledge sources to learn entity embeddings.
We show that the embeddings learned from our approach are: (i) high quality and comparable to a known knowledge graph-based embeddings and can be used to improve them further.
arXiv Detail & Related papers (2021-03-29T20:52:14Z) - Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer [140.72439827136085]
We propose a graph reasoning and transfer learning framework named "Graphonomy"
It incorporates human knowledge and label taxonomy into the intermediate graph representation learning beyond local convolutions.
It learns the global and structured semantic coherency in multiple domains via semantic-aware graph reasoning and transfer.
arXiv Detail & Related papers (2021-01-26T08:19:03Z) - Studying Taxonomy Enrichment on Diachronic WordNet Versions [70.27072729280528]
We explore the possibilities of taxonomy extension in a resource-poor setting and present methods which are applicable to a large number of languages.
We create novel English and Russian datasets for training and evaluating taxonomy enrichment models and describe a technique of creating such datasets for other languages.
arXiv Detail & Related papers (2020-11-23T16:49:37Z) - Exploiting Structured Knowledge in Text via Graph-Guided Representation
Learning [73.0598186896953]
We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs.
Building upon entity-level masked language models, our first contribution is an entity masking scheme.
In contrast to existing paradigms, our approach uses knowledge graphs implicitly, only during pre-training.
arXiv Detail & Related papers (2020-04-29T14:22:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.