A Benchmarking Study of Embedding-based Entity Alignment for Knowledge
Graphs
- URL: http://arxiv.org/abs/2003.07743v2
- Date: Mon, 20 Jul 2020 00:47:26 GMT
- Title: A Benchmarking Study of Embedding-based Entity Alignment for Knowledge
Graphs
- Authors: Zequn Sun and Qingheng Zhang and Wei Hu and Chengming Wang and Muhao
Chen and Farahnaz Akrami and Chengkai Li
- Abstract summary: Entity alignment seeks to find entities in different knowledge graphs that refer to the same real-world object.
Recent advancement in KG embedding impels the advent of embedding-based entity alignment.
We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics.
- Score: 30.296238600596997
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Entity alignment seeks to find entities in different knowledge graphs (KGs)
that refer to the same real-world object. Recent advancement in KG embedding
impels the advent of embedding-based entity alignment, which encodes entities
in a continuous embedding space and measures entity similarities based on the
learned embeddings. In this paper, we conduct a comprehensive experimental
study of this emerging field. We survey 23 recent embedding-based entity
alignment approaches and categorize them based on their techniques and
characteristics. We also propose a new KG sampling algorithm, with which we
generate a set of dedicated benchmark datasets with various heterogeneity and
distributions for a realistic evaluation. We develop an open-source library
including 12 representative embedding-based entity alignment approaches, and
extensively evaluate these approaches, to understand their strengths and
limitations. Additionally, for several directions that have not been explored
in current approaches, we perform exploratory experiments and report our
preliminary findings for future studies. The benchmark datasets, open-source
library and experimental results are all accessible online and will be duly
maintained.
Related papers
- Modeling Entities as Semantic Points for Visual Information Extraction
in the Wild [55.91783742370978]
We propose an alternative approach to precisely and robustly extract key information from document images.
We explicitly model entities as semantic points, i.e., center points of entities are enriched with semantic information describing the attributes and relationships of different entities.
The proposed method can achieve significantly enhanced performance on entity labeling and linking, compared with previous state-of-the-art models.
arXiv Detail & Related papers (2023-03-23T08:21:16Z) - DC-BENCH: Dataset Condensation Benchmark [79.18718490863908]
This work provides the first large-scale standardized benchmark on dataset condensation.
It consists of a suite of evaluations to comprehensively reflect the generability and effectiveness of condensation methods.
The benchmark library is open-sourced to facilitate future research and application.
arXiv Detail & Related papers (2022-07-20T03:54:05Z) - Entity Alignment For Knowledge Graphs: Progress, Challenges, and
Empirical Studies [4.221619479687067]
Entity alignment (EA) identifies entities across databases that refer to the same entity.
EA methods map entities to a low-dimension space and align them based on their similarities.
This paper presents a comprehensive analysis of various existing EA methods, elaborating their applications and limitations.
arXiv Detail & Related papers (2022-05-18T07:59:03Z) - Knowledge Graph Embedding Methods for Entity Alignment: An Experimental
Review [7.241438112282638]
We conduct the first meta-level analysis of popular embedding methods for entity alignment.
Our analysis reveals statistically significant correlations of different embedding methods with various meta-features extracted by KGs.
We rank them in a statistically significant way according to their effectiveness across all real-world KGs of our testbed.
arXiv Detail & Related papers (2022-03-17T12:11:58Z) - Informed Multi-context Entity Alignment [27.679124991733907]
We propose an Informed Multi-context Entity Alignment (IMEA) model to address these issues.
In particular, we introduce Transformer to flexibly capture the relation, path, and neighborhood contexts.
holistic reasoning is used to estimate alignment probabilities based on both embedding similarity and the relation/entity functionality.
Results on several benchmark datasets demonstrate the superiority of our IMEA model compared with existing state-of-the-art entity alignment methods.
arXiv Detail & Related papers (2022-01-02T06:29:30Z) - Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z) - A Survey of Embedding Space Alignment Methods for Language and Knowledge
Graphs [77.34726150561087]
We survey the current research landscape on word, sentence and knowledge graph embedding algorithms.
We provide a classification of the relevant alignment techniques and discuss benchmark datasets used in this field of research.
arXiv Detail & Related papers (2020-10-26T16:08:13Z) - An Industry Evaluation of Embedding-based Entity Alignment [38.76701634692796]
Embedding-based entity alignment has been widely investigated in recent years, but most proposed methods still rely on an ideal supervised learning setting.
We evaluate those state-of-the-art methods in an industrial context, where the impact of seed mappings with different sizes and different biases is explored.
arXiv Detail & Related papers (2020-10-22T08:33:58Z) - CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural
Summarization Systems [121.78477833009671]
We investigate the performance of different summarization models under a cross-dataset setting.
A comprehensive study of 11 representative summarization systems on 5 datasets from different domains reveals the effect of model architectures and generation ways.
arXiv Detail & Related papers (2020-10-11T02:19:15Z) - ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine
Reading Comprehension [53.037401638264235]
We present an evaluation server, ORB, that reports performance on seven diverse reading comprehension datasets.
The evaluation server places no restrictions on how models are trained, so it is a suitable test bed for exploring training paradigms and representation learning.
arXiv Detail & Related papers (2019-12-29T07:27:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.