Entity Alignment For Knowledge Graphs: Progress, Challenges, and
Empirical Studies
- URL: http://arxiv.org/abs/2205.08777v1
- Date: Wed, 18 May 2022 07:59:03 GMT
- Title: Entity Alignment For Knowledge Graphs: Progress, Challenges, and
Empirical Studies
- Authors: Deepak Chaurasiya, Anil Surisetty, Nitish Kumar, Alok Singh, Vikrant
Dey, Aakarsh Malhotra, Gaurav Dhama and Ankur Arora
- Abstract summary: Entity alignment (EA) identifies entities across databases that refer to the same entity.
EA methods map entities to a low-dimension space and align them based on their similarities.
This paper presents a comprehensive analysis of various existing EA methods, elaborating their applications and limitations.
- Score: 4.221619479687067
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Entity Alignment (EA) identifies entities across databases that refer to the
same entity. Knowledge graph-based embedding methods have recently dominated EA
techniques. Such methods map entities to a low-dimension space and align them
based on their similarities. With the corpus of EA methodologies growing
rapidly, this paper presents a comprehensive analysis of various existing EA
methods, elaborating their applications and limitations. Further, we
distinguish the methods based on their underlying algorithms and the
information they incorporate to learn entity representations. Based on
challenges in industrial datasets, we bring forward $4$ research questions
(RQs). These RQs empirically analyse the algorithms from the perspective of
\textit{Hubness, Degree distribution, Non-isomorphic neighbourhood,} and
\textit{Name bias}. For Hubness, where one entity turns up as the nearest
neighbour of many other entities, we define an $h$-score to quantify its effect
on the performance of various algorithms. Additionally, we try to level the
playing field for algorithms that rely primarily on name-bias existing in the
benchmarking open-source datasets by creating a low name bias dataset. We
further create an open-source repository for $14$ embedding-based EA methods
and present the analysis for invoking further research motivations in the field
of EA.
Related papers
- DERA: Dense Entity Retrieval for Entity Alignment in Knowledge Graphs [3.500936203815729]
We propose a dense entity retrieval framework for Entity Alignment (EA)
We leverage language models to uniformly encode various features of entities and facilitate nearest entity search across Knowledge Graphs (KGs)
Our approach achieves state-of-the-art performance compared to existing EA methods.
arXiv Detail & Related papers (2024-08-02T10:12:42Z) - DiscoveryBench: Towards Data-Driven Discovery with Large Language Models [50.36636396660163]
We present DiscoveryBench, the first comprehensive benchmark that formalizes the multi-step process of data-driven discovery.
Our benchmark contains 264 tasks collected across 6 diverse domains, such as sociology and engineering.
Our benchmark, thus, illustrates the challenges in autonomous data-driven discovery and serves as a valuable resource for the community to make progress.
arXiv Detail & Related papers (2024-07-01T18:58:22Z) - A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z) - Toward Practical Entity Alignment Method Design: Insights from New
Highly Heterogeneous Knowledge Graph Datasets [32.68422342604253]
We study the performance of entity alignment (EA) methods in practical settings, specifically focusing on the alignment of highly heterogeneous KGs (HHKGs)
Our findings reveal that, in aligning HHKGs, valuable structure information can hardly be exploited through message-passing and aggregation mechanisms.
These findings shed light on the potential problems associated with the conventional application of GNN-based methods as a panacea for all EA datasets.
arXiv Detail & Related papers (2023-04-07T04:10:26Z) - Modeling Entities as Semantic Points for Visual Information Extraction
in the Wild [55.91783742370978]
We propose an alternative approach to precisely and robustly extract key information from document images.
We explicitly model entities as semantic points, i.e., center points of entities are enriched with semantic information describing the attributes and relationships of different entities.
The proposed method can achieve significantly enhanced performance on entity labeling and linking, compared with previous state-of-the-art models.
arXiv Detail & Related papers (2023-03-23T08:21:16Z) - EventEA: Benchmarking Entity Alignment for Event-centric Knowledge
Graphs [17.27027602556303]
We show that the progress made in the past was due to biased and unchallenging evaluation.
We construct a new dataset with heterogeneous relations and attributes based on event-centric KGs.
As a new approach to this difficult problem, we propose a time-aware literal encoder for entity alignment.
arXiv Detail & Related papers (2022-11-05T05:34:21Z) - Algorithmic Fairness Datasets: the Story so Far [68.45921483094705]
Data-driven algorithms are studied in diverse domains to support critical decisions, directly impacting people's well-being.
A growing community of researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of risks and opportunities of automated decision-making for historically disadvantaged populations.
Progress in fair Machine Learning hinges on data, which can be appropriately used only if adequately documented.
Unfortunately, the algorithmic fairness community suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available information (sparsity)
arXiv Detail & Related papers (2022-02-03T17:25:46Z) - Inferring Latent Domains for Unsupervised Deep Domain Adaptation [54.963823285456925]
Unsupervised Domain Adaptation (UDA) refers to the problem of learning a model in a target domain where labeled data are not available.
This paper introduces a novel deep architecture which addresses the problem of UDA by automatically discovering latent domains in visual datasets.
We evaluate our approach on publicly available benchmarks, showing that it outperforms state-of-the-art domain adaptation methods.
arXiv Detail & Related papers (2021-03-25T14:33:33Z) - Exploring and Evaluating Attributes, Values, and Structures for Entity
Alignment [100.19568734815732]
Entity alignment (EA) aims at building a unified Knowledge Graph (KG) of rich content by linking the equivalent entities from various KGs.
attribute triples can also provide crucial alignment signal but have not been well explored yet.
We propose to utilize an attributed value encoder and partition the KG into subgraphs to model the various types of attribute triples efficiently.
arXiv Detail & Related papers (2020-10-07T08:03:58Z) - A Benchmarking Study of Embedding-based Entity Alignment for Knowledge
Graphs [30.296238600596997]
Entity alignment seeks to find entities in different knowledge graphs that refer to the same real-world object.
Recent advancement in KG embedding impels the advent of embedding-based entity alignment.
We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics.
arXiv Detail & Related papers (2020-03-10T05:32:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.