ActiveEA: Active Learning for Neural Entity Alignment
- URL: http://arxiv.org/abs/2110.06474v1
- Date: Wed, 13 Oct 2021 03:38:04 GMT
- Title: ActiveEA: Active Learning for Neural Entity Alignment
- Authors: Bing Liu, Harrisen Scells, Guido Zuccon, Wen Hua, Genghong Zhao
- Abstract summary: Entity alignment (EA) aims to match equivalent entities across different Knowledge Graphs (KGs)
Current mainstream methods -- neural EA models -- rely on training with seed alignment, i.e., a set of pre-aligned entity pairs.
We devise a novel Active Learning (AL) framework for neural EA, aiming to create highly informative seed alignment.
- Score: 31.212894129845093
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Entity Alignment (EA) aims to match equivalent entities across different
Knowledge Graphs (KGs) and is an essential step of KG fusion. Current
mainstream methods -- neural EA models -- rely on training with seed alignment,
i.e., a set of pre-aligned entity pairs which are very costly to annotate. In
this paper, we devise a novel Active Learning (AL) framework for neural EA,
aiming to create highly informative seed alignment to obtain more effective EA
models with less annotation cost. Our framework tackles two main challenges
encountered when applying AL to EA: (1) How to exploit dependencies between
entities within the AL strategy. Most AL strategies assume that the data
instances to sample are independent and identically distributed. However,
entities in KGs are related. To address this challenge, we propose a
structure-aware uncertainty sampling strategy that can measure the uncertainty
of each entity as well as its impact on its neighbour entities in the KG. (2)
How to recognise entities that appear in one KG but not in the other KG (i.e.,
bachelors). Identifying bachelors would likely save annotation budget. To
address this challenge, we devise a bachelor recognizer paying attention to
alleviate the effect of sampling bias. Empirical results show that our proposed
AL strategy can significantly improve sampling quality with good generality
across different datasets, EA models and amount of bachelors.
Related papers
- Entity Alignment with Unlabeled Dangling Cases [49.86384156476041]
We propose a novel GNN-based dangling detection and entity alignment framework.
While the two tasks share the same GNN, the detected dangling entities are removed in the alignment.
Our framework is featured by a designed entity and relation attention mechanism for selective neighborhood aggregation in representation learning.
arXiv Detail & Related papers (2024-03-16T17:21:58Z) - One-bit Supervision for Image Classification: Problem, Solution, and
Beyond [114.95815360508395]
This paper presents one-bit supervision, a novel setting of learning with fewer labels, for image classification.
We propose a multi-stage training paradigm and incorporate negative label suppression into an off-the-shelf semi-supervised learning algorithm.
In multiple benchmarks, the learning efficiency of the proposed approach surpasses that using full-bit, semi-supervised supervision.
arXiv Detail & Related papers (2023-11-26T07:39:00Z) - LightEA: A Scalable, Robust, and Interpretable Entity Alignment
Framework via Three-view Label Propagation [27.483109233276632]
We argue that existing GNN-based EA methods inherit the inborn defects from their neural network lineage: weak scalability and poor interpretability.
We propose a non-neural EA framework -- LightEA, consisting of three efficient components: (i) Random Orthogonal Label Generation, (ii) Three-view Label propagation, and (iii) Sparse Sinkhorn Iteration.
According to the extensive experiments on public datasets, LightEA has impressive scalability, robustness, and interpretability.
arXiv Detail & Related papers (2022-10-19T10:07:08Z) - Deep Active Ensemble Sampling For Image Classification [8.31483061185317]
Active learning frameworks aim to reduce the cost of data annotation by actively requesting the labeling for the most informative data points.
Some proposed approaches include uncertainty-based techniques, geometric methods, implicit combination of uncertainty-based and geometric approaches.
We present an innovative integration of recent progress in both uncertainty-based and geometric frameworks to enable an efficient exploration/exploitation trade-off in sample selection strategy.
Our framework provides two advantages: (1) accurate posterior estimation, and (2) tune-able trade-off between computational overhead and higher accuracy.
arXiv Detail & Related papers (2022-10-11T20:20:20Z) - Be Your Own Neighborhood: Detecting Adversarial Example by the
Neighborhood Relations Built on Self-Supervised Learning [64.78972193105443]
This paper presents a novel AE detection framework, named trustworthy for predictions.
performs the detection by distinguishing the AE's abnormal relation with its augmented versions.
An off-the-shelf Self-Supervised Learning (SSL) model is used to extract the representation and predict the label.
arXiv Detail & Related papers (2022-08-31T08:18:44Z) - Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product
Retrieval [152.3504607706575]
This research aims to conduct weakly-supervised multi-modal instance-level product retrieval for fine-grained product categories.
We first contribute the Product1M datasets, and define two real practical instance-level retrieval tasks.
We exploit to train a more effective cross-modal model which is adaptively capable of incorporating key concept information from the multi-modal data.
arXiv Detail & Related papers (2022-06-17T15:40:45Z) - ICLEA: Interactive Contrastive Learning for Self-supervised Entity
Alignment [27.449414854756913]
Self-supervised entity alignment (EA) aims to link equivalent entities across different knowledge graphs (KGs) without seed alignments.
The current SOTA self-supervised EA method draws inspiration from contrastive learning, originally designed in computer vision.
We propose an interactive contrastive learning model for self-supervised EA.
arXiv Detail & Related papers (2022-01-17T06:04:00Z) - EchoEA: Echo Information between Entities and Relations for Entity
Alignment [1.1470070927586016]
We propose a novel framework, Echo Entity Alignment (EchoEA), which leverages self-attention mechanism to spread entity information to relations and echo back to entities.
The experimental results on three real-world cross-lingual datasets are stable at around 96% at hits@1 on average.
arXiv Detail & Related papers (2021-07-07T07:34:21Z) - Towards Entity Alignment in the Open World: An Unsupervised Approach [29.337157862514204]
It is a pivotal step for integrating knowledge graphs (KGs) to increase knowledge coverage and quality.
State-of-the-art solutions tend to rely on labeled data for model training.
We offer an unsupervised framework that performs entity alignment in the open world.
arXiv Detail & Related papers (2021-01-26T03:10:24Z) - Exploring and Evaluating Attributes, Values, and Structures for Entity
Alignment [100.19568734815732]
Entity alignment (EA) aims at building a unified Knowledge Graph (KG) of rich content by linking the equivalent entities from various KGs.
attribute triples can also provide crucial alignment signal but have not been well explored yet.
We propose to utilize an attributed value encoder and partition the KG into subgraphs to model the various types of attribute triples efficiently.
arXiv Detail & Related papers (2020-10-07T08:03:58Z) - Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization [53.99850033746663]
We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of pairwise similarity function.
arXiv Detail & Related papers (2020-03-18T17:53:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.