Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail
Relation Extraction with Distant Supervision
- URL: http://arxiv.org/abs/2109.09036v1
- Date: Sun, 19 Sep 2021 00:46:57 GMT
- Title: Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail
Relation Extraction with Distant Supervision
- Authors: Yang Li, Guodong Long, Tao Shen, Jing Jiang
- Abstract summary: We propose a novel model to enrich distantly-supervised sentences with entity types.
It consists of (1) a pairwise type-enriched sentence encoding module injecting both context-free and -related backgrounds to alleviate sentence-level wrong labeling, and (2) a hierarchical type-sentence alignment module enriching a sentence with the triple fact's basic attributes to support long-tail relations.
- Score: 33.641404898790405
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Distant supervision uses triple facts in knowledge graphs to label a corpus
for relation extraction, leading to wrong labeling and long-tail problems. Some
works use the hierarchy of relations for knowledge transfer to long-tail
relations. However, a coarse-grained relation often implies only an attribute
(e.g., domain or topic) of the distant fact, making it hard to discriminate
relations based solely on sentence semantics. One solution is resorting to
entity types, but open questions remain about how to fully leverage the
information of entity types and how to align multi-granular entity types with
sentences. In this work, we propose a novel model to enrich
distantly-supervised sentences with entity types. It consists of (1) a pairwise
type-enriched sentence encoding module injecting both context-free and -related
backgrounds to alleviate sentence-level wrong labeling, and (2) a hierarchical
type-sentence alignment module enriching a sentence with the triple fact's
basic attributes to support long-tail relations. Our model achieves new
state-of-the-art results in overall and long-tail performance on benchmarks.
Related papers
- Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification [20.434941308959786]
Long document classification presents challenges due to their extensive content and complex structure.
Existing methods often struggle with token limits and fail to adequately model hierarchical relationships within documents.
Our approach integrates syntax trees for sentence encodings and document graphs for document encodings, which capture fine-grained syntactic relationships and broader document contexts.
arXiv Detail & Related papers (2024-10-03T19:25:01Z) - Multi-Label Classification for Implicit Discourse Relation Recognition [10.280148603465697]
We explore various multi-label classification frameworks to handle implicit discourse relation recognition.
We show that multi-label classification methods don't depress performance for single-label prediction.
arXiv Detail & Related papers (2024-06-06T19:37:25Z) - More than Classification: A Unified Framework for Event Temporal
Relation Extraction [61.44799147458621]
Event temporal relation extraction(ETRE) is usually formulated as a multi-label classification task.
We observe that all relations can be interpreted using the start and end time points of events.
We propose a unified event temporal relation extraction framework, which transforms temporal relations into logical expressions of time points.
arXiv Detail & Related papers (2023-05-28T02:09:08Z) - Relational Sentence Embedding for Flexible Semantic Matching [86.21393054423355]
We present Sentence Embedding (RSE), a new paradigm to discover further the potential of sentence embeddings.
RSE is effective and flexible in modeling sentence relations and outperforms a series of state-of-the-art embedding methods.
arXiv Detail & Related papers (2022-12-17T05:25:17Z) - R$^2$-Net: Relation of Relation Learning Network for Sentence Semantic
Matching [58.72111690643359]
We propose a Relation of Relation Learning Network (R2-Net) for sentence semantic matching.
We first employ BERT to encode the input sentences from a global perspective.
Then a CNN-based encoder is designed to capture keywords and phrase information from a local perspective.
To fully leverage labels for better relation information extraction, we introduce a self-supervised relation of relation classification task.
arXiv Detail & Related papers (2020-12-16T13:11:30Z) - Relation Clustering in Narrative Knowledge Graphs [71.98234178455398]
relational sentences in the original text are embedded (with SBERT) and clustered in order to merge together semantically similar relations.
Preliminary tests show that such clustering might successfully detect similar relations, and provide a valuable preprocessing for semi-supervised approaches.
arXiv Detail & Related papers (2020-11-27T10:43:04Z) - Learning Relation Prototype from Unlabeled Texts for Long-tail Relation
Extraction [84.64435075778988]
We propose a general approach to learn relation prototypes from unlabeled texts.
We learn relation prototypes as an implicit factor between entities.
We conduct experiments on two publicly available datasets: New York Times and Google Distant Supervision.
arXiv Detail & Related papers (2020-11-27T06:21:12Z) - Extracting N-ary Cross-sentence Relations using Constrained Subsequence
Kernel [36.86738081453646]
We propose a new formulation of the relation extraction task where the relations are more general than intra-sentence relations.
We propose a novel sequence representation to characterize instances of such relations.
We evaluate our approach on three datasets across two domains: biomedical and general domain.
arXiv Detail & Related papers (2020-06-15T07:23:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.