Subjectivity in the Annotation of Bridging Anaphora
- URL: http://arxiv.org/abs/2506.07297v1
- Date: Sun, 08 Jun 2025 21:40:09 GMT
- Title: Subjectivity in the Annotation of Bridging Anaphora
- Authors: Lauren Levine, Amir Zeldes,
- Abstract summary: Bridging refers to the associative relationship between inferable entities and their antecedents in a discourse.<n>It is difficult to achieve consistent agreement in the annotation of bridging anaphora and their antecedents.<n>We propose a newly developed classification system for bridging subtypes.
- Score: 10.942182034424714
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Bridging refers to the associative relationship between inferable entities in a discourse and the antecedents which allow us to understand them, such as understanding what "the door" means with respect to an aforementioned "house". As identifying associative relations between entities is an inherently subjective task, it is difficult to achieve consistent agreement in the annotation of bridging anaphora and their antecedents. In this paper, we explore the subjectivity involved in the annotation of bridging instances at three levels: anaphor recognition, antecedent resolution, and bridging subtype selection. To do this, we conduct an annotation pilot on the test set of the existing GUM corpus, and propose a newly developed classification system for bridging subtypes, which we compare to previously proposed schemes. Our results suggest that some previous resources are likely to be severely under-annotated. We also find that while agreement on the bridging subtype category was moderate, annotator overlap for exhaustively identifying instances of bridging is low, and that many disagreements resulted from subjective understanding of the entities involved.
Related papers
- From Disagreement to Understanding: The Case for Ambiguity Detection in NLI [0.0]
We show that content-based ambiguity offers a process-independent signal of divergent human perspectives.<n>A key limitation is the lack of annotated datasets for ambiguity and subtypes.<n>We propose addressing this gap through new annotated resources and unsupervised approaches to ambiguity detection.
arXiv Detail & Related papers (2025-07-20T20:27:35Z) - Leveraging Hierarchical Prototypes as the Verbalizer for Implicit Discourse Relation Recognition [7.149359970799236]
Implicit discourse relation recognition involves determining relationships between spans of text that are not linked by an explicit discourse connective.
Previous work solely relied on manual verbalizers for implicit discourse relation recognition.
We leverage the prototypes that capture certain class-level semantic features and the hierarchical label structure for different classes as the verbalizer.
arXiv Detail & Related papers (2024-11-22T12:01:04Z) - Disambiguate Entity Matching using Large Language Models through Relation Discovery [1.6317061277457001]
We propose a novel approach that shifts focus from purely semantic similarities to understanding and defining the "relations" between entities.
By predefining a set of relations relevant to the task at hand, our method allows analysts to navigate the spectrum of similarity more effectively.
arXiv Detail & Related papers (2024-03-26T03:07:32Z) - Coherent Entity Disambiguation via Modeling Topic and Categorical
Dependency [87.16283281290053]
Previous entity disambiguation (ED) methods adopt a discriminative paradigm, where prediction is made based on matching scores between mention context and candidate entities.
We propose CoherentED, an ED system equipped with novel designs aimed at enhancing the coherence of entity predictions.
We achieve new state-of-the-art results on popular ED benchmarks, with an average improvement of 1.3 F1 points.
arXiv Detail & Related papers (2023-11-06T16:40:13Z) - DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse
Relation Recognition [27.977742959064916]
We propose a prompt-based path prediction method to utilize the interactive information and intrinsic senses among the hierarchy in IDRR.
This is the first work that injects such structure information into pre-trained language models via prompt tuning.
arXiv Detail & Related papers (2023-05-06T08:16:07Z) - Prototype-based Embedding Network for Scene Graph Generation [105.97836135784794]
Current Scene Graph Generation (SGG) methods explore contextual information to predict relationships among entity pairs.
Due to the diverse visual appearance of numerous possible subject-object combinations, there is a large intra-class variation within each predicate category.
Prototype-based Embedding Network (PE-Net) models entities/predicates with prototype-aligned compact and distinctive representations.
PL is introduced to help PE-Net efficiently learn such entitypredicate matching, and Prototype Regularization (PR) is devised to relieve the ambiguous entity-predicate matching.
arXiv Detail & Related papers (2023-03-13T13:30:59Z) - Entity-enhanced Adaptive Reconstruction Network for Weakly Supervised
Referring Expression Grounding [214.8003571700285]
Weakly supervised Referring Expression Grounding (REG) aims to ground a particular target in an image described by a language expression.
We design an entity-enhanced adaptive reconstruction network (EARN)
EARN includes three modules: entity enhancement, adaptive grounding, and collaborative reconstruction.
arXiv Detail & Related papers (2022-07-18T05:30:45Z) - Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation [60.62039705180484]
We propose a hierarchical contrastive learning mechanism, which can unify hybrid granularities semantic meaning in the input text.<n> Experiments demonstrate that our model outperforms competitive baselines on paraphrasing, dialogue generation, and storytelling tasks.
arXiv Detail & Related papers (2022-05-26T13:26:03Z) - Scoring Coreference Chains with Split-Antecedent Anaphors [23.843305521306227]
We propose a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split-antecedents.
This is the first such proposal in the literature on anaphora or coreference, and has been successfully used to score both split-antecedent plural references and discourse deixis.
arXiv Detail & Related papers (2022-05-24T19:07:36Z) - Improving Human-Object Interaction Detection via Phrase Learning and
Label Composition [14.483347746239055]
Human-Object Interaction (HOI) detection is a fundamental task in high-level human-centric scene understanding.
We propose PhraseHOI, containing a HOI branch and a novel phrase branch, to leverage language prior and improve relation expression.
arXiv Detail & Related papers (2021-12-14T13:22:16Z) - Relation Clustering in Narrative Knowledge Graphs [71.98234178455398]
relational sentences in the original text are embedded (with SBERT) and clustered in order to merge together semantically similar relations.
Preliminary tests show that such clustering might successfully detect similar relations, and provide a valuable preprocessing for semi-supervised approaches.
arXiv Detail & Related papers (2020-11-27T10:43:04Z) - Learning to Decouple Relations: Few-Shot Relation Classification with
Entity-Guided Attention and Confusion-Aware Training [49.9995628166064]
We propose CTEG, a model equipped with two mechanisms to learn to decouple easily-confused relations.
On the one hand, an EGA mechanism is introduced to guide the attention to filter out information causing confusion.
On the other hand, a Confusion-Aware Training (CAT) method is proposed to explicitly learn to distinguish relations.
arXiv Detail & Related papers (2020-10-21T11:07:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.