Scoring Coreference Chains with Split-Antecedent Anaphors
- URL: http://arxiv.org/abs/2205.12323v1
- Date: Tue, 24 May 2022 19:07:36 GMT
- Title: Scoring Coreference Chains with Split-Antecedent Anaphors
- Authors: Silviu Paun and Juntao Yu and Nafise Sadat Moosavi and Massimo Poesio
- Abstract summary: We propose a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split-antecedents.
This is the first such proposal in the literature on anaphora or coreference, and has been successfully used to score both split-antecedent plural references and discourse deixis.
- Score: 23.843305521306227
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anaphoric reference is an aspect of language interpretation covering a
variety of types of interpretation beyond the simple case of identity reference
to entities introduced via nominal expressions covered by the traditional
coreference task in its most recent incarnation in ONTONOTES and similar
datasets. One of these cases that go beyond simple coreference is anaphoric
reference to entities that must be added to the discourse model via
accommodation, and in particular split-antecedent references to entities
constructed out of other entities, as in split-antecedent plurals and in some
cases of discourse deixis. Although this type of anaphoric reference is now
annotated in many datasets, systems interpreting such references cannot be
evaluated using the Reference coreference scorer Pradhan et al. (2014). As part
of the work towards a new scorer for anaphoric reference able to evaluate all
aspects of anaphoric interpretation in the coverage of the Universal Anaphora
initiative, we propose in this paper a solution to the technical problem of
generalizing existing metrics for identity anaphora so that they can also be
used to score cases of split-antecedents. This is the first such proposal in
the literature on anaphora or coreference, and has been successfully used to
score both split-antecedent plural references and discourse deixis in the
recent CODI/CRAC anaphora resolution in dialogue shared tasks.
Related papers
- Coherent Entity Disambiguation via Modeling Topic and Categorical
Dependency [87.16283281290053]
Previous entity disambiguation (ED) methods adopt a discriminative paradigm, where prediction is made based on matching scores between mention context and candidate entities.
We propose CoherentED, an ED system equipped with novel designs aimed at enhancing the coherence of entity predictions.
We achieve new state-of-the-art results on popular ED benchmarks, with an average improvement of 1.3 F1 points.
arXiv Detail & Related papers (2023-11-06T16:40:13Z) - Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References [123.39034752499076]
Div-Ref is a method to enhance evaluation benchmarks by enriching the number of references.
We conduct experiments to empirically demonstrate that diversifying the expression of reference can significantly enhance the correlation between automatic evaluation and human evaluation.
arXiv Detail & Related papers (2023-05-24T11:53:29Z) - Robust Saliency-Aware Distillation for Few-shot Fine-grained Visual
Recognition [57.08108545219043]
Recognizing novel sub-categories with scarce samples is an essential and challenging research topic in computer vision.
Existing literature addresses this challenge by employing local-based representation approaches.
This article proposes a novel model, Robust Saliency-aware Distillation (RSaD), for few-shot fine-grained visual recognition.
arXiv Detail & Related papers (2023-05-12T00:13:17Z) - ezCoref: Towards Unifying Annotation Guidelines for Coreference
Resolution [28.878540389202367]
We develop a crowdsourcing-friendly coreference annotation methodology, ezCoref, consisting of an annotation tool and an interactive tutorial.
We use ezCoref to re-annotate 240 passages from seven existing English coreference datasets (spanning fiction, news, and multiple other domains) while teaching annotators only cases that are treated similarly across these datasets.
Surprisingly, we find that reasonable quality annotations were already achievable (>90% agreement between the crowd and expert annotations) even without extensive training.
arXiv Detail & Related papers (2022-10-13T17:09:59Z) - Entity Disambiguation with Entity Definitions [50.01142092276296]
Local models have recently attained astounding performances in Entity Disambiguation (ED)
Previous works limited their studies to using, as the textual representation of each candidate, only its Wikipedia title.
In this paper, we address this limitation and investigate to what extent more expressive textual representations can mitigate it.
We report a new state of the art on 2 out of 6 benchmarks we consider and strongly improve the generalization capability over unseen patterns.
arXiv Detail & Related papers (2022-10-11T17:46:28Z) - Aggregating Crowdsourced and Automatic Judgments to Scale Up a Corpus of
Anaphoric Reference for Fiction and Wikipedia Texts [16.42217979543271]
This paper introduces a new release of a corpus for anaphoric reference labelled via a game-with-a-purpose.
It is comparable in size to the largest existing corpora for anaphoric reference due in part to substantial activity by the players.
The proposed method could be adopted to greatly speed up annotation time in other projects involving games-with-a-purpose.
arXiv Detail & Related papers (2022-10-11T16:13:57Z) - Author Clustering and Topic Estimation for Short Texts [69.54017251622211]
We propose a novel model that expands on the Latent Dirichlet Allocation by modeling strong dependence among the words in the same document.
We also simultaneously cluster users, removing the need for post-hoc cluster estimation.
Our method performs as well as -- or better -- than traditional approaches to problems arising in short text.
arXiv Detail & Related papers (2021-06-15T20:55:55Z) - Stay Together: A System for Single and Split-antecedent Anaphora
Resolution [19.98823717287972]
Split-antecedent anaphora is rarer and more complex to resolve than single-antecedent anaphora.
We introduce a system that resolves both single and split-antecedent anaphors, and evaluate it in a more realistic setting.
arXiv Detail & Related papers (2021-04-12T10:01:08Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.