Coreference Resolution System for Indonesian Text with Mention Pair
Method and Singleton Exclusion using Convolutional Neural Network
- URL: http://arxiv.org/abs/2009.05675v1
- Date: Fri, 11 Sep 2020 22:21:19 GMT
- Title: Coreference Resolution System for Indonesian Text with Mention Pair
Method and Singleton Exclusion using Convolutional Neural Network
- Authors: Turfa Auliarachman (1), Ayu Purwarianti (1) ((1) Institut Teknologi
Bandung)
- Abstract summary: We propose a new coreference resolution system for Indonesian text with mention pair method.
In addition to lexical and syntactic features, in order to learn the representation of the mentions words and context, we use word embeddings and feed them to CNN.
Our proposed system outperforms the state-of-the-art system.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Neural network has shown promising performance on coreference resolution
systems that uses mention pair method. With deep neural network, it can learn
hidden and deep relations between two mentions. However, there is no work on
coreference resolution for Indonesian text that uses this learning technique.
The state-of-the-art system for Indonesian text only states the use of lexical
and syntactic features can improve the existing coreference resolution system.
In this paper, we propose a new coreference resolution system for Indonesian
text with mention pair method that uses deep neural network to learn the
relations of the two mentions. In addition to lexical and syntactic features,
in order to learn the representation of the mentions words and context, we use
word embeddings and feed them to Convolutional Neural Network (CNN).
Furthermore, we do singleton exclusion using singleton classifier component to
prevent singleton mentions entering any entity clusters at the end. Achieving
67.37% without singleton exclusion, 63.27% with trained singleton classifier,
and 75.95% with gold singleton classifier on CoNLL average F1 score, our
proposed system outperforms the state-of-the-art system.
Related papers
- SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution [11.062090350704617]
Singleton mentions, i.e.entities mentioned only once in a text, are important to how humans understand discourse from a theoretical perspective.
Previous attempts to incorporate their detection in end-to-end neural coreference resolution for English have been hampered by the lack of singleton mention spans in the OntoNotes benchmark.
This paper addresses this limitation by combining predicted mentions from existing nested NER systems and features derived from OntoNotes syntax trees.
arXiv Detail & Related papers (2024-03-25T22:46:16Z) - DenoSent: A Denoising Objective for Self-Supervised Sentence
Representation Learning [59.4644086610381]
We propose a novel denoising objective that inherits from another perspective, i.e., the intra-sentence perspective.
By introducing both discrete and continuous noise, we generate noisy sentences and then train our model to restore them to their original form.
Our empirical evaluations demonstrate that this approach delivers competitive results on both semantic textual similarity (STS) and a wide range of transfer tasks.
arXiv Detail & Related papers (2024-01-24T17:48:45Z) - SMDDH: Singleton Mention detection using Deep Learning in Hindi Text [0.0]
This paper proposes a singleton mention detection module based on a fully connected network and a Convolutional neural network for Hindi text.
In terms of Precision, Recall, and F-measure, the experimental findings obtained are excellent.
arXiv Detail & Related papers (2023-01-23T10:58:18Z) - Hybrid Rule-Neural Coreference Resolution System based on Actor-Critic
Learning [53.73316523766183]
Coreference resolution systems need to tackle two main tasks.
One task is to detect all of the potential mentions, the other is to learn the linking of an antecedent for each possible mention.
We propose a hybrid rule-neural coreference resolution system based on actor-critic learning.
arXiv Detail & Related papers (2022-12-20T08:55:47Z) - Neural Coreference Resolution based on Reinforcement Learning [53.73316523766183]
Coreference resolution systems need to solve two subtasks.
One task is to detect all of the potential mentions, the other is to learn the linking of an antecedent for each possible mention.
We propose a reinforcement learning actor-critic-based neural coreference resolution system.
arXiv Detail & Related papers (2022-12-18T07:36:35Z) - Neural Implicit Dictionary via Mixture-of-Expert Training [111.08941206369508]
We present a generic INR framework that achieves both data and training efficiency by learning a Neural Implicit Dictionary (NID)
Our NID assembles a group of coordinate-based Impworks which are tuned to span the desired function space.
Our experiments show that, NID can improve reconstruction of 2D images or 3D scenes by 2 orders of magnitude faster with up to 98% less input data.
arXiv Detail & Related papers (2022-07-08T05:07:19Z) - Incorporating Constituent Syntax for Coreference Resolution [50.71868417008133]
We propose a graph-based method to incorporate constituent syntactic structures.
We also explore to utilise higher-order neighbourhood information to encode rich structures in constituent trees.
Experiments on the English and Chinese portions of OntoNotes 5.0 benchmark show that our proposed model either beats a strong baseline or achieves new state-of-the-art performance.
arXiv Detail & Related papers (2022-02-22T07:40:42Z) - Preliminary study on using vector quantization latent spaces for TTS/VC
systems with consistent performance [55.10864476206503]
We investigate the use of quantized vectors to model the latent linguistic embedding.
By enforcing different policies over the latent spaces in the training, we are able to obtain a latent linguistic embedding.
Our experiments show that the voice cloning system built with vector quantization has only a small degradation in terms of perceptive evaluations.
arXiv Detail & Related papers (2021-06-25T07:51:35Z) - Semi-Supervised Neural System for Tagging, Parsing and Lematization [1.6752182911522522]
This paper describes the ICS PAS system which took part in CoNLL 2018 shared task on Multilingual Parsing from Raw Text to Universal Dependencies.
The system consists of jointly trained tagger, lemmatizer, and dependency which are based on features extracted by a biLSTM network.
arXiv Detail & Related papers (2020-04-26T18:29:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.