Revisiting Negation in Neural Machine Translation
- URL: http://arxiv.org/abs/2107.12203v1
- Date: Mon, 26 Jul 2021 13:19:57 GMT
- Title: Revisiting Negation in Neural Machine Translation
- Authors: Gongbo Tang, Philipp R\"onchen, Rico Sennrich, Joakim Nivre
- Abstract summary: We show that the ability of neural machine translation (NMT) models to translate negation has improved with deeper and more advanced networks.
The accuracy of manual evaluation in English--German (EN--DE) and English--Chinese (EN--ZH) is 95.7%, 94.8%, 93.4%, and 91.7%, respectively.
- Score: 26.694559863395877
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this paper, we evaluate the translation of negation both automatically and
manually, in English--German (EN--DE) and English--Chinese (EN--ZH). We show
that the ability of neural machine translation (NMT) models to translate
negation has improved with deeper and more advanced networks, although the
performance varies between language pairs and translation directions. The
accuracy of manual evaluation in EN-DE, DE-EN, EN-ZH, and ZH-EN is 95.7%,
94.8%, 93.4%, and 91.7%, respectively. In addition, we show that
under-translation is the most significant error type in NMT, which contrasts
with the more diverse error profile previously observed for statistical machine
translation. To better understand the root of the under-translation of
negation, we study the model's information flow and training data. While our
information flow analysis does not reveal any deficiencies that could be used
to detect or fix the under-translation of negation, we find that negation is
often rephrased during training, which could make it more difficult for the
model to learn a reliable link between source and target negation. We finally
conduct intrinsic analysis and extrinsic probing tasks on negation, showing
that NMT models can distinguish negation and non-negation tokens very well and
encode a lot of information about negation in hidden states but nevertheless
leave room for improvement.
Related papers
- Revisiting subword tokenization: A case study on affixal negation in large language models [57.75279238091522]
We measure the impact of affixal negation on modern English large language models (LLMs)
We conduct experiments using LLMs with different subword tokenization methods.
We show that models can, on the whole, reliably recognize the meaning of affixal negation.
arXiv Detail & Related papers (2024-04-03T03:14:27Z) - CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about
Negation [21.56001677478673]
We present the first English reading comprehension dataset which requires reasoning about the implications of negated statements in paragraphs.
CONDAQA features 14,182 question-answer pairs with over 200 unique negation cues.
The best performing model on CONDAQA (UnifiedQA-v2-3b) achieves only 42% on our consistency metric, well below human performance which is 81%.
arXiv Detail & Related papers (2022-11-01T06:10:26Z) - Understanding and Mitigating the Uncertainty in Zero-Shot Translation [92.25357943169601]
We aim to understand and alleviate the off-target issues from the perspective of uncertainty in zero-shot translation.
We propose two lightweight and complementary approaches to denoise the training data for model training.
Our approaches significantly improve the performance of zero-shot translation over strong MNMT baselines.
arXiv Detail & Related papers (2022-05-20T10:29:46Z) - Improving negation detection with negation-focused pre-training [58.32362243122714]
Negation is a common linguistic feature that is crucial in many language understanding tasks.
Recent work has shown that state-of-the-art NLP models underperform on samples containing negation.
We propose a new negation-focused pre-training strategy, involving targeted data augmentation and negation masking.
arXiv Detail & Related papers (2022-05-09T02:41:11Z) - ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality
Estimation and Corrective Feedback [70.5469946314539]
ChrEnTranslate is an online machine translation demonstration system for translation between English and an endangered language Cherokee.
It supports both statistical and neural translation models as well as provides quality estimation to inform users of reliability.
arXiv Detail & Related papers (2021-07-30T17:58:54Z) - Detecting over/under-translation errors for determining adequacy in
human translations [0.0]
We present a novel approach to detecting over and under translations (OT/UT) as part of adequacy error checks in translation evaluation.
We do not restrict ourselves to machine translation (MT) outputs and specifically target applications with human generated translation pipeline.
The goal of our system is to identify OT/UT errors from human translated video subtitles with high error recall.
arXiv Detail & Related papers (2021-04-01T06:06:36Z) - It's not a Non-Issue: Negation as a Source of Error in Machine
Translation [33.991817055535854]
We investigate whether translating negation is an issue for modern machine translation systems using 17 translation directions as test bed.
We find that indeed the presence of negation can significantly impact downstream quality, in some cases resulting in quality reductions of more than 60%.
arXiv Detail & Related papers (2020-10-12T03:34:44Z) - It's Easier to Translate out of English than into it: Measuring Neural
Translation Difficulty by Cross-Mutual Information [90.35685796083563]
Cross-mutual information (XMI) is an asymmetric information-theoretic metric of machine translation difficulty.
XMI exploits the probabilistic nature of most neural machine translation models.
We present the first systematic and controlled study of cross-lingual translation difficulties using modern neural translation systems.
arXiv Detail & Related papers (2020-05-05T17:38:48Z) - On the Inference Calibration of Neural Machine Translation [54.48932804996506]
We study the correlation between calibration and translation performance and linguistic properties of miscalibration.
We propose a new graduated label smoothing method that can improve both inference calibration and translation performance.
arXiv Detail & Related papers (2020-05-03T02:03:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.