DICTDIS: Dictionary Constrained Disambiguation for Improved NMT
- URL: http://arxiv.org/abs/2210.06996v2
- Date: Sun, 21 May 2023 12:47:52 GMT
- Title: DICTDIS: Dictionary Constrained Disambiguation for Improved NMT
- Authors: Ayush Maheshwari, Piyush Sharma, Preethi Jyothi, Ganesh Ramakrishnan
- Abstract summary: We present dictdis, a lexically constrained NMT system that disambiguates between multiple candidate translations derived from dictionaries.
We demonstrate the utility of dictdis via extensive experiments on English-Hindi and English-German sentences in a variety of domains including regulatory, finance, engineering.
- Score: 41.612825615273906
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Domain-specific neural machine translation (NMT) systems (\eg, in educational
applications) are socially significant with the potential to help make
information accessible to a diverse set of users in multilingual societies. It
is desirable that such NMT systems be lexically constrained and draw from
domain-specific dictionaries. Dictionaries could present multiple candidate
translations for a source word/phrase due to the polysemous nature of words.
The onus is then on the NMT model to choose the contextually most appropriate
candidate. Prior work has largely ignored this problem and focused on the
single candidate constraint setting wherein the target word or phrase is
replaced by a single constraint. In this work we present \dictdis, a lexically
constrained NMT system that disambiguates between multiple candidate
translations derived from dictionaries. We achieve this by augmenting training
data with multiple dictionary candidates to actively encourage disambiguation
during training by implicitly aligning multiple candidate constraints. We
demonstrate the utility of \dictdis\ via extensive experiments on English-Hindi
and English-German sentences in a variety of domains including regulatory,
finance, engineering. We also present comparisons on standard benchmark test
datasets. In comparison with existing approaches for lexically constrained and
unconstrained NMT, we demonstrate superior performance with respect to
constraint copy and disambiguation related measures on all domains while also
obtaining improved fluency of up to 2-3 BLEU points on some domains.
Related papers
- A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching [60.51839859852572]
We propose to resolve the text into multi concepts for multilingual semantic matching to liberate the model from the reliance on NER models.
We conduct comprehensive experiments on English datasets QQP and MRPC, and Chinese dataset Medical-SM.
arXiv Detail & Related papers (2024-03-05T13:55:16Z) - Towards Effective Disambiguation for Machine Translation with Large
Language Models [65.80775710657672]
We study the capabilities of large language models to translate "ambiguous sentences"
Experiments show that our methods can match or outperform state-of-the-art systems such as DeepL and NLLB in four out of five language directions.
arXiv Detail & Related papers (2023-09-20T22:22:52Z) - Improving Long Context Document-Level Machine Translation [51.359400776242786]
Document-level context for neural machine translation (NMT) is crucial to improve translation consistency and cohesion.
Many works have been published on the topic of document-level NMT, but most restrict the system to just local context.
We propose a constrained attention variant that focuses the attention on the most relevant parts of the sequence, while simultaneously reducing the memory consumption.
arXiv Detail & Related papers (2023-06-08T13:28:48Z) - Phrase-level Active Learning for Neural Machine Translation [107.28450614074002]
We propose an active learning setting where we can spend a given budget on translating in-domain data.
We select both full sentences and individual phrases from unlabelled data in the new domain for routing to human translators.
In a German-English translation task, our active learning approach achieves consistent improvements over uncertainty-based sentence selection methods.
arXiv Detail & Related papers (2021-06-21T19:20:42Z) - Improving Lexically Constrained Neural Machine Translation with
Source-Conditioned Masked Span Prediction [6.46964825569749]
In this paper, we tackle a more challenging setup consisting of domain-specific corpora with much longer n-gram and highly specialized terms.
To encourage span-level representations in generation, we additionally impose a source-sentence conditioned masked span prediction loss in the decoder.
Experimental results on three domain-specific corpora in two language pairs demonstrate that the proposed training scheme can improve the performance of existing lexically constrained methods.
arXiv Detail & Related papers (2021-05-12T08:11:33Z) - Sentence Alignment with Parallel Documents Helps Biomedical Machine
Translation [0.5430741734728369]
This work presents a new unsupervised sentence alignment method and explores features in training biomedical neural machine translation (NMT) systems.
We use a simple but effective way to build bilingual word embeddings to evaluate bilingual word similarity.
The proposed method achieved high accuracy in both 1-to-1 and many-to-many cases.
arXiv Detail & Related papers (2021-04-17T16:09:30Z) - FDMT: A Benchmark Dataset for Fine-grained Domain Adaptation in Machine
Translation [53.87731008029645]
We present a real-world fine-grained domain adaptation task in machine translation (FDMT)
The FDMT dataset consists of four sub-domains of information technology: autonomous vehicles, AI education, real-time networks and smart phone.
We make quantitative experiments and deep analyses in this new setting, which benchmarks the fine-grained domain adaptation task.
arXiv Detail & Related papers (2020-12-31T17:15:09Z) - Constraint Translation Candidates: A Bridge between Neural Query
Translation and Cross-lingual Information Retrieval [45.88734029123836]
We propose a novel approach to alleviate problems by limiting the open target vocabulary search space of QT to a set of important words mined from search index database.
The proposed methods are exploited and examined in a real-word CLIR system--Aliexpress e-Commerce search engine.
arXiv Detail & Related papers (2020-10-26T15:27:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.