Incorporating Human Translator Style into English-Turkish Literary
Machine Translation
- URL: http://arxiv.org/abs/2307.11457v1
- Date: Fri, 21 Jul 2023 09:39:50 GMT
- Title: Incorporating Human Translator Style into English-Turkish Literary
Machine Translation
- Authors: Zeynep Yirmibe\c{s}o\u{g}lu, Olgun Dursun, Harun Dall{\i}, Mehmet
\c{S}ahin, Ena Hodzik, Sabri G\"urses, Tunga G\"ung\"or
- Abstract summary: We develop machine translation models that take into account the stylistic features of translators.
We show that the human translator style can be highly recreated in the target machine translations.
- Score: 0.26168876987285306
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Although machine translation systems are mostly designed to serve in the
general domain, there is a growing tendency to adapt these systems to other
domains like literary translation. In this paper, we focus on English-Turkish
literary translation and develop machine translation models that take into
account the stylistic features of translators. We fine-tune a pre-trained
machine translation model by the manually-aligned works of a particular
translator. We make a detailed analysis of the effects of manual and automatic
alignments, data augmentation methods, and corpus size on the translations. We
propose an approach based on stylistic features to evaluate the style of a
translator in the output translations. We show that the human translator style
can be highly recreated in the target machine translations by adapting the
models to the style of the translator.
Related papers
- (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts [52.18246881218829]
We introduce a novel multi-agent framework based on large language models (LLMs) for literary translation, implemented as a company called TransAgents.
To evaluate the effectiveness of our system, we propose two innovative evaluation strategies: Monolingual Human Preference (MHP) and Bilingual LLM Preference (BLP)
arXiv Detail & Related papers (2024-05-20T05:55:08Z) - Advancing Translation Preference Modeling with RLHF: A Step Towards
Cost-Effective Solution [57.42593422091653]
We explore leveraging reinforcement learning with human feedback to improve translation quality.
A reward model with strong language capabilities can more sensitively learn the subtle differences in translation quality.
arXiv Detail & Related papers (2024-02-18T09:51:49Z) - Synslator: An Interactive Machine Translation Tool with Online Learning [19.6690111454278]
Interactive machine translation (IMT) has emerged as a progression of the computer-aided translation paradigm.
This paper introduces Synslator, a user-friendly computer-aided translation (CAT) tool that not only supports IMT, but is adept at online learning with real-time translation memories.
arXiv Detail & Related papers (2023-10-08T06:05:55Z) - Decomposed Prompting for Machine Translation Between Related Languages
using Large Language Models [55.35106713257871]
We introduce DecoMT, a novel approach of few-shot prompting that decomposes the translation process into a sequence of word chunk translations.
We show that DecoMT outperforms the strong few-shot prompting BLOOM model with an average improvement of 8 chrF++ scores across the examined languages.
arXiv Detail & Related papers (2023-05-22T14:52:47Z) - The Best of Both Worlds: Combining Human and Machine Translations for
Multilingual Semantic Parsing with Active Learning [50.320178219081484]
We propose an active learning approach that exploits the strengths of both human and machine translations.
An ideal utterance selection can significantly reduce the error and bias in the translated data.
arXiv Detail & Related papers (2023-05-22T05:57:47Z) - Translate your gibberish: black-box adversarial attack on machine
translation systems [0.0]
We present a simple approach to fool state-of-the-art machine translation tools in the task of translation from Russian to English and vice versa.
We show that many online translation tools, such as Google, DeepL, and Yandex, may both produce wrong or offensive translations for nonsensical adversarial input queries.
This vulnerability may interfere with understanding a new language and simply worsen the user's experience while using machine translation systems.
arXiv Detail & Related papers (2023-03-20T09:52:52Z) - Applying Automated Machine Translation to Educational Video Courses [0.0]
We studied the capability of automated machine translation in the online video education space.
We applied text-to-speech synthesis and audio/video synchronization to build engaging videos in target languages.
arXiv Detail & Related papers (2023-01-09T01:44:29Z) - Improving Sentiment Analysis over non-English Tweets using Multilingual
Transformers and Automatic Translation for Data-Augmentation [77.69102711230248]
We propose the use of a multilingual transformer model, that we pre-train over English tweets and apply data-augmentation using automatic translation to adapt the model to non-English languages.
Our experiments in French, Spanish, German and Italian suggest that the proposed technique is an efficient way to improve the results of the transformers over small corpora of tweets in a non-English language.
arXiv Detail & Related papers (2020-10-07T15:44:55Z) - Computer Assisted Translation with Neural Quality Estimation and
Automatic Post-Editing [18.192546537421673]
We propose an end-to-end deep learning framework of the quality estimation and automatic post-editing of the machine translation output.
Our goal is to provide error correction suggestions and to further relieve the burden of human translators through an interpretable model.
arXiv Detail & Related papers (2020-09-19T00:29:00Z) - Translation Artifacts in Cross-lingual Transfer Learning [51.66536640084888]
We show that machine translation can introduce subtle artifacts that have a notable impact in existing cross-lingual models.
In natural language inference, translating the premise and the hypothesis independently can reduce the lexical overlap between them.
We also improve the state-of-the-art in XNLI for the translate-test and zero-shot approaches by 4.3 and 2.8 points, respectively.
arXiv Detail & Related papers (2020-04-09T17:54:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.