Chat Translation Error Detection for Assisting Cross-lingual
Communications
- URL: http://arxiv.org/abs/2308.01044v1
- Date: Wed, 2 Aug 2023 09:38:29 GMT
- Title: Chat Translation Error Detection for Assisting Cross-lingual
Communications
- Authors: Yunmeng Li, Jun Suzuki, Makoto Morishita, Kaori Abe, Ryoko Tokuhisa,
Ana Brassard, Kentaro Inui
- Abstract summary: We train an error detector as the baseline of the system and construct a new Japanese-English bilingual chat corpus, BPersona-chat.
The error detector can serve as an encouraging foundation for more advanced erroneous translation detection systems.
- Score: 35.09508360315392
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we describe the development of a communication support system
that detects erroneous translations to facilitate crosslingual communications
due to the limitations of current machine chat translation methods. We trained
an error detector as the baseline of the system and constructed a new
Japanese-English bilingual chat corpus, BPersona-chat, which comprises
multiturn colloquial chats augmented with crowdsourced quality ratings. The
error detector can serve as an encouraging foundation for more advanced
erroneous translation detection systems.
Related papers
- A Context-aware Framework for Translation-mediated Conversations [29.169155271343083]
We present a framework to improve large language model-based translation systems by incorporating contextual information in bilingual conversational settings.
We validate both components of our framework on two task-oriented domains: customer chat and user-assistant interaction.
Our framework consistently results in better translations than state-of-the-art systems like GPT-4o and TowerInstruct.
arXiv Detail & Related papers (2024-12-05T14:41:05Z) - Detection of Human and Machine-Authored Fake News in Urdu [2.013675429941823]
Social media has amplified the spread of fake news.
Traditional fake news detection methods relying on linguistic cues become less effective.
We propose a hierarchical detection strategy to improve the accuracy and robustness.
arXiv Detail & Related papers (2024-10-25T12:42:07Z) - An Investigation of Warning Erroneous Chat Translations in Cross-lingual Communication [35.69695355173317]
Machine translation models are still inappropriate for translating chats, despite the popularity of translation software and plug-in applications.
Instead of pursuing a flawless translation system, a more practical approach would be to issue warning messages about potential mistranslations to reduce confusion.
This paper tackles to investigate this question and demonstrates the warning messages' contribution to making chat translation systems effective.
arXiv Detail & Related papers (2024-08-28T05:36:25Z) - Interactively Diagnosing Errors in a Semantic Parser [7.136205674624813]
We present work in progress on an interactive error diagnosis system for the CNLU.
We show how the first two stages of the INLD pipeline can be cast as a model-based diagnosis problem.
We demonstrate our system's ability to diagnose semantic errors on synthetic examples.
arXiv Detail & Related papers (2024-07-08T21:16:09Z) - AdaCCD: Adaptive Semantic Contrasts Discovery Based Cross Lingual
Adaptation for Code Clone Detection [69.79627042058048]
AdaCCD is a novel cross-lingual adaptation method that can detect cloned codes in a new language without annotations in that language.
We evaluate the cross-lingual adaptation results of AdaCCD by constructing a multilingual code clone detection benchmark consisting of 5 programming languages.
arXiv Detail & Related papers (2023-11-13T12:20:48Z) - DiariST: Streaming Speech Translation with Speaker Diarization [53.595990270899414]
We propose DiariST, the first streaming ST and SD solution.
It is built upon a neural transducer-based streaming ST system and integrates token-level serialized output training and t-vector.
Our system achieves a strong ST and SD capability compared to offline systems based on Whisper, while performing streaming inference for overlapping speech.
arXiv Detail & Related papers (2023-09-14T19:33:27Z) - Towards cross-language prosody transfer for dialog [3.3758186776249928]
Speech-to-speech translation systems do not adequately support use for dialog purposes.
In particular, nuances of speaker intent and stance can be lost due to improper prosody transfer.
We develop a data collection protocol in which bilingual speakers re-enact utterances from an earlier conversation in their other language.
arXiv Detail & Related papers (2023-07-09T08:32:14Z) - Cognitive Semantic Communication Systems Driven by Knowledge Graph:
Principle, Implementation, and Performance Evaluation [74.38561925376996]
Two cognitive semantic communication frameworks are proposed for the single-user and multiple-user communication scenarios.
An effective semantic correction algorithm is proposed by mining the inference rule from the knowledge graph.
For the multi-user cognitive semantic communication system, a message recovery algorithm is proposed to distinguish messages of different users.
arXiv Detail & Related papers (2023-03-15T12:01:43Z) - Cross-lingual Transfer Learning for Fake News Detector in a Low-Resource
Language [0.8122270502556374]
Development of methods to detect fake news (FN) in low-resource languages has been impeded by a lack of training data.
In this study, we solve the problem by using only training data from a high-resource language.
Our FN-detection system permitted this strategy by applying adversarial learning that transfers the detection knowledge through languages.
arXiv Detail & Related papers (2022-08-26T07:41:27Z) - Zero-Shot Cross-lingual Semantic Parsing [56.95036511882921]
We study cross-lingual semantic parsing as a zero-shot problem without parallel data for 7 test languages.
We propose a multi-task encoder-decoder model to transfer parsing knowledge to additional languages using only English-Logical form paired data.
Our system frames zero-shot parsing as a latent-space alignment problem and finds that pre-trained models can be improved to generate logical forms with minimal cross-lingual transfer penalty.
arXiv Detail & Related papers (2021-04-15T16:08:43Z) - On the Robustness of Language Encoders against Grammatical Errors [66.05648604987479]
We collect real grammatical errors from non-native speakers and conduct adversarial attacks to simulate these errors on clean text data.
Results confirm that the performance of all tested models is affected but the degree of impact varies.
arXiv Detail & Related papers (2020-05-12T11:01:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.