Comprehension of Subtitles from Re-Translating Simultaneous Speech
Translation
- URL: http://arxiv.org/abs/2203.02458v1
- Date: Fri, 4 Mar 2022 17:41:39 GMT
- Title: Comprehension of Subtitles from Re-Translating Simultaneous Speech
Translation
- Authors: D\'avid Javorsk\'y, Dominik Mach\'a\v{c}ek, Ond\v{r}ej Bojar
- Abstract summary: In simultaneous speech translation, one can vary the size of the output window, system latency and sometimes the allowed level of rewriting.
The effect of these properties on readability and comprehensibility has not been tested with modern neural translation systems.
It is a pilot study with 14 users on 2 hours of German documentaries or speeches with online translations into Czech.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In simultaneous speech translation, one can vary the size of the output
window, system latency and sometimes the allowed level of rewriting. The effect
of these properties on readability and comprehensibility has not been tested
with modern neural translation systems. In this work, we propose an evaluation
method and investigate the effects on comprehension and user preferences. It is
a pilot study with 14 users on 2 hours of German documentaries or speeches with
online translations into Czech. We collect continuous feedback and answers on
factual questions. Our results show that the subtitling layout or flicker have
a little effect on comprehension, in contrast to machine translation itself and
individual competence. Other results show that users with a limited knowledge
of the source language have different preferences to stability and latency than
the users with zero knowledge. The results are statistically insignificant,
however, we show that our method works and can be reproduced in larger volume.
Related papers
- Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering [29.703811132512573]
Building a reliable visual question answering(VQA) system across different languages is a challenging problem.
Recent studies have employed machine translation systems for the cross-lingual VQA task.
arXiv Detail & Related papers (2024-06-04T14:00:02Z) - Prosody in Cascade and Direct Speech-to-Text Translation: a case study
on Korean Wh-Phrases [79.07111754406841]
This work proposes using contrastive evaluation to measure the ability of direct S2TT systems to disambiguate utterances where prosody plays a crucial role.
Our results clearly demonstrate the value of direct translation systems over cascade translation models.
arXiv Detail & Related papers (2024-02-01T14:46:35Z) - Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer
with Fine-tuning Slow and Fast [50.19681990847589]
Existing research has shown that a multilingual pre-trained language model fine-tuned with one (source) language also performs well on downstream tasks for non-source languages.
This paper analyzes the fine-tuning process, discovers when the performance gap changes and identifies which network weights affect the overall performance most.
arXiv Detail & Related papers (2023-05-19T06:04:21Z) - How Robust is Neural Machine Translation to Language Imbalance in
Multilingual Tokenizer Training? [86.48323488619629]
We analyze how translation performance changes as the data ratios among languages vary in the tokenizer training corpus.
We find that while relatively better performance is often observed when languages are more equally sampled, the downstream performance is more robust to language imbalance than we usually expected.
arXiv Detail & Related papers (2022-04-29T17:50:36Z) - ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality
Estimation and Corrective Feedback [70.5469946314539]
ChrEnTranslate is an online machine translation demonstration system for translation between English and an endangered language Cherokee.
It supports both statistical and neural translation models as well as provides quality estimation to inform users of reliability.
arXiv Detail & Related papers (2021-07-30T17:58:54Z) - Simultaneous Speech Translation for Live Subtitling: from Delay to
Display [13.35771688595446]
We explore the feasibility of simultaneous speech translation (SimulST) for live subtitling.
We adapt SimulST systems to predict subtitle breaks along with the translation.
We propose a display mode that exploits the predicted break structure by presenting the subtitles in scrolling lines.
arXiv Detail & Related papers (2021-07-19T12:35:49Z) - Lost in Interpreting: Speech Translation from Source or Interpreter? [0.0]
We release 10 hours of recordings and transcripts of European Parliament speeches in English, with simultaneous interpreting into Czech and German.
We evaluate quality and latency of speaker-based and interpreter-based spoken translation systems from English to Czech.
arXiv Detail & Related papers (2021-06-17T09:32:49Z) - Backtranslation Feedback Improves User Confidence in MT, Not Quality [18.282199360280433]
We show three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected.
In this paper, we describe an experiment on outbound translation from English to Czech and Estonian.
We show that backward translation feedback has a mixed effect on the whole process: it increases user confidence in the produced translation, but not the objective quality.
arXiv Detail & Related papers (2021-04-12T17:50:24Z) - "Listen, Understand and Translate": Triple Supervision Decouples
End-to-end Speech-to-text Translation [49.610188741500274]
An end-to-end speech-to-text translation (ST) takes audio in a source language and outputs the text in a target language.
Existing methods are limited by the amount of parallel corpus.
We build a system to fully utilize signals in a parallel ST corpus.
arXiv Detail & Related papers (2020-09-21T09:19:07Z) - Translation Artifacts in Cross-lingual Transfer Learning [51.66536640084888]
We show that machine translation can introduce subtle artifacts that have a notable impact in existing cross-lingual models.
In natural language inference, translating the premise and the hypothesis independently can reduce the lexical overlap between them.
We also improve the state-of-the-art in XNLI for the translate-test and zero-shot approaches by 4.3 and 2.8 points, respectively.
arXiv Detail & Related papers (2020-04-09T17:54:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.