OLR 2021 Challenge: Datasets, Rules and Baselines
- URL: http://arxiv.org/abs/2107.11113v1
- Date: Fri, 23 Jul 2021 09:57:29 GMT
- Title: OLR 2021 Challenge: Datasets, Rules and Baselines
- Authors: Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang
Hong, Lin Li, Dong Wang, Liming Song and Cheng Yang
- Abstract summary: The data profile, four tasks, two baselines, and the evaluation principles are introduced in this paper.
In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.
- Score: 23.878103387338918
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper introduces the sixth Oriental Language Recognition (OLR) 2021
Challenge, which intends to improve the performance of language recognition
systems and speech recognition systems within multilingual scenarios. The data
profile, four tasks, two baselines, and the evaluation principles are
introduced in this paper. In addition to the Language Identification (LID)
tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to
OLR 2021 Challenge for the first time. The challenge this year focuses on more
practical and challenging problems, with four tasks: (1) constrained LID, (2)
unconstrained LID, (3) constrained multilingual ASR, (4) unconstrained
multilingual ASR. Baselines for LID tasks and multilingual ASR tasks are
provided, respectively. The LID baseline system is an extended TDNN x-vector
model constructed with Pytorch. A transformer-based end-to-end model is
provided as the multilingual ASR baseline system. These recipes will be online
published, and available for participants to construct their own LID or ASR
systems. The baseline results demonstrate that those tasks are rather
challenging and deserve more effort to achieve better performance.
Related papers
- Efficiently Aligned Cross-Lingual Transfer Learning for Conversational
Tasks using Prompt-Tuning [98.60739735409243]
Cross-lingual transfer of language models trained on high-resource languages like English has been widely studied for many NLP tasks.
We introduce XSGD for cross-lingual alignment pretraining, a parallel and large-scale multilingual conversation dataset.
To facilitate aligned cross-lingual representations, we develop an efficient prompt-tuning-based method for learning alignment prompts.
arXiv Detail & Related papers (2023-04-03T18:46:01Z) - Bridging Cross-Lingual Gaps During Leveraging the Multilingual
Sequence-to-Sequence Pretraining for Text Generation [80.16548523140025]
We extend the vanilla pretrain-finetune pipeline with extra code-switching restore task to bridge the gap between the pretrain and finetune stages.
Our approach could narrow the cross-lingual sentence representation distance and improve low-frequency word translation with trivial computational cost.
arXiv Detail & Related papers (2022-04-16T16:08:38Z) - Oriental Language Recognition (OLR) 2020: Summary and Analysis [21.212345251874513]
The fifth Oriental Language Recognition (OLR) Challenge focuses on language recognition in a variety of complex environments.
This paper describes the three tasks, the database profile, and the final results of the challenge.
arXiv Detail & Related papers (2021-07-05T12:42:40Z) - Dual Script E2E framework for Multilingual and Code-Switching ASR [4.697788649564087]
We train multilingual and code-switching ASR systems for Indian languages.
Inspired by results in text-to-speech synthesis, we use an in-house rule-based common label set ( CLS) representation.
We show our results on the multilingual and code-switching tasks of the Indic ASR Challenge 2021.
arXiv Detail & Related papers (2021-06-02T18:08:27Z) - Low-Resource Spoken Language Identification Using Self-Attentive Pooling
and Deep 1D Time-Channel Separable Convolutions [0.0]
We show that a convolutional neural network with a Self-Attentive Pooling layer shows promising results in low-resource setting for the language identification task.
We also substantiate the hypothesis that whenever the dataset is diverse enough so that the other classification factors, like gender, age etc. are well-averaged, the confusion matrix for LID system bears the language similarity measure.
arXiv Detail & Related papers (2021-05-31T18:35:27Z) - Crossing the Conversational Chasm: A Primer on Multilingual
Task-Oriented Dialogue Systems [51.328224222640614]
Current state-of-the-art ToD models based on large pretrained neural language models are data hungry.
Data acquisition for ToD use cases is expensive and tedious.
arXiv Detail & Related papers (2021-04-17T15:19:56Z) - Multilingual and code-switching ASR challenges for low resource Indian
languages [59.2906853285309]
We focus on building multilingual and code-switching ASR systems through two different subtasks related to a total of seven Indian languages.
We provide a total of 600 hours of transcribed speech data, comprising train and test sets, in these languages.
We also provide a baseline recipe for both the tasks with a WER of 30.73% and 32.45% on the test sets of multilingual and code-switching subtasks, respectively.
arXiv Detail & Related papers (2021-04-01T03:37:01Z) - Adversarial Meta Sampling for Multilingual Low-Resource Speech
Recognition [159.9312272042253]
We develop a novel adversarial meta sampling (AMS) approach to improve multilingual meta-learning ASR (MML-ASR)
AMS adaptively determines the task sampling probability for each source language.
Experiment results on two multilingual datasets show significant performance improvement when applying our AMS on MML-ASR.
arXiv Detail & Related papers (2020-12-22T09:33:14Z) - Streaming End-to-End Bilingual ASR Systems with Joint Language
Identification [19.09014345299161]
We introduce streaming, end-to-end, bilingual systems that perform both ASR and language identification.
The proposed method is applied to two language pairs: English-Spanish as spoken in the United States, and English-Hindi as spoken in India.
arXiv Detail & Related papers (2020-07-08T05:00:25Z) - AP20-OLR Challenge: Three Tasks and Their Baselines [29.652143329022817]
The data profile, three tasks, the corresponding baselines, and the evaluation principles are introduced in this paper.
The AP20-OLR challenge includes more languages, dialects and real-life data provided by Speechocean and the NSFC M2ASR project.
arXiv Detail & Related papers (2020-06-04T16:29:21Z) - XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training,
Understanding and Generation [100.09099800591822]
XGLUE is a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models.
XGLUE provides 11 diversified tasks that cover both natural language understanding and generation scenarios.
arXiv Detail & Related papers (2020-04-03T07:03:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.