An Investigation of Indian Native Language Phonemic Influences on L2
English Pronunciations
- URL: http://arxiv.org/abs/2212.09284v1
- Date: Mon, 19 Dec 2022 07:41:39 GMT
- Title: An Investigation of Indian Native Language Phonemic Influences on L2
English Pronunciations
- Authors: Shelly Jain, Priyanshi Pal, Anil Vuppala, Prasanta Ghosh, Chiranjeevi
Yarra
- Abstract summary: Growing number of L2 English speakers in India reinforces need to study accents and L1-L2 interactions.
We investigate the accents of Indian English (IE) speakers and report in detail our observations, both specific and common to all regions.
We demonstrate the influence of 18 Indian languages on IE by comparing the native language pronunciations with IE pronunciations obtained jointly from existing literature studies and phonetically annotated speech of 80 speakers.
- Score: 5.3956335232250385
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Speech systems are sensitive to accent variations. This is especially
challenging in the Indian context, with an abundance of languages but a dearth
of linguistic studies characterising pronunciation variations. The growing
number of L2 English speakers in India reinforces the need to study accents and
L1-L2 interactions. We investigate the accents of Indian English (IE) speakers
and report in detail our observations, both specific and common to all regions.
In particular, we observe the phonemic variations and phonotactics occurring in
the speakers' native languages and apply this to their English pronunciations.
We demonstrate the influence of 18 Indian languages on IE by comparing the
native language pronunciations with IE pronunciations obtained jointly from
existing literature studies and phonetically annotated speech of 80 speakers.
Consequently, we are able to validate the intuitions of Indian language
influences on IE pronunciations by justifying pronunciation rules from the
perspective of Indian language phonology. We obtain a comprehensive description
in terms of universal and region-specific characteristics of IE, which
facilitates accent conversion and adaptation of existing ASR and TTS systems to
different Indian accents.
Related papers
- Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS [52.89324095217975]
Previous approaches on accent conversion mainly aimed at making non-native speech sound more native.
We develop a new AC approach that not only focuses on accent conversion but also improves pronunciation of non-native accented speaker.
arXiv Detail & Related papers (2024-10-19T06:12:31Z) - What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects [60.8361859783634]
We survey speakers of dialects and regional languages related to German.
We find that respondents are especially in favour of potential NLP tools that work with dialectal input.
arXiv Detail & Related papers (2024-02-19T09:15:28Z) - Multi-lingual and Multi-cultural Figurative Language Understanding [69.47641938200817]
Figurative language permeates human communication, but is relatively understudied in NLP.
We create a dataset for seven diverse languages associated with a variety of cultures: Hindi, Indonesian, Javanese, Kannada, Sundanese, Swahili and Yoruba.
Our dataset reveals that each language relies on cultural and regional concepts for figurative expressions, with the highest overlap between languages originating from the same region.
All languages exhibit a significant deficiency compared to English, with variations in performance reflecting the availability of pre-training and fine-tuning data.
arXiv Detail & Related papers (2023-05-25T15:30:31Z) - Svarah: Evaluating English ASR Systems on Indian Accents [12.197514367387692]
We create Svarah, a benchmark that contains 9.6 hours of transcribed English audio from 117 speakers across 65 geographic locations throughout India.
Svarah comprises both read speech and spontaneous conversational data, covering various domains, such as history, culture, tourism, etc., ensuring a diverse vocabulary.
We evaluate 6 open source ASR models and 2 commercial ASR systems on Svarah and show that there is clear scope for improvement on Indian accents.
arXiv Detail & Related papers (2023-05-25T06:20:29Z) - Improve Bilingual TTS Using Dynamic Language and Phonology Embedding [10.244215079409797]
This paper builds a Mandarin-English TTS system to acquire more standard spoken English speech from a monolingual Chinese speaker.
We specially design an embedding strength modulator to capture the dynamic strength of language and phonology.
arXiv Detail & Related papers (2022-12-07T03:46:18Z) - Study of Indian English Pronunciation Variabilities relative to Received
Pronunciation [5.3956335232250385]
We consider a corpus, IndicTIMIT, which is rich in the diversity of IE varieties and is curated in a nativity balanced manner.
We present an approach to validate the phonetic rules of IE along with reporting unexplored rules derived using a data-driven manner.
arXiv Detail & Related papers (2022-04-13T16:35:52Z) - A study on native American English speech recognition by Indian
listeners with varying word familiarity level [62.14295630922855]
We have three kinds of responses from each listener while they recognize an utterance.
From these transcriptions, word error rate (WER) is calculated and used as a metric to evaluate the similarity between the recognized and the original sentences.
Speaker nativity wise analysis shows that utterances from speakers of some nativity are more difficult to recognize by Indian listeners compared to few other nativities.
arXiv Detail & Related papers (2021-12-08T07:43:38Z) - Non-native English lexicon creation for bilingual speech synthesis [9.533867546985887]
The intelligibility of a bilingual text-to-speech system depends on a lexicon that captures the phoneme sequence used by non-native speakers.
Due to the lack of non-native English lexicon, existing bilingual TTS systems employ native English lexicons that are widely available.
We propose a generic approach to obtain rules based on letter to phoneme alignment to map native English lexicon to their non-native version.
arXiv Detail & Related papers (2021-06-21T06:07:14Z) - Automatic Speech Recognition in Sanskrit: A New Speech Corpus and
Modelling Insights [25.666767669695044]
We release a 78 hour ASR dataset for Sanskrit, which faithfully captures several of the linguistic characteristics expressed by the language.
We propose a new modelling unit, inspired by the syllable level unit selection, that captures character sequences from one vowel in the word to the next vowel.
We extend these insights from Sanskrit ASR for building ASR systems in two other Indic languages, Gujarati and Telugu.
arXiv Detail & Related papers (2021-06-02T18:06:32Z) - Phoneme Recognition through Fine Tuning of Phonetic Representations: a
Case Study on Luhya Language Varieties [77.2347265289855]
We focus on phoneme recognition using Allosaurus, a method for multilingual recognition based on phonetic annotation.
To evaluate in a challenging real-world scenario, we curate phone recognition datasets for Bukusu and Saamia, two varieties of the Luhya language cluster of western Kenya and eastern Uganda.
We find that fine-tuning of Allosaurus, even with just 100 utterances, leads to significant improvements in phone error rates.
arXiv Detail & Related papers (2021-04-04T15:07:55Z) - AlloVera: A Multilingual Allophone Database [137.3686036294502]
AlloVera provides mappings from 218 allophones to phonemes for 14 languages.
We show that a "universal" allophone model, Allosaurus, built with AlloVera, outperforms "universal" phonemic models and language-specific models on a speech-transcription task.
arXiv Detail & Related papers (2020-04-17T02:02:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.