Prosody Labelled Dataset for Hindi using Semi-Automated Approach
- URL: http://arxiv.org/abs/2112.05973v1
- Date: Sat, 11 Dec 2021 13:11:36 GMT
- Title: Prosody Labelled Dataset for Hindi using Semi-Automated Approach
- Authors: Esha Banerjee, Atul Kr. Ojha, Girish Nath Jha
- Abstract summary: This study aims to develop a semi-automatically labelled prosody database for Hindi.
No single standard for prosody labelling exists in Hindi.
The accuracy of the trained models for pitch accent, intermediate phrase boundaries and accentual phrase boundaries is 73.40%, 93.20%, and 43% respectively.
- Score: 0.19733467999508417
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This study aims to develop a semi-automatically labelled prosody database for
Hindi, for enhancing the intonation component in ASR and TTS systems, which is
also helpful for building Speech to Speech Machine Translation systems.
Although no single standard for prosody labelling exists in Hindi, researchers
in the past have employed perceptual and statistical methods in literature to
draw inferences about the behaviour of prosody patterns in Hindi. Based on such
existing research and largely agreed upon theories of intonation in Hindi, this
study attempts to first develop a manually annotated prosodic corpus of Hindi
speech data, which is then used for training prediction models for generating
automatic prosodic labels. A total of 5,000 sentences (23,500 words) for
declarative and interrogative types have been labelled. The accuracy of the
trained models for pitch accent, intermediate phrase boundaries and accentual
phrase boundaries is 73.40%, 93.20%, and 43% respectively.
Related papers
- Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing [0.0]
It is crucial to design robust plagiarism detection systems tailored for low-resource languages.
This paper presents a method to enhance the accuracy of plagiarism detection for Marathi texts.
arXiv Detail & Related papers (2025-01-09T14:14:18Z) - Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach [53.189911918976655]
We propose DOLPHIN, a novel retrieval model designed to enhance handwriting representations through synergistic temporal-frequency analysis.<n>We introduce OLIWER, a large-scale online writer retrieval dataset encompassing over 670,000 Chinese handwritten phrases from 1,731 individuals.<n>Our findings emphasize the significance of point sampling frequency and pressure features in improving handwriting representation quality.
arXiv Detail & Related papers (2024-12-16T11:19:22Z) - Navigating Text-to-Image Generative Bias across Indic Languages [53.92640848303192]
This research investigates biases in text-to-image (TTI) models for the Indic languages widely spoken across India.
It evaluates and compares the generative performance and cultural relevance of leading TTI models in these languages against their performance in English.
arXiv Detail & Related papers (2024-08-01T04:56:13Z) - Speech-to-Speech Translation For A Real-world Unwritten Language [62.414304258701804]
We study speech-to-speech translation (S2ST) that translates speech from one language into another language.
We present an end-to-end solution from training data collection, modeling choices to benchmark dataset release.
arXiv Detail & Related papers (2022-11-11T20:21:38Z) - Aggression in Hindi and English Speech: Acoustic Correlates and
Automatic Identification [0.802904964931021]
The study is based on a corpus of slightly over 10 hours of political discourse.
We develop two automatic classification systems for identifying aggression in English and Hindi speech.
arXiv Detail & Related papers (2022-04-06T13:29:25Z) - Automatic Dialect Density Estimation for African American English [74.44807604000967]
We explore automatic prediction of dialect density of the African American English (AAE) dialect.
dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect.
We show a significant correlation between our predicted and ground truth dialect density measures for AAE speech in this database.
arXiv Detail & Related papers (2022-04-03T01:34:48Z) - Towards automatic identification of linguistic politeness in Hindi texts [1.2691047660244332]
I have used the manually annotated corpus of over 25,000 blog comments to train an SVM.
The trained system gives a significantly high accuracy of over 77% which is within 2% of human accuracy.
arXiv Detail & Related papers (2021-11-30T10:32:17Z) - Leveraging Pre-trained Language Model for Speech Sentiment Analysis [58.78839114092951]
We explore the use of pre-trained language models to learn sentiment information of written texts for speech sentiment analysis.
We propose a pseudo label-based semi-supervised training strategy using a language model on an end-to-end speech sentiment approach.
arXiv Detail & Related papers (2021-06-11T20:15:21Z) - Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine
Tuned Multilingual Embeddings [4.3012765978447565]
The hostility detection task has been well explored for resource-rich languages like English, but is unexplored for resource-constrained languages like Hindidue to the unavailability of large suitable data.
We propose an effective neural network-based technique for hostility detection in Hindi posts.
arXiv Detail & Related papers (2021-01-13T11:00:31Z) - Infusing Finetuning with Semantic Dependencies [62.37697048781823]
We show that, unlike syntax, semantics is not brought to the surface by today's pretrained models.
We then use convolutional graph encoders to explicitly incorporate semantic parses into task-specific finetuning.
arXiv Detail & Related papers (2020-12-10T01:27:24Z) - A Survey on Text Classification: From Shallow to Deep Learning [83.47804123133719]
The last decade has seen a surge of research in this area due to the unprecedented success of deep learning.
This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021.
We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification.
arXiv Detail & Related papers (2020-08-02T00:09:03Z) - Validation and Normalization of DCS corpus using Sanskrit Heritage tools
to build a tagged Gold Corpus [0.0]
The Digital Corpus of Sanskrit records around 650,000 sentences along with their morphological and lexical tagging.
The Sanskrit Heritage Engine's Reader produces all possible segmentations with morphological and lexical analyses.
arXiv Detail & Related papers (2020-05-13T19:23:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.