SER_AMPEL: a multi-source dataset for speech emotion recognition of
Italian older adults
- URL: http://arxiv.org/abs/2311.14483v2
- Date: Thu, 14 Dec 2023 15:39:04 GMT
- Title: SER_AMPEL: a multi-source dataset for speech emotion recognition of
Italian older adults
- Authors: Alessandra Grossi and Francesca Gasparini
- Abstract summary: SER_AMPEL is a multi-source dataset for speech emotion recognition (SER)
It is collected with the aim of providing a reference for speech emotion recognition in case of Italian older adults.
The evidence of the need for such a dataset emerges from the analysis of the state of the art.
- Score: 58.49386651361823
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this paper, SER_AMPEL, a multi-source dataset for speech emotion
recognition (SER) is presented. The peculiarity of the dataset is that it is
collected with the aim of providing a reference for speech emotion recognition
in case of Italian older adults. The dataset is collected following different
protocols, in particular considering acted conversations, extracted from movies
and TV series, and recording natural conversations where the emotions are
elicited by proper questions. The evidence of the need for such a dataset
emerges from the analysis of the state of the art. Preliminary considerations
on the critical issues of SER are reported analyzing the classification results
on a subset of the proposed dataset.
Related papers
- PSCon: Toward Conversational Product Search [55.94925947614474]
We introduce a new CPS data collection protocol and present PSCon, a novel CPS dataset designed to assist product search via human-like conversations.
The dataset is constructed using a coached human-to-human data collection protocol and supports two languages and dual markets.
arXiv Detail & Related papers (2025-02-19T17:05:42Z) - M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis [23.523947343171926]
We present M-ABSA, a comprehensive dataset spanning 7 domains and 21 languages.
Our primary focus is on triplet extraction, which involves identifying aspect terms, aspect categories, and sentiment polarities.
Our empirical findings highlight that the dataset enables diverse evaluation tasks, such as multilingual and multi-domain transfer learning.
arXiv Detail & Related papers (2025-02-17T14:16:01Z) - LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization [31.01716151301142]
We present a large-scale far-field overlapping speech dataset to advance research in speech separation, recognition, and speaker diarization.
This dataset is a critical resource for decoding Who said What and When'' in multi-talker, reverberant environments.
arXiv Detail & Related papers (2024-09-01T19:23:08Z) - Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition [48.527630771422935]
We propose a synthetic data generation pipeline for multi-speaker conversational ASR.
We conduct evaluation by fine-tuning the Whisper ASR model for telephone and distant conversational speech settings.
arXiv Detail & Related papers (2024-08-17T14:47:05Z) - What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark [13.820963986497128]
Speech emotion recognition (SER) is essential for enhancing human-computer interaction in speech-based applications.
Despite improvements in specific emotional datasets, there is still a research gap in SER's capability to generalize across real-world situations.
In this paper, we investigate approaches to generalize the SER system across different emotion datasets.
arXiv Detail & Related papers (2024-06-14T11:27:19Z) - Sentiment recognition of Italian elderly through domain adaptation on
cross-corpus speech dataset [77.99182201815763]
The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people.
arXiv Detail & Related papers (2022-11-14T12:39:41Z) - Dialogue Term Extraction using Transfer Learning and Topological Data
Analysis [0.8185867455104834]
We explore different features that can enable systems to discover realizations of domains, slots, and values in dialogues in a purely data-driven fashion.
To examine the utility of each feature set, we train a seed model based on the widely used MultiWOZ data-set.
Our method outperforms the previously proposed approach that relies solely on word embeddings.
arXiv Detail & Related papers (2022-08-22T17:04:04Z) - Automatic Dialect Density Estimation for African American English [74.44807604000967]
We explore automatic prediction of dialect density of the African American English (AAE) dialect.
dialect density is defined as the percentage of words in an utterance that contain characteristics of the non-standard dialect.
We show a significant correlation between our predicted and ground truth dialect density measures for AAE speech in this database.
arXiv Detail & Related papers (2022-04-03T01:34:48Z) - Dialog speech sentiment classification for imbalanced datasets [7.84604505907019]
In this paper, we use single and bi-modal analysis of short dialog utterances and gain insights on the main factors that aid in sentiment detection.
We propose an architecture which uses a learning rate scheduler and different monitoring criteria and provides state-of-the-art results for the SWITCHBOARD imbalanced sentiment dataset.
arXiv Detail & Related papers (2021-09-15T11:43:04Z) - Vyaktitv: A Multimodal Peer-to-Peer Hindi Conversations based Dataset
for Personality Assessment [50.15466026089435]
We present a novel peer-to-peer Hindi conversation dataset- Vyaktitv.
It consists of high-quality audio and video recordings of the participants, with Hinglish textual transcriptions for each conversation.
The dataset also contains a rich set of socio-demographic features, like income, cultural orientation, amongst several others, for all the participants.
arXiv Detail & Related papers (2020-08-31T17:44:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.