Song Emotion Classification of Lyrics with Out-of-Domain Data under Label Scarcity
- URL: http://arxiv.org/abs/2410.05778v1
- Date: Tue, 8 Oct 2024 07:58:15 GMT
- Title: Song Emotion Classification of Lyrics with Out-of-Domain Data under Label Scarcity
- Authors: Jonathan Sakunkoo, Annabella Sakunkoo,
- Abstract summary: There is a scarcity of large, high quality in-domain datasets for lyrics-based song emotion classification.
CNN models trained on a large Reddit comments dataset achieve satisfactory performance and generalizability to lyrical emotion classification.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Songs have been found to profoundly impact human emotions, with lyrics having significant power to stimulate emotional changes in the audience. There is a scarcity of large, high quality in-domain datasets for lyrics-based song emotion classification (Edmonds and Sedoc, 2021; Zhou, 2022). It has been noted that in-domain training datasets are often difficult to acquire (Zhang and Miao, 2023) and that label acquisition is often limited by cost, time, and other factors (Azad et al., 2018). We examine the novel usage of a large out-of-domain dataset as a creative solution to the challenge of training data scarcity in the emotional classification of song lyrics. We find that CNN models trained on a large Reddit comments dataset achieve satisfactory performance and generalizability to lyrical emotion classification, thus giving insights into and a promising possibility in leveraging large, publicly available out-of-domain datasets for domains whose in-domain data are lacking or costly to acquire.
Related papers
- Semi-Supervised Self-Learning Enhanced Music Emotion Recognition [6.315220462630698]
Music emotion recognition (MER) aims to identify the emotions conveyed in a given musical piece.
Currently, the available public datasets have limited sample sizes.
We propose a semi-supervised self-learning (SSSL) method, which can differentiate between samples with correct and incorrect labels in a self-learning manner.
arXiv Detail & Related papers (2024-10-29T09:42:07Z) - Synthetic Lyrics Detection Across Languages and Genres [4.987546582439803]
Large language models (LLMs) to generate music content, particularly lyrics, has gained in popularity.
Previous research has explored content detection in various domains, but no work has focused on the modality of lyrics in music.
We curated a diverse dataset of real and synthetic lyrics from multiple languages, music genres, and artists.
arXiv Detail & Related papers (2024-06-21T15:19:21Z) - Unsupervised Melody-to-Lyric Generation [91.29447272400826]
We propose a method for generating high-quality lyrics without training on any aligned melody-lyric data.
We leverage the segmentation and rhythm alignment between melody and lyrics to compile the given melody into decoding constraints.
Our model can generate high-quality lyrics that are more on-topic, singable, intelligible, and coherent than strong baselines.
arXiv Detail & Related papers (2023-05-30T17:20:25Z) - Unsupervised Melody-Guided Lyrics Generation [84.22469652275714]
We propose to generate pleasantly listenable lyrics without training on melody-lyric aligned data.
We leverage the crucial alignments between melody and lyrics and compile the given melody into constraints to guide the generation process.
arXiv Detail & Related papers (2023-05-12T20:57:20Z) - Tollywood Emotions: Annotation of Valence-Arousal in Telugu Song Lyrics [0.0]
We present a new manually annotated dataset of Telugu songs' lyrics collected from Spotify.
We create two music emotion recognition models by using two classification techniques.
We make the dataset publicly available with lyrics, annotations and Spotify IDs.
arXiv Detail & Related papers (2023-03-16T14:47:52Z) - Cross-domain Sentiment Classification in Spanish [18.563342761346608]
We study the ability of a classification system trained with a large database of product reviews to generalize to different Spanish domains.
Results suggest that generalization across domains is feasible though very challenging when trained with these product reviews.
arXiv Detail & Related papers (2023-03-15T23:11:30Z) - Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on
Data-Driven Deep Learning [70.30713251031052]
We propose a data-driven deep learning model, i.e. StrengthNet, to improve the generalization of emotion strength assessment for seen and unseen speech.
Experiments show that the predicted emotion strength of the proposed StrengthNet is highly correlated with ground truth scores for both seen and unseen speech.
arXiv Detail & Related papers (2022-06-15T01:25:32Z) - Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised
Pre-Training [67.71228426496013]
We show that using target domain data during pre-training leads to large performance improvements across a variety of setups.
We find that pre-training on multiple domains improves performance generalization on domains not seen during training.
arXiv Detail & Related papers (2021-04-02T12:53:15Z) - Neural Network architectures to classify emotions in Indian Classical
Music [0.0]
We present a new dataset called JUMusEmoDB which presently has 400 audio clips (30 seconds each)
For supervised classification purposes, we have used 4 existing deep Convolutional Neural Network (CNN) based architectures.
This type of CNN based classification algorithm using a rich corpus of Indian Classical Music is unique even in the global perspective.
arXiv Detail & Related papers (2021-02-01T03:41:25Z) - Emotional Semantics-Preserved and Feature-Aligned CycleGAN for Visual
Emotion Adaptation [85.20533077846606]
Unsupervised domain adaptation (UDA) studies the problem of transferring models trained on one labeled source domain to another unlabeled target domain.
In this paper, we focus on UDA in visual emotion analysis for both emotion distribution learning and dominant emotion classification.
We propose a novel end-to-end cycle-consistent adversarial model, termed CycleEmotionGAN++.
arXiv Detail & Related papers (2020-11-25T01:31:01Z) - Melody-Conditioned Lyrics Generation with SeqGANs [81.2302502902865]
We propose an end-to-end melody-conditioned lyrics generation system based on Sequence Generative Adversarial Networks (SeqGAN)
We show that the input conditions have no negative impact on the evaluation metrics while enabling the network to produce more meaningful results.
arXiv Detail & Related papers (2020-10-28T02:35:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.