LyCon: Lyrics Reconstruction from the Bag-of-Words Using Large Language Models
- URL: http://arxiv.org/abs/2408.14750v1
- Date: Tue, 27 Aug 2024 03:01:48 GMT
- Title: LyCon: Lyrics Reconstruction from the Bag-of-Words Using Large Language Models
- Authors: Haven Kim, Kahyun Choi,
- Abstract summary: Our study introduces a novel method for generating copyright-free lyrics from publicly available Bag-of-Words datasets.
We have compiled and made available a dataset of reconstructed lyrics, LyCon, aligned with metadata from renowned sources.
We believe that the integration of metadata such as mood annotations or genres enables a variety of academic experiments on lyrics.
- Score: 1.1510009152620668
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper addresses the unique challenge of conducting research in lyric studies, where direct use of lyrics is often restricted due to copyright concerns. Unlike typical data, internet-sourced lyrics are frequently protected under copyright law, necessitating alternative approaches. Our study introduces a novel method for generating copyright-free lyrics from publicly available Bag-of-Words (BoW) datasets, which contain the vocabulary of lyrics but not the lyrics themselves. Utilizing metadata associated with BoW datasets and large language models, we successfully reconstructed lyrics. We have compiled and made available a dataset of reconstructed lyrics, LyCon, aligned with metadata from renowned sources including the Million Song Dataset, Deezer Mood Detection Dataset, and AllMusic Genre Dataset, available for public access. We believe that the integration of metadata such as mood annotations or genres enables a variety of academic experiments on lyrics, such as conditional lyric generation.
Related papers
- SongCreator: Lyrics-based Universal Song Generation [53.248473603201916]
SongCreator is a song-generation system designed to tackle the challenge of generating songs with both vocals and accompaniment given lyrics.
The model features two novel designs: a meticulously designed dual-sequence language model (M) to capture the information of vocals and accompaniment for song generation, and a series of attention mask strategies for DSLM.
Experiments demonstrate the effectiveness of SongCreator by achieving state-of-the-art or competitive performances on all eight tasks.
arXiv Detail & Related papers (2024-09-09T19:37:07Z) - REFFLY: Melody-Constrained Lyrics Editing Model [50.03960548399128]
We introduce REFFLY, the first revision framework designed to edit arbitrary forms of plain text draft into high-quality, full-fledged song lyrics.
Our approach ensures that the generated lyrics retain the original meaning of the draft, align with the melody, and adhere to the desired song structures.
arXiv Detail & Related papers (2024-08-30T23:22:34Z) - Detecting Synthetic Lyrics with Few-Shot Inference [5.448536338411993]
We have curated the first dataset of high-quality synthetic lyrics.
Our best few-shot detector, based on LLM2Vec, surpasses stylistic and statistical methods.
This study emphasizes the need for further research on creative content detection.
arXiv Detail & Related papers (2024-06-21T15:19:21Z) - The Song Describer Dataset: a Corpus of Audio Captions for
Music-and-Language Evaluation [18.984512029792235]
We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs.
The dataset consists of 1.1k human-written natural language descriptions of 706 music recordings.
arXiv Detail & Related papers (2023-11-16T17:52:21Z) - Sudowoodo: a Chinese Lyric Imitation System with Source Lyrics [10.99860269567001]
We introduce textbftextitSudowoodo, a Chinese lyrics imitation system that can generate new lyrics based on the text of source lyrics.
During the inference process, we utilize a post-processing module to filter and rank the generated lyrics, selecting the highest-quality ones.
The human evaluation results show that our framework can perform better lyric imitation.
arXiv Detail & Related papers (2023-08-09T02:12:04Z) - LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT [48.28624219567131]
We introduce LyricWhiz, a robust, multilingual, and zero-shot automatic lyrics transcription method.
We use Whisper, a weakly supervised robust speech recognition model, and GPT-4, today's most performant chat-based large language model.
Our experiments show that LyricWhiz significantly reduces Word Error Rate compared to existing methods in English.
arXiv Detail & Related papers (2023-06-29T17:01:51Z) - Unsupervised Melody-to-Lyric Generation [91.29447272400826]
We propose a method for generating high-quality lyrics without training on any aligned melody-lyric data.
We leverage the segmentation and rhythm alignment between melody and lyrics to compile the given melody into decoding constraints.
Our model can generate high-quality lyrics that are more on-topic, singable, intelligible, and coherent than strong baselines.
arXiv Detail & Related papers (2023-05-30T17:20:25Z) - Unsupervised Melody-Guided Lyrics Generation [84.22469652275714]
We propose to generate pleasantly listenable lyrics without training on melody-lyric aligned data.
We leverage the crucial alignments between melody and lyrics and compile the given melody into constraints to guide the generation process.
arXiv Detail & Related papers (2023-05-12T20:57:20Z) - Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation [158.54649047794794]
Re-creation of Creations (ROC) is a new paradigm for lyric-to-melody generation.
ROC achieves good lyric-melody feature alignment in lyric-to-melody generation.
arXiv Detail & Related papers (2022-08-11T08:44:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.