Musical Form Generation
- URL: http://arxiv.org/abs/2310.19842v1
- Date: Mon, 30 Oct 2023 08:02:08 GMT
- Title: Musical Form Generation
- Authors: Lilac Atassi
- Abstract summary: This paper introduces an approach for generating structured, arbitrarily long musical pieces.
Central to this approach is the creation of musical segments using a conditional generative model.
The generation of prompts that determine the high-level composition is distinct from the creation of finer, lower-level details.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While recent generative models can produce engaging music, their utility is
limited. The variation in the music is often left to chance, resulting in
compositions that lack structure. Pieces extending beyond a minute can become
incoherent or repetitive. This paper introduces an approach for generating
structured, arbitrarily long musical pieces. Central to this approach is the
creation of musical segments using a conditional generative model, with
transitions between these segments. The generation of prompts that determine
the high-level composition is distinct from the creation of finer, lower-level
details. A large language model is then used to suggest the musical form.
Related papers
- PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation [5.201151187019607]
PerceiverS (Segmentation and Scale) is a novel architecture designed to generate long-structured and expressive music.
Our approach enhances symbolic music generation by simultaneously learning long-term structural dependencies and short-term expressive details.
The proposed model, evaluated on datasets like Maestro, demonstrates improvements in generating coherent and diverse music.
arXiv Detail & Related papers (2024-11-13T03:14:10Z) - Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces [0.0]
This paper proposes integrating a text-to-music model with a large language model to generate music with form.
The experimental results show that the proposed method can generate 2.5-minute-long music that is highly structured, strongly organized, and cohesive.
arXiv Detail & Related papers (2024-10-01T02:43:14Z) - Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models [5.736540322759929]
We make the first attempt to model a full music piece under the realization of compositional hierarchy.
High-level languages reveal whole-song form, phrase, and cadence, whereas the low-level languages focus on notes, chords, and their local patterns.
Experiments and analysis show that our model is capable of generating full-piece music with recognizable global verse-chorus structure and cadences.
arXiv Detail & Related papers (2024-05-16T08:48:23Z) - Graph-based Polyphonic Multitrack Music Generation [9.701208207491879]
This paper introduces a novel graph representation for music and a deep Variational Autoencoder that generates the structure and the content of musical graphs separately.
By separating the structure and content of musical graphs, it is possible to condition generation by specifying which instruments are played at certain times.
arXiv Detail & Related papers (2023-07-27T15:18:50Z) - Unsupervised Melody-to-Lyric Generation [91.29447272400826]
We propose a method for generating high-quality lyrics without training on any aligned melody-lyric data.
We leverage the segmentation and rhythm alignment between melody and lyrics to compile the given melody into decoding constraints.
Our model can generate high-quality lyrics that are more on-topic, singable, intelligible, and coherent than strong baselines.
arXiv Detail & Related papers (2023-05-30T17:20:25Z) - Unsupervised Melody-Guided Lyrics Generation [84.22469652275714]
We propose to generate pleasantly listenable lyrics without training on melody-lyric aligned data.
We leverage the crucial alignments between melody and lyrics and compile the given melody into constraints to guide the generation process.
arXiv Detail & Related papers (2023-05-12T20:57:20Z) - Noise2Music: Text-conditioned Music Generation with Diffusion Models [73.74580231353684]
We introduce Noise2Music, where a series of diffusion models is trained to generate high-quality 30-second music clips from text prompts.
We find that the generated audio is not only able to faithfully reflect key elements of the text prompt such as genre, tempo, instruments, mood, and era.
Pretrained large language models play a key role in this story -- they are used to generate paired text for the audio of the training set and to extract embeddings of the text prompts ingested by the diffusion models.
arXiv Detail & Related papers (2023-02-08T07:27:27Z) - Museformer: Transformer with Fine- and Coarse-Grained Attention for
Music Generation [138.74751744348274]
We propose Museformer, a Transformer with a novel fine- and coarse-grained attention for music generation.
Specifically, with the fine-grained attention, a token of a specific bar directly attends to all the tokens of the bars that are most relevant to music structures.
With the coarse-grained attention, a token only attends to the summarization of the other bars rather than each token of them so as to reduce the computational cost.
arXiv Detail & Related papers (2022-10-19T07:31:56Z) - Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation [158.54649047794794]
Re-creation of Creations (ROC) is a new paradigm for lyric-to-melody generation.
ROC achieves good lyric-melody feature alignment in lyric-to-melody generation.
arXiv Detail & Related papers (2022-08-11T08:44:47Z) - Controllable deep melody generation via hierarchical music structure
representation [14.891975420982511]
MusicFrameworks is a hierarchical music structure representation and a multi-step generative process to create a full-length melody.
To generate melody in each phrase, we generate rhythm and basic melody using two separate transformer-based networks.
To customize or add variety, one can alter chords, basic melody, and rhythm structure in the music frameworks, letting our networks generate the melody accordingly.
arXiv Detail & Related papers (2021-09-02T01:31:14Z) - Melody-Conditioned Lyrics Generation with SeqGANs [81.2302502902865]
We propose an end-to-end melody-conditioned lyrics generation system based on Sequence Generative Adversarial Networks (SeqGAN)
We show that the input conditions have no negative impact on the evaluation metrics while enabling the network to produce more meaningful results.
arXiv Detail & Related papers (2020-10-28T02:35:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.