Cadence Detection in Symbolic Classical Music using Graph Neural
Networks
- URL: http://arxiv.org/abs/2208.14819v1
- Date: Wed, 31 Aug 2022 12:39:57 GMT
- Title: Cadence Detection in Symbolic Classical Music using Graph Neural
Networks
- Authors: Emmanouil Karystinaios, Gerhard Widmer
- Abstract summary: We present a graph representation of symbolic scores as an intermediate means to solve the cadence detection task.
We approach cadence detection as an imbalanced node classification problem using a Graph Convolutional Network.
Our experiments suggest that graph convolution can learn non-local features that assist in cadence detection, freeing us from the need of having to devise specialized features that encode non-local context.
- Score: 7.817685358710508
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cadences are complex structures that have been driving music from the
beginning of contrapuntal polyphony until today. Detecting such structures is
vital for numerous MIR tasks such as musicological analysis, key detection, or
music segmentation. However, automatic cadence detection remains challenging
mainly because it involves a combination of high-level musical elements like
harmony, voice leading, and rhythm. In this work, we present a graph
representation of symbolic scores as an intermediate means to solve the cadence
detection task. We approach cadence detection as an imbalanced node
classification problem using a Graph Convolutional Network. We obtain results
that are roughly on par with the state of the art, and we present a model
capable of making predictions at multiple levels of granularity, from
individual notes to beats, thanks to the fine-grained, note-by-note
representation. Moreover, our experiments suggest that graph convolution can
learn non-local features that assist in cadence detection, freeing us from the
need of having to devise specialized features that encode non-local context. We
argue that this general approach to modeling musical scores and classification
tasks has a number of potential advantages, beyond the specific recognition
task presented here.
Related papers
- Toward a More Complete OMR Solution [49.74172035862698]
Optical music recognition aims to convert music notation into digital formats.
One approach to tackle OMR is through a multi-stage pipeline, where the system first detects visual music notation elements in the image.
We introduce a music object detector based on YOLOv8, which improves detection performance.
Second, we introduce a supervised training pipeline that completes the notation assembly stage based on detection output.
arXiv Detail & Related papers (2024-08-31T01:09:12Z) - GraphMuse: A Library for Symbolic Music Graph Processing [3.997809845676912]
GraphMuse is a graph processing framework and library that facilitates efficient music graph processing.
New neighbor sampling technique specifically targeted toward meaningful behavior in musical scores is central to our contribution.
Our hope is that GraphMuse will lead to a boost in, and standardization of, symbolic music processing based on graph representations.
arXiv Detail & Related papers (2024-07-17T15:54:09Z) - Cluster and Separate: a GNN Approach to Voice and Staff Prediction for Score Engraving [5.572472212662453]
This paper approaches the problem of separating the notes from a quantized symbolic music piece (e.g., a MIDI file) into multiple voices and staves.
We propose an end-to-end system based on graph neural networks that notes that belong to the same chord and connect them with edges if they are part of a voice.
arXiv Detail & Related papers (2024-07-15T14:36:13Z) - Perception-Inspired Graph Convolution for Music Understanding Tasks [3.5570874721859016]
We propose a new graph convolutional block, MusGConv, specifically designed for the efficient processing of musical score data.
We evaluate our approach on four different musical understanding problems.
arXiv Detail & Related papers (2024-05-15T10:04:44Z) - Combinatorial music generation model with song structure graph analysis [18.71152526968065]
We construct a graph that uses information such as note sequence and instrument as node features, while the correlation between note sequences acts as the edge feature.
We trained a Graph Neural Network to obtain node representation in the graph, then we use node representation as input of Unet to generate CONLON pianoroll image latent.
arXiv Detail & Related papers (2023-12-24T04:09:30Z) - Temporal Graph Network Embedding with Causal Anonymous Walks
Representations [54.05212871508062]
We propose a novel approach for dynamic network representation learning based on Temporal Graph Network.
For evaluation, we provide a benchmark pipeline for the evaluation of temporal network embeddings.
We show the applicability and superior performance of our model in the real-world downstream graph machine learning task provided by one of the top European banks.
arXiv Detail & Related papers (2021-08-19T15:39:52Z) - Visual Attention for Musical Instrument Recognition [72.05116221011949]
We explore the use of attention mechanism in a timbral-temporal sense, a la visual attention, to improve the performance of musical instrument recognition.
The first approach applies attention mechanism to the sliding-window paradigm, where a prediction based on each timbral-temporal instance' is given an attention weight, before aggregation to produce the final prediction.
The second approach is based on a recurrent model of visual attention where the network only attends to parts of the spectrogram and decide where to attend to next, given a limited number of glimpses'
arXiv Detail & Related papers (2020-06-17T03:56:44Z) - Music Gesture for Visual Sound Separation [121.36275456396075]
"Music Gesture" is a keypoint-based structured representation to explicitly model the body and finger movements of musicians when they perform music.
We first adopt a context-aware graph network to integrate visual semantic context with body dynamics, and then apply an audio-visual fusion model to associate body movements with the corresponding audio signals.
arXiv Detail & Related papers (2020-04-20T17:53:46Z) - Audio Impairment Recognition Using a Correlation-Based Feature
Representation [85.08880949780894]
We propose a new representation of hand-crafted features that is based on the correlation of feature pairs.
We show superior performance in terms of compact feature dimensionality and improved computational speed in the test stage.
arXiv Detail & Related papers (2020-03-22T13:34:37Z) - Multi-Modal Music Information Retrieval: Augmenting Audio-Analysis with
Visual Computing for Improved Music Video Analysis [91.3755431537592]
This thesis combines audio-analysis with computer vision to approach Music Information Retrieval (MIR) tasks from a multi-modal perspective.
The main hypothesis of this work is based on the observation that certain expressive categories such as genre or theme can be recognized on the basis of the visual content alone.
The experiments are conducted for three MIR tasks Artist Identification, Music Genre Classification and Cross-Genre Classification.
arXiv Detail & Related papers (2020-02-01T17:57:14Z) - Modeling Musical Structure with Artificial Neural Networks [0.0]
I explore the application of artificial neural networks to different aspects of musical structure modeling.
I show how a connectionist model, the Gated Autoencoder (GAE), can be employed to learn transformations between musical fragments.
I propose a special predictive training of the GAE, which yields a representation of polyphonic music as a sequence of intervals.
arXiv Detail & Related papers (2020-01-06T18:35:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.