Transcribing Rhythmic Patterns of the Guitar Track in Polyphonic Music
- URL: http://arxiv.org/abs/2510.05756v1
- Date: Tue, 07 Oct 2025 10:22:31 GMT
- Title: Transcribing Rhythmic Patterns of the Guitar Track in Polyphonic Music
- Authors: Aleksandr Lukoianov, Anssi Klapuri,
- Abstract summary: We transcribe the rhythmic patterns in 410 popular songs and record cover versions where the guitar tracks followed those transcriptions.<n>We detect individual strums within the separated guitar audio, using a pre-trained foundation model (MERT) as a backbone.<n>We show that it is possible to transcribe the rhythmic patterns of the guitar track in polyphonic music with quite high accuracy.
- Score: 46.69593319852797
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Whereas chord transcription has received considerable attention during the past couple of decades, far less work has been devoted to transcribing and encoding the rhythmic patterns that occur in a song. The topic is especially relevant for instruments such as the rhythm guitar, which is typically played by strumming rhythmic patterns that repeat and vary over time. However, in many cases one cannot objectively define a single "right" rhythmic pattern for a given song section. To create a dataset with well-defined ground-truth labels, we asked expert musicians to transcribe the rhythmic patterns in 410 popular songs and record cover versions where the guitar tracks followed those transcriptions. To transcribe the strums and their corresponding rhythmic patterns, we propose a three-step framework. Firstly, we perform approximate stem separation to extract the guitar part from the polyphonic mixture. Secondly, we detect individual strums within the separated guitar audio, using a pre-trained foundation model (MERT) as a backbone. Finally, we carry out a pattern-decoding process in which the transcribed sequence of guitar strums is represented by patterns drawn from an expert-curated vocabulary. We show that it is possible to transcribe the rhythmic patterns of the guitar track in polyphonic music with quite high accuracy, producing a representation that is human-readable and includes automatically detected bar lines and time signature markers. We perform ablation studies and error analysis and propose a set of evaluation metrics to assess the accuracy and readability of the predicted rhythmic pattern sequence.
Related papers
- GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer [7.72498447842112]
We introduce GuitarFlow, a model designed specifically for electric guitar synthesis.<n>The generative process is guided using tablatures, an ubiquitous and intuitive guitar-specific symbolic format.<n>We show significant improvement in the realism of the generated guitar audio from tablatures.
arXiv Detail & Related papers (2025-10-23T13:31:41Z) - A Machine Learning Approach for MIDI to Guitar Tablature Conversion [21.416973100105633]
This paper presents a method for assigning guitar tablature notation to a given MIDI-based musical part.<n>The strategy is based on machine learning and requires a basic assumption about how much fingers can stretch on a fretboard.<n>The proposed method also examines the transcription of music pieces that was not meant to be played or could not possibly be played by a guitar.
arXiv Detail & Related papers (2025-10-12T14:01:01Z) - Joint Transcription of Acoustic Guitar Strumming Directions and Chords [2.5398014196797614]
We extend a multimodal approach to guitar strumming transcription by introducing a novel dataset and a deep learning-based transcription model.<n>We collect 90 min of real-world guitar recordings using an ESP32 smartwatch motion sensor and a structured recording protocol.<n>A Convolutional Recurrent Neural Network (CRNN) model is trained to detect strumming events, classify their direction, and identify the corresponding chords using only microphone audio.
arXiv Detail & Related papers (2025-08-11T13:34:49Z) - TapToTab : Video-Based Guitar Tabs Generation using AI and Audio Analysis [0.0]
This paper introduces an advanced approach leveraging deep learning, specifically YOLO models for real-time fretboard detection.
Experimental results demonstrate substantial improvements in detection accuracy and robustness compared to traditional techniques.
This paper aims to revolutionize guitar instruction by automating the creation of guitar tabs from video recordings.
arXiv Detail & Related papers (2024-09-13T08:17:15Z) - MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling [6.150307957212576]
We introduce a novel deep learning solution to symbolic guitar tablature estimation.
We train an encoder-decoder Transformer model in a masked language modeling paradigm to assign notes to strings.
The model is first pre-trained on DadaGP, a dataset of over 25K tablatures, and then fine-tuned on a curated set of professionally transcribed guitar performances.
arXiv Detail & Related papers (2024-08-09T12:25:23Z) - Modeling Bends in Popular Music Guitar Tablatures [49.64902130083662]
Tablature notation is widely used in popular music to transcribe and share guitar musical content.
This paper focuses on bends, which enable to progressively shift the pitch of a note, therefore circumventing physical limitations of the discrete fretted fingerboard.
Experiments are performed on a corpus of 932 lead guitar tablatures of popular music and show that a decision tree successfully predicts bend occurrences with an F1 score of 0.71 anda limited amount of false positive predictions.
arXiv Detail & Related papers (2023-08-22T07:50:58Z) - Unsupervised Melody-to-Lyric Generation [91.29447272400826]
We propose a method for generating high-quality lyrics without training on any aligned melody-lyric data.
We leverage the segmentation and rhythm alignment between melody and lyrics to compile the given melody into decoding constraints.
Our model can generate high-quality lyrics that are more on-topic, singable, intelligible, and coherent than strong baselines.
arXiv Detail & Related papers (2023-05-30T17:20:25Z) - Unsupervised Melody-Guided Lyrics Generation [84.22469652275714]
We propose to generate pleasantly listenable lyrics without training on melody-lyric aligned data.
We leverage the crucial alignments between melody and lyrics and compile the given melody into constraints to guide the generation process.
arXiv Detail & Related papers (2023-05-12T20:57:20Z) - Melody transcription via generative pre-training [86.08508957229348]
Key challenge in melody transcription is building methods which can handle broad audio containing any number of instrument ensembles and musical styles.
To confront this challenge, we leverage representations from Jukebox (Dhariwal et al. 2020), a generative model of broad music audio.
We derive a new dataset containing $50$ hours of melody transcriptions from crowdsourced annotations of broad music.
arXiv Detail & Related papers (2022-12-04T18:09:23Z) - Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation [158.54649047794794]
Re-creation of Creations (ROC) is a new paradigm for lyric-to-melody generation.
ROC achieves good lyric-melody feature alignment in lyric-to-melody generation.
arXiv Detail & Related papers (2022-08-11T08:44:47Z) - Unaligned Supervision For Automatic Music Transcription in The Wild [1.2183405753834562]
NoteEM is a method for simultaneously training a transcriber and aligning the scores to their corresponding performances.
We report SOTA note-level accuracy of the MAPS dataset, and large favorable margins on cross-dataset evaluations.
arXiv Detail & Related papers (2022-04-28T17:31:43Z) - A Data-Driven Methodology for Considering Feasibility and Pairwise
Likelihood in Deep Learning Based Guitar Tablature Transcription Systems [18.247508110198698]
In this work, symbolic tablature is leveraged to estimate the pairwise likelihood of notes on the guitar.
The output layer of a baseline tablature transcription model is reformulated, such that an inhibition loss can be incorporated to discourage the co-activation of unlikely note pairs.
This naturally enforces playability constraints for guitar, and yields tablature which is more consistent with the symbolic data used to estimate pairwise likelihoods.
arXiv Detail & Related papers (2022-04-17T22:10:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.