Understanding Human Perception of Music Plagiarism Through a Computational Approach
- URL: http://arxiv.org/abs/2601.02586v1
- Date: Mon, 05 Jan 2026 22:37:19 GMT
- Title: Understanding Human Perception of Music Plagiarism Through a Computational Approach
- Authors: Daeun Hwang, Hyeonbin Hwang,
- Abstract summary: We focus on the three commonly used musical features in similarity analysis: melody, rhythm, and chord progression.<n>We propose a LLM-as-a-judge framework that applies a systematic, step-by-step approach.
- Score: 4.404667592877916
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: There is a wide variety of music similarity detection algorithms, while discussions about music plagiarism in the real world are often based on audience perceptions. Therefore, we aim to conduct a study to examine the key criteria of human perception of music plagiarism, focusing on the three commonly used musical features in similarity analysis: melody, rhythm, and chord progression. After identifying the key features and levels of variation humans use in perceiving musical similarity, we propose a LLM-as-a-judge framework that applies a systematic, step-by-step approach, drawing on modules that extract such high-level attributes.
Related papers
- Music Flamingo: Scaling Music Understanding in Audio Language Models [98.94537017112704]
Music Flamingo is a novel large audio-language model designed to advance music understanding in foundational audio models.<n> MF-Skills is a dataset labeled through a multi-stage pipeline that yields rich captions and question-answer pairs covering harmony, structure, timbre, lyrics, and cultural context.<n>We introduce a post-training recipe: we first cold-start with MF-Think, a novel chain-of-thought dataset grounded in music theory, followed by GRPO-based reinforcement learning with custom rewards.
arXiv Detail & Related papers (2025-11-13T13:21:09Z) - Discovering "Words" in Music: Unsupervised Learning of Compositional Sparse Code for Symbolic Music [50.87225308217594]
This paper presents an unsupervised machine learning algorithm that identifies recurring patterns -- referred to as music-words'' -- from symbolic music data.<n>We formulate the task of music-word discovery as a statistical optimization problem and propose a two-stage Expectation-Maximization (EM)-based learning framework.
arXiv Detail & Related papers (2025-09-29T11:10:57Z) - Real-world Music Plagiarism Detection With Music Segment Transcription System [0.46412974300322135]
We propose a system for detecting music plagiarism by combining various MIR technologies.<n>We developed a music segment transcription system that extracts musically meaningful segments from audio recordings to detect plagiarism.<n>We also collected a Similar Music Pair dataset for musical similarity research using real-world cases.
arXiv Detail & Related papers (2025-09-10T04:55:48Z) - Musical composition and 2D cellular automata based on music intervals [0.0]
The aim of this study was to explore alternatives uses for a cellular automaton in the musical context.<n>We used the complex systems and humanities approaches as a framework for capturing the essence of creating music.
arXiv Detail & Related papers (2024-11-29T17:03:41Z) - Attention-guided Spectrogram Sequence Modeling with CNNs for Music Genre Classification [0.0]
We present an innovative model for classifying music genres using attention-based temporal signature modeling.
Our approach captures the most temporally significant moments within each piece, crafting a unique "signature" for genre identification.
This work bridges the gap between technical classification tasks and the nuanced, human experience of genre.
arXiv Detail & Related papers (2024-11-18T21:57:03Z) - Generating High-quality Symbolic Music Using Fine-grained Discriminators [42.200747558496055]
We propose to decouple the melody and rhythm from music, and design corresponding fine-grained discriminators to tackle the issues.
Specifically, equipped with a pitch augmentation strategy, the melody discriminator discerns the melody variations presented by the generated samples.
The rhythm discriminator, enhanced with bar-level relative positional encoding, focuses on the velocity of generated notes.
arXiv Detail & Related papers (2024-08-03T07:32:21Z) - Fairness Through Domain Awareness: Mitigating Popularity Bias For Music
Discovery [56.77435520571752]
We explore the intrinsic relationship between music discovery and popularity bias.
We propose a domain-aware, individual fairness-based approach which addresses popularity bias in graph neural network (GNNs) based recommender systems.
Our approach uses individual fairness to reflect a ground truth listening experience, i.e., if two songs sound similar, this similarity should be reflected in their representations.
arXiv Detail & Related papers (2023-08-28T14:12:25Z) - Knowledge-based Multimodal Music Similarity [0.0]
This research focuses on the study of musical similarity using both symbolic and audio content.
The aim of this research is to develop a fully explainable and interpretable system that can provide end-users with more control and understanding of music similarity and classification systems.
arXiv Detail & Related papers (2023-06-21T13:12:12Z) - Contrastive Learning with Positive-Negative Frame Mask for Music
Representation [91.44187939465948]
This paper proposes a novel Positive-nEgative frame mask for Music Representation based on the contrastive learning framework, abbreviated as PEMR.
We devise a novel contrastive learning objective to accommodate both self-augmented positives/negatives sampled from the same music.
arXiv Detail & Related papers (2022-03-17T07:11:42Z) - Music Gesture for Visual Sound Separation [121.36275456396075]
"Music Gesture" is a keypoint-based structured representation to explicitly model the body and finger movements of musicians when they perform music.
We first adopt a context-aware graph network to integrate visual semantic context with body dynamics, and then apply an audio-visual fusion model to associate body movements with the corresponding audio signals.
arXiv Detail & Related papers (2020-04-20T17:53:46Z) - Multi-Modal Music Information Retrieval: Augmenting Audio-Analysis with
Visual Computing for Improved Music Video Analysis [91.3755431537592]
This thesis combines audio-analysis with computer vision to approach Music Information Retrieval (MIR) tasks from a multi-modal perspective.
The main hypothesis of this work is based on the observation that certain expressive categories such as genre or theme can be recognized on the basis of the visual content alone.
The experiments are conducted for three MIR tasks Artist Identification, Music Genre Classification and Cross-Genre Classification.
arXiv Detail & Related papers (2020-02-01T17:57:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.