An approach to melodic segmentation and classification based on filtering with the Haar-wavelet
- URL: http://arxiv.org/abs/2504.20822v1
- Date: Tue, 29 Apr 2025 14:41:03 GMT
- Title: An approach to melodic segmentation and classification based on filtering with the Haar-wavelet
- Authors: Gissel Velarde, Tillman Weyde, David Meredith
- Abstract summary: We present a novel method of classification and segmentation of melodies in symbolic representation. The method is based on filtering pitch as a signal over time with the Haar-wavelet. When used to classify 360 Dutch folk tunes into 26 tune families, the performance of the method is comparable to the use of pitch signals.
- Score: 2.4774640776820105
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present a novel method of classification and segmentation of melodies in symbolic representation. The method is based on filtering pitch as a signal over time with the Haar-wavelet, and we evaluate it on two tasks. The filtered signal corresponds to a single-scale signal w_s from the continuous Haar wavelet transform. The melodies are first segmented using local maxima or zero-crossings of w_s. The segments of w_s are then classified using the k-nearest neighbour algorithm with Euclidean and city-block distances. The method proves more effective than using unfiltered pitch signals and Gestalt-based segmentation when used to recognize the parent works of segments from Bach's Two-Part Inventions (BWV 772-786). When used to classify 360 Dutch folk tunes into 26 tune families, the performance of the method is comparable to the use of pitch signals, but not as good as that of string-matching methods based on multiple features.
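The pipeline outlined in the abstract (Haar filtering at one scale, segmentation at zero-crossings or local maxima of w_s, then k-nearest-neighbour classification) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: all function names are hypothetical, the pitch signal is assumed to be already sampled at a fixed rate, the sign convention of the filter is an assumption, and zero-padding segments to a fixed length is an assumed way of making them comparable.

```python
import numpy as np

def haar_filter(pitch, scale):
    """Correlate a sampled pitch signal with a Haar wavelet spanning `scale` samples.

    Assumption: `scale` is even; the kernel is +1 on its first half and -1 on
    its second half, normalised by the square root of its length.
    """
    half = scale // 2
    kernel = np.concatenate([np.ones(half), -np.ones(half)]) / np.sqrt(2 * half)
    # np.convolve flips its second argument, so reversing the kernel
    # turns the convolution into a correlation with the wavelet.
    return np.convolve(pitch, kernel[::-1], mode="same")

def zero_crossings(w):
    """Indices i where w changes sign between samples i-1 and i."""
    signs = np.sign(w)
    return np.where(signs[1:] * signs[:-1] < 0)[0] + 1

def local_maxima(w):
    """Indices i with w[i-1] < w[i] >= w[i+1]."""
    return np.where((w[1:-1] > w[:-2]) & (w[1:-1] >= w[2:]))[0] + 1

def split_at(w, boundaries, length):
    """Cut w at the boundaries; zero-pad or truncate each segment to `length`."""
    segments = []
    for seg in np.split(w, boundaries):
        fixed = np.zeros(length)
        n = min(len(seg), length)
        fixed[:n] = seg[:n]
        segments.append(fixed)
    return np.array(segments)

def knn_predict(train_x, train_y, query, k=1, metric="euclidean"):
    """Majority label among the k training segments nearest to `query`."""
    diff = np.asarray(train_x) - np.asarray(query)
    if metric == "cityblock":
        dist = np.abs(diff).sum(axis=1)
    else:
        dist = np.sqrt((diff ** 2).sum(axis=1))
    nearest = np.argsort(dist, kind="stable")[:k]
    labels, counts = np.unique(np.asarray(train_y)[nearest], return_counts=True)
    return labels[np.argmax(counts)]

if __name__ == "__main__":
    # Toy MIDI pitch sequence, one value per sample (made-up data).
    pitch = np.array([60, 60, 62, 62, 64, 64, 62, 62, 60, 60, 67, 67], dtype=float)
    w_s = haar_filter(pitch, scale=4)
    segments = split_at(w_s, zero_crossings(w_s), length=8)
    print(segments.shape)
```

In this sketch each melody yields several fixed-length segments; how segment-level predictions are combined into a label for a whole melody is left open here.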
Related papers
- Wavelet-Filtering of Symbolic Music Representations for Folk Tune Segmentation and Classification [2.4774640776820105]
The aim of this study is to evaluate a machine-learning method in which symbolic representations of folk songs are segmented and classified into tune families with Haar-wavelet filtering. We apply the continuous wavelet transform (CWT) with the Haar wavelet at specific scales, obtaining filtered versions of melodies emphasizing their information at particular time-scales. We found that the wavelet-based segmentation and wavelet-filtering of the pitch signal lead to better classification accuracy in cross-validated evaluation when the time-scale and other parameters are optimized.
arXiv Detail & Related papers (2025-04-29T08:02:37Z) - An Explainable Proxy Model for Multilabel Audio Segmentation [1.7611027732647493]
We propose an explainable multilabel segmentation model that solves speech activity (SAD), music (MD), noise (ND) and overlapped speech detection (OSD) simultaneously.
Experiments conducted on two datasets show similar performances as the pre-trained black box model while showing strong explainability features.
arXiv Detail & Related papers (2024-01-16T10:41:33Z) - SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete
Diffusion Process [102.18226145874007]
We propose a model-agnostic solution called SegRefiner to enhance the quality of object masks produced by different segmentation models.
SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process.
It consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks.
arXiv Detail & Related papers (2023-12-19T18:53:47Z) - CorrMatch: Label Propagation via Correlation Matching for
Semi-Supervised Semantic Segmentation [73.89509052503222]
This paper presents a simple but performant semi-supervised semantic segmentation approach, called CorrMatch.
We observe that the correlation maps not only enable clustering pixels of the same category easily but also contain good shape information.
We propose to conduct pixel propagation by modeling the pairwise similarities of pixels to spread the high-confidence pixels and dig out more.
Then, we perform region propagation to enhance the pseudo labels with accurate class-agnostic masks extracted from the correlation maps.
arXiv Detail & Related papers (2023-06-07T10:02:29Z) - Speech segmentation using multilevel hybrid filters [0.0]
A novel approach for speech segmentation is proposed, based on Multilevel Hybrid (mean/min) Filters (MHF).
The proposed method is based on spectral changes, with the goal of segmenting the voice into homogeneous acoustic segments.
This algorithm is being used in a phonetically segmented speech coder, with successful results.
arXiv Detail & Related papers (2022-02-24T00:03:02Z) - A Novel Falling-Ball Algorithm for Image Segmentation [0.14337588659482517]
A Falling-Ball algorithm is presented, which is a region-based segmentation algorithm.
The proposed algorithm detects the catchment basins by assuming that a ball falling from hilly terrains will stop in a catchment basin.
arXiv Detail & Related papers (2021-05-06T12:41:10Z) - PointFlow: Flowing Semantics Through Points for Aerial Image
Segmentation [96.76882806139251]
We propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow.
Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features.
Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods.
arXiv Detail & Related papers (2021-03-11T09:42:32Z) - Simultaneous Grouping and Denoising via Sparse Convex Wavelet Clustering [3.2116198597240846]
We develop a sparse convex wavelet clustering approach that simultaneously denoises and discovers groups.
Our method yields denoised (wavelet-sparse) cluster centroids that both improve interpretability and data compression.
arXiv Detail & Related papers (2020-12-08T22:00:38Z) - Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating
Back-Propagation for Saliency Detection [54.98042023365694]
We propose a noise-aware encoder-decoder framework to disentangle a clean saliency predictor from noisy training examples.
The proposed model consists of two sub-models parameterized by neural networks.
arXiv Detail & Related papers (2020-07-23T18:47:36Z) - Differentiable Hierarchical Graph Grouping for Multi-Person Pose
Estimation [95.72606536493548]
Multi-person pose estimation is challenging because it localizes body keypoints for multiple persons simultaneously.
We propose a novel differentiable Hierarchical Graph Grouping (HGG) method to learn the graph grouping in the bottom-up multi-person pose estimation task.
arXiv Detail & Related papers (2020-07-23T08:46:22Z) - Deep Affinity Net: Instance Segmentation via Affinity [48.498706304017674]
Deep Affinity Net is an effective affinity-based approach accompanied by a new graph partitioning algorithm, Cascade-GAEC.
It achieves the best single-shot result as well as the fastest running time among all affinity-based models.
It also outperforms the region-based method Mask R-CNN.
arXiv Detail & Related papers (2020-03-15T15:22:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.