Related papers: Review of wavelet-based unsupervised texture segmentation, advantage of adaptive wavelets

Related papers

Wavelet-Filtering of Symbolic Music Representations for Folk Tune Segmentation and Classification [2.4774640776820105]
The aim of this study is to evaluate a machine-learning method in which symbolic representations of folk songs are segmented and classified into tune families with Haar-wavelet filtering. We apply the continuous wavelet transform (CWT) with the Haar wavelet at specific scales, obtaining filtered versions of melodies emphasizing their information at particular time-scales. We found that the wavelet based segmentation and wavelet-filtering of the pitch signal lead to better classification accuracy in cross-validated evaluation when the time-scale and other parameters are optimized.
arXiv Detail & Related papers (2025-04-29T08:02:37Z)
BELE: Blur Equivalent Linearized Estimator [0.8192907805418581]
This paper introduces a novel parametric model that separates perceptual effects due to strong edge degradations from those caused by texture distortions. The first is the Blur Equivalent Linearized Estimator, designed to measure blur on strong and isolated edges. The second is a Complex Peak Signal-to-Noise Ratio, which evaluates distortions affecting texture regions.
arXiv Detail & Related papers (2025-03-01T14:19:08Z)
WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing [51.94493817128006]
We propose a novel wavelet-domain deep unfolding framework named WTDUN, which operates directly on the multi-scale wavelet subbands. Our method utilizes the intrinsic sparsity and multi-scale structure of wavelet coefficients to achieve a tree-structured sampling and reconstruction.
arXiv Detail & Related papers (2024-11-25T12:31:03Z)
Empirical curvelet based Fully Convolutional Network for supervised texture image segmentation [2.780132626494265]
We propose a new approach to perform supervised texture classification/segmentation. The proposed idea is to feed a Fully Convolutional Network with specific texture descriptors. Our approach is evaluated on several datasets and compare the results to various state-of-the-art algorithms.
arXiv Detail & Related papers (2024-10-28T21:49:40Z)
Generating Non-Stationary Textures using Self-Rectification [70.91414475376698]
This paper addresses the challenge of example-based non-stationary texture synthesis. We introduce a novel twostep approach wherein users first modify a reference texture using standard image editing tools. Our proposed method, termed "self-rectification", automatically refines this target into a coherent, seamless texture.
arXiv Detail & Related papers (2024-01-05T15:07:05Z)
Retinex-guided Channel-grouping based Patch Swap for Arbitrary Style Transfer [54.25418866649519]
The basic principle of the patch-matching based style transfer is to substitute the patches of the content image feature maps by the closest patches from the style image feature maps. Existing techniques treat the full-channel style feature patches as simple signal tensors and create new style feature patches via signal-level fusion. We propose a Retinex theory guided, channel-grouping based patch swap technique to solve the above challenges.
arXiv Detail & Related papers (2023-09-19T11:13:56Z)
Single Image Depth Estimation using Wavelet Decomposition [37.486778463181]
We present a novel method for predicting accurate depths from monocular images with high efficiency. This optimal efficiency is achieved by exploiting wavelet decomposition. We demonstrate that we can reconstruct high-fidelity depth maps by predicting sparse wavelet coefficients.
arXiv Detail & Related papers (2021-06-03T17:42:25Z)
Multi-modal Conditional Bounding Box Regression for Music Score Following [7.360807642941713]
This paper addresses the problem of sheet-image-based on-line audio-to-score alignment also known as score following. A conditional neural network architecture is proposed that directly predicts x,y coordinates of the matching positions in a complete score sheet image at each point in time for a given musical performance.
arXiv Detail & Related papers (2021-05-10T12:43:35Z)
Panoster: End-to-end Panoptic Segmentation of LiDAR Point Clouds [81.12016263972298]
We present Panoster, a novel proposal-free panoptic segmentation method for LiDAR point clouds. Unlike previous approaches, Panoster proposes a simplified framework incorporating a learning-based clustering solution to identify instances. At inference time, this acts as a class-agnostic segmentation, allowing Panoster to be fast, while outperforming prior methods in terms of accuracy.
arXiv Detail & Related papers (2020-10-28T18:10:20Z)
Contour Integration using Graph-Cut and Non-Classical Receptive Field [4.935491924643742]
We propose a novel method to detect image contours from the extracted edge segments of other algorithms. The proposed energy functions are inspired by the surround modulation in the primary visual cortex that help suppressing texture noise.
arXiv Detail & Related papers (2020-10-27T19:07:13Z)
WaveGrad: Estimating Gradients for Waveform Generation [55.405580817560754]
WaveGrad is a conditional model for waveform generation which estimates gradients of the data density. It starts from a Gaussian white noise signal and iteratively refines the signal via a gradient-based sampler conditioned on the mel-spectrogram. We find that it can generate high fidelity audio samples using as few as six iterations.
arXiv Detail & Related papers (2020-09-02T17:44:10Z)
Real Time Speech Enhancement in the Waveform Domain [99.02180506016721]
We present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is capable of removing various kinds of background noise including stationary and non-stationary noises.
arXiv Detail & Related papers (2020-06-23T09:19:13Z)
Decoding Imagined Speech using Wavelet Features and Deep Neural Networks [2.4063592468412267]
This paper proposes a novel approach that uses deep neural networks for classifying imagined speech. The proposed approach employs only the EEG channels over specific areas of the brain for classification, and derives distinct feature vectors from each of those channels. The proposed architecture and the approach of treating the data have resulted in an average classification accuracy of 57.15%, which is an improvement of around 35% over the state-of-the-art results.
arXiv Detail & Related papers (2020-03-19T00:36:19Z)

This list is automatically generated from the titles and abstracts of the papers in this site.