Source Separation with Deep Generative Priors
- URL: http://arxiv.org/abs/2002.07942v2
- Date: Mon, 21 Sep 2020 17:19:09 GMT
- Title: Source Separation with Deep Generative Priors
- Authors: Vivek Jayaram, John Thickstun
- Abstract summary: We use generative models as priors over the components of a mixture of sources, and noise-annealed Langevin dynamics to sample from the posterior distribution of sources given a mixture.
This decouples the source separation problem from generative modeling, enabling us to directly use cutting-edge generative models as priors.
The method achieves state-of-the-art performance for MNIST digit separation.
- Score: 17.665938343060112
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Despite substantial progress in signal source separation, results for richly
structured data continue to contain perceptible artifacts. In contrast, recent
deep generative models can produce authentic samples in a variety of domains
that are indistinguishable from samples of the data distribution. This paper
introduces a Bayesian approach to source separation that uses generative models
as priors over the components of a mixture of sources, and noise-annealed
Langevin dynamics to sample from the posterior distribution of sources given a
mixture. This decouples the source separation problem from generative modeling,
enabling us to directly use cutting-edge generative models as priors. The
method achieves state-of-the-art performance for MNIST digit separation. We
introduce new methodology for evaluating separation quality on richer datasets,
providing quantitative evaluation of separation results on CIFAR-10. We also
provide qualitative results on LSUN.
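The core of the method is compact enough to sketch. Below is a minimal, self-contained illustration of noise-annealed Langevin separation on a toy problem where each source has a one-dimensional Gaussian prior, so the noise-smoothed scores are analytic. In real use, `score` would be a trained generative model's score function at noise level sigma; tying the mixture-constraint weight to the annealing level is a stability assumption here, not necessarily the paper's exact schedule.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: each source is one-dimensional with a known Gaussian prior,
# so the noise-smoothed score is analytic. In practice, `score` would be
# a trained generative model's score at noise level sigma.
MU = np.array([-2.0, 3.0])    # prior means of the two sources
TAU = np.array([1.0, 0.5])    # prior standard deviations

def score(x, i, sigma):
    """Score of source i's prior convolved with N(0, sigma^2)."""
    return (MU[i] - x) / (TAU[i] ** 2 + sigma ** 2)

def separate(m, sigmas=(1.0, 0.5, 0.25, 0.1, 0.05), steps=200):
    """Noise-annealed Langevin sampling from p(x1, x2 | x1 + x2 = m)."""
    x = rng.normal(size=2)                    # initial source estimates
    for sigma in sigmas:                      # anneal the noise level down
        eta = 0.1 * sigma ** 2                # step size tied to noise level
        for _ in range(steps):
            grad = np.array([score(x[i], i, sigma) for i in (0, 1)])
            # Mixture-likelihood gradient; scaling it by sigma^2 (an
            # assumption) keeps the updates stable at every level.
            grad += (m - x.sum()) / sigma ** 2
            x += eta * grad + np.sqrt(2 * eta) * rng.normal(size=2)
    return x

x_true = rng.normal(MU, TAU)                  # ground-truth sources
x_est = separate(x_true.sum())
print("true:", x_true, "estimated:", x_est)
```

The annealing is what makes this work in rich domains: starting at a high noise level lets the sampler mix across modes before the priors sharpen around plausible sources.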
Related papers
- Score-based generative diffusion with "active" correlated noise sources [38.32962448223814]
Diffusion models exhibit robust generative properties by approximating the underlying distribution of a dataset.
In this work, we explore how the generative performance may be modulated if noise sources with temporal correlations are used for the destruction of the data.
arXiv Detail & Related papers (2024-11-11T18:51:08Z)
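As a rough illustration of the entry above, the sketch below corrupts data with temporally correlated Ornstein-Uhlenbeck (AR(1)) noise along the noise schedule instead of i.i.d. Gaussian noise. The function names and the variance-exploding corruption are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def ou_noise(n_steps, dim, tau=10.0):
    """Ornstein-Uhlenbeck (AR(1)) noise: correlated across diffusion steps,
    with unit marginal variance. tau controls the correlation time."""
    rho = np.exp(-1.0 / tau)                    # step-to-step correlation
    eps = np.empty((n_steps, dim))
    eps[0] = rng.normal(size=dim)
    for t in range(1, n_steps):
        eps[t] = rho * eps[t - 1] + np.sqrt(1 - rho ** 2) * rng.normal(size=dim)
    return eps

def corrupt(x0, sigmas, eps):
    """Forward corruption x_t = x_0 + sigma_t * eps_t, with eps_t drawn
    from the correlated process rather than i.i.d. white noise."""
    return x0[None, :] + sigmas[:, None] * eps

x0 = rng.normal(size=4)                          # a toy data point
sigmas = np.linspace(0.01, 1.0, 50)              # noise schedule
xt = corrupt(x0, sigmas, ou_noise(len(sigmas), x0.size))
print(xt.shape)                                  # (50, 4): one noisy copy per level
```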
- DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception [78.26734070960886]
Current perceptive models heavily depend on resource-intensive datasets.
We introduce a perception-aware loss (P.A. loss) based on segmentation, improving both quality and controllability.
Our method customizes data augmentation by extracting and using a perception-aware attribute (P.A. Attr) during generation.
arXiv Detail & Related papers (2024-03-20T04:58:03Z)
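The abstract does not spell the loss out, but one plausible reading is a standard denoising objective plus a segmentation loss on the model's implied clean-image estimate. The sketch below is a hypothetical construction along those lines; `pa_loss`, the stand-in networks, and the weight `lam` are all assumptions, not the paper's recipe.

```python
import torch
import torch.nn.functional as F

def pa_loss(denoiser, segmenter, x0, masks, t, lam=0.1):
    """Hypothetical perception-aware objective: the usual denoising loss
    plus a segmentation loss on the model's implied clean-image estimate."""
    noise = torch.randn_like(x0)
    sigma = t.view(-1, 1, 1, 1)                   # per-sample noise level
    xt = x0 + sigma * noise                       # corrupt the image
    eps_hat = denoiser(xt, t)                     # predict the added noise
    diff_loss = F.mse_loss(eps_hat, noise)        # standard diffusion term
    x0_hat = xt - sigma * eps_hat                 # implied denoised image
    seg_loss = F.cross_entropy(segmenter(x0_hat), masks)  # perceptual term
    return diff_loss + lam * seg_loss

# Toy usage with stand-in networks (assumptions, not the paper's models).
eps_net = torch.nn.Conv2d(3, 3, 3, padding=1)
seg_net = torch.nn.Conv2d(3, 5, 3, padding=1)     # 5 segmentation classes
x0 = torch.randn(2, 3, 16, 16)
masks = torch.randint(0, 5, (2, 16, 16))
loss = pa_loss(lambda x, t: eps_net(x), seg_net, x0, masks, torch.rand(2))
loss.backward()
```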
- Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood [64.95663299945171]
Training energy-based models (EBMs) on high-dimensional data can be both challenging and time-consuming.
There exists a noticeable gap in sample quality between EBMs and other generative frameworks like GANs and diffusion models.
We propose cooperative diffusion recovery likelihood (CDRL), an effective approach to tractably learn and sample from a series of EBMs.
arXiv Detail & Related papers (2023-09-10T22:05:24Z)
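The recovery-likelihood idea can be illustrated compactly: instead of sampling the EBM prior directly, one samples the much better-conditioned posterior given a noisy copy of the data. A minimal sketch, with a toy quadratic log-density standing in for a learned EBM:

```python
import torch

def recovery_langevin(log_density, x_tilde, sigma, steps=30, eta=1e-2):
    """Sample the 'recovery' posterior p(x | x_tilde), which combines an EBM
    prior exp(log_density(x)) with a Gaussian observation N(x_tilde; x, sigma^2).
    Conditioning on a noisy copy makes the target far easier to sample than
    the raw EBM, which is the core idea behind recovery likelihood."""
    x = x_tilde.clone()
    for _ in range(steps):
        x = x.detach().requires_grad_(True)
        log_p = log_density(x).sum() - ((x - x_tilde) ** 2).sum() / (2 * sigma ** 2)
        (grad,) = torch.autograd.grad(log_p, x)
        x = x + 0.5 * eta * grad + (eta ** 0.5) * torch.randn_like(x)
    return x.detach()

# Toy quadratic log-density standing in for a learned EBM (an assumption).
log_density = lambda x: -0.5 * (x ** 2).sum(dim=-1)
x_tilde = torch.randn(8, 2) * 1.5
samples = recovery_langevin(log_density, x_tilde, sigma=0.5)
```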
- Improving Out-of-Distribution Robustness of Classifiers via Generative Interpolation [56.620403243640396]
Deep neural networks achieve superior performance for learning from independent and identically distributed (i.i.d.) data.
However, their performance deteriorates significantly when handling out-of-distribution (OoD) data.
We develop a simple yet effective method called Generative Interpolation to fuse generative models trained from multiple domains for synthesizing diverse OoD samples.
arXiv Detail & Related papers (2023-07-23T03:53:53Z)
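The abstract leaves "fusing" generators open; one simple reading is a convex combination of the weights of generators trained on different domains, sampled to produce diverse pseudo-OoD augmentations. The sketch below follows that reading with stand-in linear generators; it is an assumption, not the paper's exact recipe.

```python
import copy
import torch

def interpolate_generators(gen_a, gen_b, alpha):
    """Convexly combine the weights of two domain-specific generators.
    Sampling from the mixed generator yields pseudo-out-of-distribution
    data for augmentation (one plausible reading of 'fusing' generators)."""
    mixed = copy.deepcopy(gen_a)
    sd_a, sd_b = gen_a.state_dict(), gen_b.state_dict()
    mixed.load_state_dict({k: (1 - alpha) * sd_a[k] + alpha * sd_b[k] for k in sd_a})
    return mixed

# Toy generators mapping a latent code to an "image" (stand-ins).
gen_a = torch.nn.Linear(8, 32)
gen_b = torch.nn.Linear(8, 32)
aug = interpolate_generators(gen_a, gen_b, alpha=0.5)  # alpha could be resampled per batch
fake_batch = aug(torch.randn(16, 8))                   # synthetic samples for augmentation
```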
- Score-based Source Separation with Applications to Digital Communication Signals [72.6570125649502]
We propose a new method for separating superimposed sources using diffusion-based generative models.
Motivated by applications in radio-frequency (RF) systems, we are interested in sources with an underlying discrete nature.
Our method can be viewed as a multi-source extension to the recently proposed score distillation sampling scheme.
arXiv Detail & Related papers (2023-06-26T04:12:40Z)
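A minimal sketch of multi-source separation with score models, with the mixture constraint enforced exactly by parametrizing the second source as m - x1 and ascending both smoothed log-priors at randomly drawn noise levels. The analytic Gaussian scores and the update schedule are toy assumptions, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy analytic scores: two Gaussian priors stand in for trained score
# models of the individual communication signals.
def score(x, mu, tau, sigma):
    return (mu - x) / (tau ** 2 + sigma ** 2)

def separate_hard_constraint(m, steps=500, lr=5e-2):
    """Estimate source 1 directly; source 2 is pinned to m - x1, so the
    mixture constraint holds exactly. Each update ascends the smoothed
    log-priors of both sources at a randomly drawn noise level, loosely
    in the spirit of score-distillation-style updates."""
    x1 = m / 2.0
    for _ in range(steps):
        sigma = 10 ** rng.uniform(-2, 0)          # random smoothing level
        g = score(x1, -1.0, 0.3, sigma)           # prior score for source 1
        g -= score(m - x1, 2.0, 0.5, sigma)       # chain rule through x2 = m - x1
        x1 += lr * g
    return x1, m - x1

x1_true, x2_true = -1.1, 2.2
print(separate_hard_constraint(x1_true + x2_true))
```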
- Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z)
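For concreteness, the quantity at the center of such evaluations is the Fréchet distance between Gaussians fitted to real and generated features (the basis of FID). A minimal version, with random vectors standing in for Inception features:

```python
import numpy as np
from scipy import linalg

def frechet_distance(feats_real, feats_fake):
    """Fréchet distance between Gaussians fitted to two feature sets, the
    quantity underlying FID. The study above probes how the choice of
    feature extractor, distance computation, and sample count affect it."""
    mu1, mu2 = feats_real.mean(0), feats_fake.mean(0)
    c1 = np.cov(feats_real, rowvar=False)
    c2 = np.cov(feats_fake, rowvar=False)
    covmean = linalg.sqrtm(c1 @ c2)
    if np.iscomplexobj(covmean):                 # trim numerical imaginary parts
        covmean = covmean.real
    return ((mu1 - mu2) ** 2).sum() + np.trace(c1 + c2 - 2 * covmean)

rng = np.random.default_rng(0)
real = rng.normal(size=(5000, 16))               # stand-ins for Inception features
fake = rng.normal(loc=0.1, size=(5000, 16))
print(frechet_distance(real, fake))
```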
- Distribution Preserving Source Separation With Time Frequency Predictive Models [2.4201849657206496]
We provide an example of a distribution preserving source separation method, which aims at addressing perceptual shortcomings of state-of-the-art methods.
The separated signals follow their respective source distributions, which provides an advantage when separation results are evaluated in a listening test.
arXiv Detail & Related papers (2023-03-10T13:05:30Z)
- Multi-Source Diffusion Models for Simultaneous Music Generation and Separation [17.124189082882395]
We train our model on Slakh2100, a standard dataset for musical source separation.
Our method is the first example of a single model that can handle both generation and separation tasks.
arXiv Detail & Related papers (2023-02-04T23:18:36Z)
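The generation/separation duality can be sketched with one toy "joint score" over stacked stems: unconditional annealed Langevin sampling generates stems, while a single added mixture-consistency gradient turns the same loop into separation. The analytic Gaussian scores below stand in for the trained network; the constraint weighting is an assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

MU = np.array([-2.0, 0.0, 3.0])     # toy per-stem prior means (3 "stems")
TAU = np.array([0.7, 1.0, 0.5])

def joint_score(x, sigma):
    """Stand-in for one score network over the stacked stems."""
    return (MU - x) / (TAU ** 2 + sigma ** 2)

def sample(mixture=None, sigmas=(1.0, 0.5, 0.25, 0.1), steps=200):
    """One sampler, two tasks: with mixture=None this generates stems;
    with a mixture given, one extra gradient term turns the same loop
    into posterior sampling, i.e. source separation."""
    x = rng.normal(size=3)
    for sigma in sigmas:
        eta = 0.1 * sigma ** 2
        for _ in range(steps):
            g = joint_score(x, sigma)
            if mixture is not None:
                g += (mixture - x.sum()) / sigma ** 2   # mixture consistency
            x += eta * g + np.sqrt(2 * eta) * rng.normal(size=3)
    return x

print("generated stems:", sample())
print("separated stems:", sample(mixture=1.0))
```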
- Diffusion-based Generative Speech Source Separation [27.928990101986862]
We propose DiffSep, a new single-channel source separation method based on score matching of a stochastic differential equation (SDE).
Experiments on the WSJ0 2mix dataset demonstrate the potential of the method.
The method is also suitable for speech enhancement and shows performance competitive with prior work on the VoiceBank-DEMAND dataset.
arXiv Detail & Related papers (2022-10-31T13:46:55Z)
- Improving the Reconstruction of Disentangled Representation Learners via Multi-Stage Modeling [54.94763543386523]
Current autoencoder-based disentangled representation learning methods achieve disentanglement by penalizing the (aggregate) posterior to encourage statistical independence of the latent factors.
We present a novel multi-stage modeling approach where the disentangled factors are first learned using a penalty-based disentangled representation learning method.
Then, the low-quality reconstruction is improved with another deep generative model that is trained to model the missing correlated latent variables.
arXiv Detail & Related papers (2020-10-25T18:51:15Z)
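A minimal sketch of the two-stage idea, with an L2 penalty on the code as a crude stand-in for the disentanglement penalty (e.g. a scaled KL term) and stand-in linear networks; shapes and objectives are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

# Stage-1 nets learn disentangled factors; the stage-2 net restores detail.
enc = torch.nn.Linear(32, 8)          # stage-1 encoder -> disentangled code
dec = torch.nn.Linear(8, 32)          # stage-1 decoder (blurry reconstructions)
refiner = torch.nn.Linear(32 + 8, 32) # stage-2 model restores lost detail

def stage1_loss(x, beta=4.0):
    """Penalty-based objective: a heavy penalty buys disentanglement at the
    cost of reconstruction quality (an L2 code penalty stands in for the KL)."""
    z = enc(x)
    recon = dec(z)
    return F.mse_loss(recon, x) + beta * z.pow(2).mean()

def stage2_loss(x):
    """Stage 2: with the stage-1 factors frozen, a second model learns the
    residual structure the penalty threw away, improving reconstructions."""
    with torch.no_grad():                     # factors are fixed after stage 1
        z = enc(x)
        coarse = dec(z)
    refined = refiner(torch.cat([coarse, z], dim=-1))
    return F.mse_loss(refined, x)

x = torch.randn(16, 32)
stage1_loss(x).backward()
stage2_loss(x).backward()
```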
This list is automatically generated from the titles and abstracts of the papers on this site.
The site does not guarantee the quality of this list (including all information) and is not responsible for any consequences of its use.