Generalized Rectifier Wavelet Covariance Models For Texture Synthesis
- URL: http://arxiv.org/abs/2203.07902v1
- Date: Mon, 14 Mar 2022 17:07:40 GMT
- Title: Generalized Rectifier Wavelet Covariance Models For Texture Synthesis
- Authors: Antoine Brochard, Sixin Zhang, St\'ephane Mallat
- Abstract summary: State-of-the-art maximum entropy models for texture synthesis are built from statistics relying on image representations defined by convolutional neural networks (CNN)
We propose a family of statistics built upon non-linear wavelet based representations, that can be viewed as a particular instance of a one-layer CNN, using a generalized non-linearity.
These statistics significantly improve the visual quality of previous classical wavelet-based models, and allow one to produce syntheses of similar quality to state-of-the-art models, on both grayscale and color textures.
- Score: 2.585403833659771
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: State-of-the-art maximum entropy models for texture synthesis are built from
statistics relying on image representations defined by convolutional neural
networks (CNN). Such representations capture rich structures in texture images,
outperforming wavelet-based representations in this regard. However, conversely
to neural networks, wavelets offer meaningful representations, as they are
known to detect structures at multiple scales (e.g. edges) in images. In this
work, we propose a family of statistics built upon non-linear wavelet based
representations, that can be viewed as a particular instance of a one-layer
CNN, using a generalized rectifier non-linearity. These statistics
significantly improve the visual quality of previous classical wavelet-based
models, and allow one to produce syntheses of similar quality to
state-of-the-art models, on both gray-scale and color textures.
Related papers
- Bayesian Inverse Graphics for Few-Shot Concept Learning [3.475273727432576]
We present a Bayesian model of perception that learns using only minimal data.
We show how this representation can be used for downstream tasks such as few-shot classification and estimation.
arXiv Detail & Related papers (2024-09-12T18:30:41Z) - Grounding and Enhancing Grid-based Models for Neural Fields [52.608051828300106]
This paper introduces a theoretical framework for grid-based models.
The framework points out that these models' approximation and generalization behaviors are determined by grid tangent kernels (GTK)
The introduced framework motivates the development of a novel grid-based model named the Multiplicative Fourier Adaptive Grid (MulFAGrid)
arXiv Detail & Related papers (2024-03-29T06:33:13Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Learning multi-scale local conditional probability models of images [7.07848787073901]
Deep neural networks can learn powerful prior probability models for images, as evidenced by the high-quality generations obtained with recent score-based diffusion methods.
But the means by which these networks capture complex global statistical structure, apparently without suffering from the curse of dimensionality, remain a mystery.
We incorporate diffusion methods into a multi-scale decomposition, reducing dimensionality by assuming a stationary local Markov model for wavelet coefficients conditioned on coarser-scale coefficients.
arXiv Detail & Related papers (2023-03-06T09:23:14Z) - DeepDC: Deep Distance Correlation as a Perceptual Image Quality
Evaluator [53.57431705309919]
ImageNet pre-trained deep neural networks (DNNs) show notable transferability for building effective image quality assessment (IQA) models.
We develop a novel full-reference IQA (FR-IQA) model based exclusively on pre-trained DNN features.
We conduct comprehensive experiments to demonstrate the superiority of the proposed quality model on five standard IQA datasets.
arXiv Detail & Related papers (2022-11-09T14:57:27Z) - Neural Implicit Representations for Physical Parameter Inference from a Single Video [49.766574469284485]
We propose to combine neural implicit representations for appearance modeling with neural ordinary differential equations (ODEs) for modelling physical phenomena.
Our proposed model combines several unique advantages: (i) Contrary to existing approaches that require large training datasets, we are able to identify physical parameters from only a single video.
The use of neural implicit representations enables the processing of high-resolution videos and the synthesis of photo-realistic images.
arXiv Detail & Related papers (2022-04-29T11:55:35Z) - An Efficient Pattern Mining Convolution Neural Network (CNN) algorithm
with Grey Wolf Optimization (GWO) [0.0]
This paper proposed a novel model of feature analysis method with the CNN based on Convoluted Pattern of Wavelet Transform (CPWT) feature vectors.
The performance of this proposed method can be validated by comparing with traditional state-of-art methods.
arXiv Detail & Related papers (2022-04-10T15:18:42Z) - Texture synthesis via projection onto multiscale, multilayer statistics [0.0]
We present a new model for texture synthesis based on a multiscale, multilayer feature extractor.
We explain the necessity of the different types of pre-defined wavelet filters used in our model and the advantages of multilayer structures for image synthesis.
arXiv Detail & Related papers (2021-05-22T23:32:34Z) - Probabilistic Graph Attention Network with Conditional Kernels for
Pixel-Wise Prediction [158.88345945211185]
We present a novel approach that advances the state of the art on pixel-level prediction in a fundamental aspect, i.e. structured multi-scale features learning and fusion.
We propose a probabilistic graph attention network structure based on a novel Attention-Gated Conditional Random Fields (AG-CRFs) model for learning and fusing multi-scale representations in a principled manner.
arXiv Detail & Related papers (2021-01-08T04:14:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.