Style Spectroscope: Improve Interpretability and Controllability through
Fourier Analysis
- URL: http://arxiv.org/abs/2208.06140v1
- Date: Fri, 12 Aug 2022 07:15:33 GMT
- Title: Style Spectroscope: Improve Interpretability and Controllability through
Fourier Analysis
- Authors: Zhiyu Jin and Xuli Shen and Bin Li and Xiangyang Xue
- Abstract summary: Universal style transfer (UST) infuses styles from arbitrary reference images into content images.
Existing methods are unable to explain experimental observations.
We present an equivalent form of the framework in the frequency domain.
- Score: 42.59845771101823
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Universal style transfer (UST) infuses styles from arbitrary reference images
into content images. Existing methods, while enjoying many practical successes,
are unable of explaining experimental observations, including different
performances of UST algorithms in preserving the spatial structure of content
images. In addition, methods are limited to cumbersome global controls on
stylization, so that they require additional spatial masks for desired
stylization. In this work, we provide a systematic Fourier analysis on a
general framework for UST. We present an equivalent form of the framework in
the frequency domain. The form implies that existing algorithms treat all
frequency components and pixels of feature maps equally, except for the
zero-frequency component. We connect Fourier amplitude and phase with Gram
matrices and a content reconstruction loss in style transfer, respectively.
Based on such equivalence and connections, we can thus interpret different
structure preservation behaviors between algorithms with Fourier phase. Given
the interpretations we have, we propose two manipulations in practice for
structure preservation and desired stylization. Both qualitative and
quantitative experiments demonstrate the competitive performance of our method
against the state-of-the-art methods. We also conduct experiments to
demonstrate (1) the abovementioned equivalence, (2) the interpretability based
on Fourier amplitude and phase and (3) the controllability associated with
frequency components.
Related papers
- The Empirical Watershed Wavelet [0.0]
In this paper, we provide theoretical results that permits us to build 2D empirical wavelet filters based on an arbitrary partitioning of the frequency domain.
We also propose an algorithm to detect such partitioning from an image spectrum by combining a scale-space representation to estimate the position of dominant harmonic modes and a watershed transform.
arXiv Detail & Related papers (2024-10-24T22:36:43Z) - FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining [71.46369218331215]
Image deraining aims to remove rain streaks from rainy images and restore clear backgrounds.
We propose a new framework termed FourierMamba, which performs image deraining with Mamba in the Fourier space.
arXiv Detail & Related papers (2024-05-29T18:58:59Z) - HoloNets: Spectral Convolutions do extend to Directed Graphs [59.851175771106625]
Conventional wisdom dictates that spectral convolutional networks may only be deployed on undirected graphs.
Here we show this traditional reliance on the graph Fourier transform to be superfluous.
We provide a frequency-response interpretation of newly developed filters, investigate the influence of the basis used to express filters and discuss the interplay with characteristic operators on which networks are based.
arXiv Detail & Related papers (2023-10-03T17:42:09Z) - A statistically constrained internal method for single image
super-resolution [1.2891210250935146]
We show how a priori information can be integrated into an internal super-resolution approach, by constraining the learned up-sampling procedure of SinGAN.
We consider various types of constraints, related to the Fourier power spectrum, the color histograms and the consistency of the upsampling scheme.
arXiv Detail & Related papers (2023-02-03T10:48:31Z) - Deep Fourier Up-Sampling [100.59885545206744]
Up-sampling in the Fourier domain is more challenging as it does not follow such a local property.
We propose a theoretically sound Deep Fourier Up-Sampling (FourierUp) to solve these issues.
arXiv Detail & Related papers (2022-10-11T06:17:31Z) - Single Stage Virtual Try-on via Deformable Attention Flows [51.70606454288168]
Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image.
We develop a novel Deformable Attention Flow (DAFlow) which applies the deformable attention scheme to multi-flow estimation.
Our proposed method achieves state-of-the-art performance both qualitatively and quantitatively.
arXiv Detail & Related papers (2022-07-19T10:01:31Z) - Seeing Implicit Neural Representations as Fourier Series [13.216389226310987]
Implicit Neural Representations (INR) use multilayer perceptrons to represent high-frequency functions in low-dimensional problem domains.
These representations achieved state-of-the-art results on tasks related to complex 3D objects and scenes.
This work analyzes the connection between the two methods and shows that a Fourier mapped perceptron is structurally like one hidden layer SIREN.
arXiv Detail & Related papers (2021-09-01T08:40:20Z) - A Fourier-based Framework for Domain Generalization [82.54650565298418]
Domain generalization aims at tackling this problem by learning transferable knowledge from multiple source domains in order to generalize to unseen target domains.
This paper introduces a novel Fourier-based perspective for domain generalization.
Experiments on three benchmarks have demonstrated that the proposed method is able to achieve state-of-the-arts performance for domain generalization.
arXiv Detail & Related papers (2021-05-24T06:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.