On the Frequency Bias of Generative Models
- URL: http://arxiv.org/abs/2111.02447v1
- Date: Wed, 3 Nov 2021 18:12:11 GMT
- Title: On the Frequency Bias of Generative Models
- Authors: Katja Schwarz and Yiyi Liao and Andreas Geiger
- Abstract summary: We analyze proposed measures against high-frequency artifacts in state-of-the-art GAN training.
We find that none of the existing approaches can fully resolve spectral artifacts yet.
Our results suggest that there is great potential in improving the discriminator.
- Score: 61.60834513380388
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The key objective of Generative Adversarial Networks (GANs) is to generate
new data with the same statistics as the provided training data. However,
multiple recent works show that state-of-the-art architectures still struggle to
achieve this goal. In particular, they report an elevated amount of high
frequencies in the spectral statistics which makes it straightforward to
distinguish real and generated images. Explanations for this phenomenon are
controversial: While most works attribute the artifacts to the generator, other
works point to the discriminator. We take a sober look at those explanations
and provide insights on what makes proposed measures against high-frequency
artifacts effective. To achieve this, we first independently assess the
architectures of both the generator and discriminator and investigate if they
exhibit a frequency bias that makes learning the distribution of high-frequency
content particularly problematic. Based on these experiments, we make the
following four observations: 1) Different upsampling operations bias the
generator towards different spectral properties. 2) Checkerboard artifacts
introduced by upsampling cannot explain the spectral discrepancies alone as the
generator is able to compensate for these artifacts. 3) The discriminator does
not struggle with detecting high frequencies per se but rather struggles with
frequencies of low magnitude. 4) The downsampling operations in the
discriminator can impair the quality of the training signal it provides. In
light of these findings, we analyze proposed measures against high-frequency
artifacts in state-of-the-art GAN training but find that none of the existing
approaches can fully resolve spectral artifacts yet. Our results suggest that
there is great potential in improving the discriminator and that this could be
key to match the distribution of the training data more closely.
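The "spectral statistics" mentioned in the abstract are often inspected through an azimuthally averaged power spectrum, which reduces the 2D Fourier spectrum of an image to a 1D profile over radial frequency. The following is a minimal sketch of that idea, not the authors' evaluation code: the function name `reduced_spectrum`, the 64x64 synthetic images, and the stripe amplitude of 0.1 are illustrative assumptions chosen only to mimic an elevated amount of high frequencies.

```python
import numpy as np


def reduced_spectrum(image):
    """Azimuthally averaged power spectrum of a 2D grayscale image."""
    h, w = image.shape
    # 2D power spectrum with the zero frequency shifted to the center.
    power = np.abs(np.fft.fftshift(np.fft.fft2(image))) ** 2
    # Integer radial frequency (distance from the center) of each coefficient.
    y, x = np.indices((h, w))
    radius = np.hypot(y - h // 2, x - w // 2).astype(int)
    # Mean power inside each radial bin.
    total = np.bincount(radius.ravel(), weights=power.ravel())
    count = np.bincount(radius.ravel())
    profile = total / np.maximum(count, 1)
    # Keep the bins from zero frequency up to the Nyquist frequency.
    return profile[: min(h, w) // 2 + 1]


# Hypothetical stand-ins for a real and a generated image (assumption):
# the "generated" image is the "real" one plus faint stripes that
# alternate every pixel, i.e. content at the highest horizontal frequency.
n = 64
cols = np.arange(n)
real = np.tile(np.cos(2 * np.pi * 2 * cols / n), (n, 1))
stripes = np.tile(np.where(cols % 2 == 0, 1.0, -1.0), (n, 1))
generated = real + 0.1 * stripes

real_profile = reduced_spectrum(real)
generated_profile = reduced_spectrum(generated)

# The low-frequency bins agree, while the last (Nyquist) bin is elevated
# only for the image that contains the stripe pattern.
print("low-frequency bin: ", real_profile[2], generated_profile[2])
print("Nyquist bin:       ", real_profile[-1], generated_profile[-1])
```

In this toy setting the two profiles match at low frequencies and differ only in the highest bin, which is the kind of discrepancy that makes real and generated images easy to tell apart from their spectra.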
Related papers
- Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z)
- Hodge-Aware Contrastive Learning [101.56637264703058]
Simplicial complexes prove effective in modeling data with multiway dependencies.
We develop a contrastive self-supervised learning approach for processing simplicial data.
arXiv Detail & Related papers (2023-09-14T00:40:07Z)
- Investigating and Explaining the Frequency Bias in Image Classification [11.078920943157845]
CNNs exhibit many behaviors different from humans, one of which is the capability of employing high-frequency components.
This paper discusses the frequency bias phenomenon in image classification tasks.
arXiv Detail & Related papers (2022-05-06T11:45:43Z)
- Simpler is better: spectral regularization and up-sampling techniques
for variational autoencoders [1.2234742322758418]
Characterization of the spectral behavior of generative models based on neural networks remains an open issue.
Recent research has focused heavily on generative adversarial networks and the high-frequency discrepancies between real and generated images.
We propose a simple 2D Fourier transform-based spectral regularization loss for Variational Autoencoders (VAEs).
arXiv Detail & Related papers (2022-01-19T11:49:57Z)
- Unsupervised Learning Architecture for Classifying the Transient Noise
of Interferometric Gravitational-wave Detectors [2.8555963243398073]
Transient noise with non-stationary and non-Gaussian features occurs at a high rate.
Classification of transient noise can offer clues for exploring its origin and improving the performance of the detector.
In this study, we propose an unsupervised learning architecture for the classification of transient noise.
arXiv Detail & Related papers (2021-11-19T05:37:06Z)
- A Frequency Perspective of Adversarial Robustness [72.48178241090149]
We present a frequency-based understanding of adversarial examples, supported by theoretical and empirical findings.
Our analysis shows that adversarial examples are neither in high-frequency nor in low-frequency components, but are simply dataset dependent.
We propose a frequency-based explanation for the commonly observed accuracy vs. robustness trade-off.
arXiv Detail & Related papers (2021-10-26T19:12:34Z)
- Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z)
- Spectral Distribution Aware Image Generation [11.295032417617456]
Photo-realistic images produced by deep generative models cannot be easily distinguished from real images by the human eye.
We propose to generate images according to the frequency distribution of the real data by employing a spectral discriminator.
We show that the resulting models can better generate images with realistic frequency spectra, which are thus harder to detect by this cue.
arXiv Detail & Related papers (2020-12-05T19:46:48Z)
- WaveTransform: Crafting Adversarial Examples via Input Decomposition [69.01794414018603]
We introduce WaveTransform, which creates adversarial noise corresponding to low-frequency and high-frequency subbands, separately or in combination.
Experiments show that the proposed attack is effective against the defense algorithm and is also transferable across CNNs.
arXiv Detail & Related papers (2020-10-29T17:16:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.