Adversarial Latent Autoencoders
- URL: http://arxiv.org/abs/2004.04467v1
- Date: Thu, 9 Apr 2020 10:33:44 GMT
- Title: Adversarial Latent Autoencoders
- Authors: Stanislav Pidhorskyi, Donald Adjeroh, Gianfranco Doretto
- Abstract summary: We introduce an autoencoder that tackles issues jointly, which we call Adversarial Latent Autoencoder (ALAE)
ALAE is the first autoencoder able to compare with, and go beyond the capabilities of a generator-only type of architecture.
- Score: 7.928094304325116
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autoencoder networks are unsupervised approaches aiming at combining
generative and representational properties by learning simultaneously an
encoder-generator map. Although studied extensively, the issues of whether they
have the same generative power of GANs, or learn disentangled representations,
have not been fully addressed. We introduce an autoencoder that tackles these
issues jointly, which we call Adversarial Latent Autoencoder (ALAE). It is a
general architecture that can leverage recent improvements on GAN training
procedures. We designed two autoencoders: one based on a MLP encoder, and
another based on a StyleGAN generator, which we call StyleALAE. We verify the
disentanglement properties of both architectures. We show that StyleALAE can
not only generate 1024x1024 face images with comparable quality of StyleGAN,
but at the same resolution can also produce face reconstructions and
manipulations based on real images. This makes ALAE the first autoencoder able
to compare with, and go beyond the capabilities of a generator-only type of
architecture.
Related papers
- ARTEMIS: Using GANs with Multiple Discriminators to Generate Art [0.0]
We propose a novel method for generating abstract art.
First an autoencoder is trained to encode and decode the style representations of images, which are extracted from source images with a pretrained VGG network.
The decoder component of the autoencoder is extracted and used as a generator in a GAN.
arXiv Detail & Related papers (2023-11-14T16:19:29Z) - In-Domain GAN Inversion for Faithful Reconstruction and Editability [132.68255553099834]
We propose in-domain GAN inversion, which consists of a domain-guided domain-regularized and a encoder to regularize the inverted code in the native latent space of the pre-trained GAN model.
We make comprehensive analyses on the effects of the encoder structure, the starting inversion point, as well as the inversion parameter space, and observe the trade-off between the reconstruction quality and the editing property.
arXiv Detail & Related papers (2023-09-25T08:42:06Z) - Towards Accurate Image Coding: Improved Autoregressive Image Generation
with Dynamic Vector Quantization [73.52943587514386]
Existing vector quantization (VQ) based autoregressive models follow a two-stage generation paradigm.
We propose a novel two-stage framework: (1) Dynamic-Quantization VAE (DQ-VAE) which encodes image regions into variable-length codes based their information densities for accurate representation.
arXiv Detail & Related papers (2023-05-19T14:56:05Z) - ALAP-AE: As-Lite-as-Possible Auto-Encoder [6.244939945140818]
We present a novel algorithm to reduce tensor compute required by a conditional image generation autoencoder.
We show performance gains for various conditional image generation tasks.
We achieve real-time versions of various autoencoders on CPU-only devices while maintaining image quality.
arXiv Detail & Related papers (2022-03-19T18:03:08Z) - LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text
Retrieval [117.15862403330121]
We propose LoopITR, which combines dual encoders and cross encoders in the same network for joint learning.
Specifically, we let the dual encoder provide hard negatives to the cross encoder, and use the more discriminative cross encoder to distill its predictions back to the dual encoder.
arXiv Detail & Related papers (2022-03-10T16:41:12Z) - Autoencoding Video Latents for Adversarial Video Generation [0.0]
AVLAE is a two stream latent autoencoder where the video distribution is learned by adversarial training.
We demonstrate that our approach learns to disentangle motion and appearance codes even without the explicit structural composition in the generator.
arXiv Detail & Related papers (2022-01-18T11:42:14Z) - Adversarial Neural Networks for Error Correcting Codes [76.70040964453638]
We introduce a general framework to boost the performance and applicability of machine learning (ML) models.
We propose to combine ML decoders with a competing discriminator network that tries to distinguish between codewords and noisy words.
Our framework is game-theoretic, motivated by generative adversarial networks (GANs)
arXiv Detail & Related papers (2021-12-21T19:14:44Z) - AE-StyleGAN: Improved Training of Style-Based Auto-Encoders [21.51697087024866]
StyleGANs have shown impressive results on data generation and manipulation in recent years.
In this paper, we focus on style-based generators asking a scientific question: Does forcing such a generator to reconstruct real data lead to more disentangled latent space and make the inversion process from image to latent space easy?
We describe a new methodology to train a style-based autoencoder where the encoder and generator are optimized end-to-end.
arXiv Detail & Related papers (2021-10-17T04:25:51Z) - Adaptable GAN Encoders for Image Reconstruction via Multi-type Latent
Vectors with Two-scale Attentions [24.308432688431996]
We propose a novel method (named MTV-TSA) to handle such problems.
Creating multi-type latent vectors (MTV) from latent space and two-scale attentions (TSA) from images allows designing a set of encoders.
The designed encoders could make GANs reconstruct higher fidelity images from most synthesized HQ images.
arXiv Detail & Related papers (2021-08-23T14:37:58Z) - Autoencoders for unsupervised anomaly detection in high energy physics [105.54048699217668]
We study the tagging of top jet images in a background of QCD jet images.
We show that the standard autoencoder setup cannot be considered as a model-independent anomaly tagger.
We suggest improved performance measures for the task of model-independent anomaly detection.
arXiv Detail & Related papers (2021-04-19T05:06:57Z) - GMM-Based Generative Adversarial Encoder Learning [39.57350884615545]
We present a simple architectural setup that combines the generative capabilities of GAN with an encoder.
We model the output of the encoder latent space via a GMM, which leads to both good clustering using this latent space and improved image generation by the GAN.
arXiv Detail & Related papers (2020-12-08T16:12:16Z) - Learning Autoencoders with Relational Regularization [89.53065887608088]
A new framework is proposed for learning autoencoders of data distributions.
We minimize the discrepancy between the model and target distributions, with a emphrelational regularization
We implement the framework with two scalable algorithms, making it applicable for both probabilistic and deterministic autoencoders.
arXiv Detail & Related papers (2020-02-07T17:27:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.