GAN Inversion for Out-of-Range Images with Geometric Transformations
- URL: http://arxiv.org/abs/2108.08998v1
- Date: Fri, 20 Aug 2021 04:38:40 GMT
- Title: GAN Inversion for Out-of-Range Images with Geometric Transformations
- Authors: Kyoungkook Kang, Seongtae Kim, Sunghyun Cho
- Abstract summary: We propose BDInvert, a novel GAN inversion approach to semantic editing of out-of-range images.
Our experiments show that BDInvert effectively supports semantic editing of out-of-range images with geometric transformations.
- Score: 22.914126221037222
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For successful semantic editing of real images, it is critical for a GAN
inversion method to find an in-domain latent code that aligns with the domain
of a pre-trained GAN model. Unfortunately, such in-domain latent codes can be
found only for in-range images that align with the training images of a GAN
model. In this paper, we propose BDInvert, a novel GAN inversion approach to
semantic editing of out-of-range images that are geometrically unaligned with
the training images of a GAN model. To find a latent code that is semantically
editable, BDInvert inverts an input out-of-range image into an alternative
latent space than the original latent space. We also propose a regularized
inversion method to find a solution that supports semantic editing in the
alternative space. Our experiments show that BDInvert effectively supports
semantic editing of out-of-range images with geometric transformations.
Related papers
- In-Domain GAN Inversion for Faithful Reconstruction and Editability [132.68255553099834]
We propose in-domain GAN inversion, which consists of a domain-guided domain-regularized and a encoder to regularize the inverted code in the native latent space of the pre-trained GAN model.
We make comprehensive analyses on the effects of the encoder structure, the starting inversion point, as well as the inversion parameter space, and observe the trade-off between the reconstruction quality and the editing property.
arXiv Detail & Related papers (2023-09-25T08:42:06Z) - JoIN: Joint GANs Inversion for Intrinsic Image Decomposition [16.02463667910604]
We propose to solve ill-posed inverse imaging problems using a bank of Generative Adversarial Networks (GAN)
Our method builds on the demonstrated success of GANs to capture complex image distributions.
arXiv Detail & Related papers (2023-05-18T22:09:32Z) - Spatial Steerability of GANs via Self-Supervision from Discriminator [123.27117057804732]
We propose a self-supervised approach to improve the spatial steerability of GANs without searching for steerable directions in the latent space.
Specifically, we design randomly sampled Gaussian heatmaps to be encoded into the intermediate layers of generative models as spatial inductive bias.
During inference, users can interact with the spatial heatmaps in an intuitive manner, enabling them to edit the output image by adjusting the scene layout, moving, or removing objects.
arXiv Detail & Related papers (2023-01-20T07:36:29Z) - FlexIT: Towards Flexible Semantic Image Translation [59.09398209706869]
We propose FlexIT, a novel method which can take any input image and a user-defined text instruction for editing.
First, FlexIT combines the input image and text into a single target point in the CLIP multimodal embedding space.
We iteratively transform the input image toward the target point, ensuring coherence and quality with a variety of novel regularization terms.
arXiv Detail & Related papers (2022-03-09T13:34:38Z) - Cycle-Consistent Inverse GAN for Text-to-Image Synthesis [101.97397967958722]
We propose a novel unified framework of Cycle-consistent Inverse GAN for both text-to-image generation and text-guided image manipulation tasks.
We learn a GAN inversion model to convert the images back to the GAN latent space and obtain the inverted latent codes for each image.
In the text-guided optimization module, we generate images with the desired semantic attributes by optimizing the inverted latent codes.
arXiv Detail & Related papers (2021-08-03T08:38:16Z) - Force-in-domain GAN inversion [0.0]
Various semantics emerge in the latent space of Generative Adversarial Networks (GANs) when being trained to generate images.
An in-domain GAN inversion approach is recently proposed to constraint the inverted code within the latent space.
We propose a force-in-domain GAN based on the in-domain GAN, which utilizes a discriminator to force the inverted code within the latent space.
arXiv Detail & Related papers (2021-07-13T13:03:53Z) - The Geometry of Deep Generative Image Models and its Applications [0.0]
Generative adversarial networks (GANs) have emerged as a powerful unsupervised method to model the statistical patterns of real-world data sets.
These networks are trained to map random inputs in their latent space to new samples representative of the learned data.
The structure of the latent space is hard to intuit due to its high dimensionality and the non-linearity of the generator.
arXiv Detail & Related papers (2021-01-15T07:57:33Z) - FDA: Fourier Domain Adaptation for Semantic Segmentation [82.4963423086097]
We describe a simple method for unsupervised domain adaptation, whereby the discrepancy between the source and target distributions is reduced by swapping the low-frequency spectrum of one with the other.
We illustrate the method in semantic segmentation, where densely annotated images are aplenty in one domain, but difficult to obtain in another.
Our results indicate that even simple procedures can discount nuisance variability in the data that more sophisticated methods struggle to learn away.
arXiv Detail & Related papers (2020-04-11T22:20:48Z) - In-Domain GAN Inversion for Real Image Editing [56.924323432048304]
A common practice of feeding a real image to a trained GAN generator is to invert it back to a latent code.
Existing inversion methods typically focus on reconstructing the target image by pixel values yet fail to land the inverted code in the semantic domain of the original latent space.
We propose an in-domain GAN inversion approach, which faithfully reconstructs the input image and ensures the inverted code to be semantically meaningful for editing.
arXiv Detail & Related papers (2020-03-31T18:20:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.