Vector Quantized Image-to-Image Translation
- URL: http://arxiv.org/abs/2207.13286v1
- Date: Wed, 27 Jul 2022 04:22:29 GMT
- Title: Vector Quantized Image-to-Image Translation
- Authors: Yu-Jie Chen, Shin-I Cheng, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee
- Abstract summary: We propose introducing the vector quantization technique into the image-to-image translation framework.
Our framework achieves comparable performance to the state-of-the-art image-to-image translation and image extension methods.
- Score: 31.65282783830092
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Current image-to-image translation methods formulate the task with
conditional generation models, which tend to learn only recolorization or
regional changes because they are constrained by the rich structural
information provided by the conditional contexts. In this work, we propose introducing the
vector quantization technique into the image-to-image translation framework.
The vector quantized content representation can facilitate not only the
translation, but also the unconditional distribution shared among different
domains. Meanwhile, along with the disentangled style representation, the
proposed method further enables the capability of image extension with
flexibility in both intra- and inter-domains. Qualitative and quantitative
experiments demonstrate that our framework achieves comparable performance to
the state-of-the-art image-to-image translation and image extension methods.
Compared to methods for individual tasks, the proposed method, as a unified
framework, unleashes applications combining image-to-image translation,
unconditional generation, and image extension altogether. For example, it
provides style variability for image generation and extension, and equips
image-to-image translation with further extension capabilities.
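Since the abstract centers on a vector quantized content representation, the core operation is a nearest-neighbour lookup into a learned codebook: continuous encoder features are snapped to their closest code entries, trained with a straight-through estimator and the usual codebook/commitment losses. The following is a minimal PyTorch sketch of such a quantizer, not the authors' released code; the class name, codebook size, and loss weighting are illustrative assumptions.
```python
# Minimal sketch of a vector quantization layer (assumed names and hyperparameters).
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    def __init__(self, num_codes: int = 512, code_dim: int = 256, beta: float = 0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta  # weight of the commitment term

    def forward(self, z):  # z: (B, C, H, W) content features from an encoder
        b, c, h, w = z.shape
        flat = z.permute(0, 2, 3, 1).reshape(-1, c)             # (B*H*W, C)
        # Squared Euclidean distance from each feature to every codebook entry
        dist = (flat.pow(2).sum(1, keepdim=True)
                - 2 * flat @ self.codebook.weight.t()
                + self.codebook.weight.pow(2).sum(1))
        idx = dist.argmin(dim=1)                                 # nearest code per spatial location
        z_q = self.codebook(idx).view(b, h, w, c).permute(0, 3, 1, 2)
        # Codebook loss (move codes toward features) + commitment loss (commit features to codes)
        loss = F.mse_loss(z_q, z.detach()) + self.beta * F.mse_loss(z, z_q.detach())
        # Straight-through estimator so gradients flow back to the encoder
        z_q = z + (z_q - z).detach()
        return z_q, idx.view(b, h, w), loss
```
A typical call would be `z_q, codes, vq_loss = VectorQuantizer()(encoder(x))`, with `z_q` then passed to a domain-specific decoder alongside a style code.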
Related papers
- Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance [51.188396199083336]
We present an approach that guides the reverse process of diffusion sampling by applying asymmetric gradient guidance.
Our model's adaptability allows it to be implemented with both image-fusion and latent-diffusion models.
Experiments show that our method outperforms various state-of-the-art models in image translation tasks.
arXiv Detail & Related papers (2023-06-07T12:56:56Z) - Conditional Score Guidance for Text-Driven Image-to-Image Translation [52.73564644268749]
We present a novel algorithm for text-driven image-to-image translation based on a pretrained text-to-image diffusion model.
Our method aims to generate a target image by selectively editing the regions of interest in a source image.
arXiv Detail & Related papers (2023-05-29T10:48:34Z) - Variational Bayesian Framework for Advanced Image Generation with Domain-Related Variables [29.827191184889898]
We present a unified Bayesian framework for advanced conditional generative problems.
We propose a variational Bayesian image translation network (VBITN) that enables multiple image translation and editing tasks.
arXiv Detail & Related papers (2023-05-23T09:47:23Z) - Unsupervised Image-to-Image Translation with Generative Prior [103.54337984566877]
Unsupervised image-to-image translation aims to learn the translation between two visual domains without paired data.
We present a novel framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), to improve the overall quality and applicability of the translation algorithm.
arXiv Detail & Related papers (2022-04-07T17:59:23Z) - Multi-domain Unsupervised Image-to-Image Translation with Appearance Adaptive Convolution [62.4972011636884]
We propose a novel multi-domain unsupervised image-to-image translation (MDUIT) framework.
We exploit the decomposed content feature and appearance adaptive convolution to translate an image into a target appearance.
We show that the proposed method produces visually diverse and plausible results in multiple domains compared to the state-of-the-art methods.
arXiv Detail & Related papers (2022-02-06T14:12:34Z) - Separating Content and Style for Unsupervised Image-to-Image Translation [20.44733685446886]
Unsupervised image-to-image translation aims to learn the mapping between two visual domains with unpaired samples.
We propose to separate the content code and style code simultaneously in a unified framework.
Based on the correlation between the latent features and the high-level domain-invariant tasks, the proposed framework demonstrates superior performance.
arXiv Detail & Related papers (2021-10-27T12:56:50Z) - Unbalanced Feature Transport for Exemplar-based Image Translation [51.54421432912801]
This paper presents a general image translation framework that incorporates optimal transport for feature alignment between conditional inputs and style exemplars (a generic sketch of such feature alignment appears after this list).
We show that our method achieves superior image translation qualitatively and quantitatively as compared with the state-of-the-art.
arXiv Detail & Related papers (2021-06-19T12:07:48Z) - Multiple GAN Inversion for Exemplar-based Image-to-Image Translation [0.0]
We propose Multiple GAN Inversion for Exemplar-based Image-to-Image Translation.
Our novel Multiple GAN Inversion avoids human intervention by using a self-deciding algorithm to choose the number of layers.
Experimental results show the advantage of the proposed method compared to existing state-of-the-art exemplar-based image-to-image translation methods.
arXiv Detail & Related papers (2021-03-26T13:46:14Z) - Style Intervention: How to Achieve Spatial Disentanglement with Style-based Generators? [100.60938767993088]
We propose a lightweight optimization-based algorithm which could adapt to arbitrary input images and render natural translation effects under flexible objectives.
We verify the performance of the proposed framework in facial attribute editing on high-resolution images, where both photo-realism and consistency are required.
arXiv Detail & Related papers (2020-11-19T07:37:31Z)
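As referenced in the Unbalanced Feature Transport entry above, optimal transport can align conditional-input features with style-exemplar features. The sketch below uses standard entropic (Sinkhorn) optimal transport in PyTorch; it is the balanced variant, not that paper's unbalanced formulation, and every name and hyperparameter is an illustrative assumption.
```python
# Hedged sketch of Sinkhorn-based feature alignment (assumed names and parameters).
import torch
import torch.nn.functional as F

def sinkhorn_alignment(f_cond, f_style, eps: float = 0.1, n_iters: int = 50):
    """f_cond: (N, D) conditional-input features; f_style: (M, D) exemplar features.
    Returns the transport plan (N, M) and exemplar features mapped onto the input."""
    f_cond = F.normalize(f_cond, dim=1)                # keep costs bounded for numerical stability
    f_style = F.normalize(f_style, dim=1)
    cost = torch.cdist(f_cond, f_style) ** 2           # pairwise squared distances
    K = torch.exp(-cost / eps)                         # Gibbs kernel
    a = torch.full((f_cond.size(0),), 1.0 / f_cond.size(0), device=f_cond.device)
    b = torch.full((f_style.size(0),), 1.0 / f_style.size(0), device=f_style.device)
    u, v = torch.ones_like(a), torch.ones_like(b)
    for _ in range(n_iters):                           # Sinkhorn fixed-point updates
        u = a / (K @ v + 1e-8)
        v = b / (K.t() @ u + 1e-8)
    plan = u.unsqueeze(1) * K * v.unsqueeze(0)         # rows sum (approx.) to a, columns to b
    aligned = (plan / (plan.sum(dim=1, keepdim=True) + 1e-8)) @ f_style  # barycentric mapping
    return plan, aligned
```
The `aligned` features give, for each input location, a weighted combination of exemplar features and could then condition a decoder.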
This list is automatically generated from the titles and abstracts of the papers on this site.