InfinityGAN: Towards Infinite-Resolution Image Synthesis
- URL: http://arxiv.org/abs/2104.03963v1
- Date: Thu, 8 Apr 2021 17:59:30 GMT
- Title: InfinityGAN: Towards Infinite-Resolution Image Synthesis
- Authors: Chieh Hubert Lin, Hsin-Ying Lee, Yen-Chi Cheng, Sergey Tulyakov,
Ming-Hsuan Yang
- Abstract summary: We present InfinityGAN, a method to generate arbitrary-resolution images.
We show how it trains and infers patch-by-patch seamlessly with low computational resources.
- Score: 92.40782797030977
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present InfinityGAN, a method to generate arbitrary-resolution images. The
problem is associated with several key challenges. First, scaling existing
models to a high resolution is resource-constrained, both in terms of
computation and availability of high-resolution training data. Infinity-GAN
trains and infers patch-by-patch seamlessly with low computational resources.
Second, large images should be locally and globally consistent, avoid
repetitive patterns, and look realistic. To address these, InfinityGAN takes
global appearance, local structure and texture into account.With this
formulation, we can generate images with resolution and level of detail not
attainable before. Experimental evaluation supports that InfinityGAN generates
imageswith superior global structure compared to baselines at the same time
featuring parallelizable inference. Finally, we how several applications
unlocked by our approach, such as fusing styles spatially, multi-modal
outpainting and image inbetweening at arbitrary input and output resolutions
Related papers
- High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion [44.52838839928787]
We introduce a patch-based cascaded pixel diffusion model for frame, HiFI.
We show that HiFI helps significantly with high resolution and complex repeated textures that require global context.
We also show that this technique drastically reduces memory usage at inference time and also allows us to use a single model at test time.
arXiv Detail & Related papers (2024-10-15T17:59:04Z) - Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models [44.437693135170576]
We propose a new framework, LMM with Sophisticated Tasks, Local image compression, and Mixture of global Experts (SliME)
We extract contextual information from the global view using a mixture of adapters, based on the observation that different adapters excel at different tasks.
The proposed method achieves leading performance across various benchmarks with only 2 million training data.
arXiv Detail & Related papers (2024-06-12T17:59:49Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - High-Resolution GAN Inversion for Degraded Images in Large Diverse
Datasets [39.21692649763314]
In this paper, we present a novel GAN inversion framework that utilizes the powerful generative ability of StyleGAN-XL.
To ease the inversion challenge with StyleGAN-XL, Clustering & Regularize Inversion (CRI) is proposed.
We validate our CRI scheme on multiple restoration tasks (i.e., inpainting, colorization, and super-resolution) of complex natural images, and show preferable quantitative and qualitative results.
arXiv Detail & Related papers (2023-02-07T11:24:11Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - Any-resolution Training for High-resolution Image Synthesis [55.19874755679901]
Generative models operate at fixed resolution, even though natural images come in a variety of sizes.
We argue that every pixel matters and create datasets with variable-size images, collected at their native resolutions.
We introduce continuous-scale training, a process that samples patches at random scales to train a new generator with variable output resolutions.
arXiv Detail & Related papers (2022-04-14T17:59:31Z) - Efficient texture-aware multi-GAN for image inpainting [5.33024001730262]
Recent GAN-based (Generative adversarial networks) inpainting methods show remarkable improvements.
We propose a multi-GAN architecture improving both the performance and rendering efficiency.
arXiv Detail & Related papers (2020-09-30T14:58:03Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z) - Gated Fusion Network for Degraded Image Super Resolution [78.67168802945069]
We propose a dual-branch convolutional neural network to extract base features and recovered features separately.
By decomposing the feature extraction step into two task-independent streams, the dual-branch model can facilitate the training process.
arXiv Detail & Related papers (2020-03-02T13:28:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.