CooGAN: A Memory-Efficient Framework for High-Resolution Facial
Attribute Editing
- URL: http://arxiv.org/abs/2011.01563v1
- Date: Tue, 3 Nov 2020 08:40:00 GMT
- Title: CooGAN: A Memory-Efficient Framework for High-Resolution Facial
Attribute Editing
- Authors: Xuanhong Chen, Bingbing Ni, Naiyuan Liu, Ziang Liu, Yiliu Jiang, Loc
Truong, and Qi Tian
- Abstract summary: We propose a NOVEL pixel translation framework called Cooperative GAN(CooGAN) for HR facial image editing.
This framework features a local path for fine-grained local facial patch generation (i.e., patch-level HR, LOW memory) and a global path for global lowresolution (LR) facial structure monitoring (i.e., image-level LR, LOW memory)
In addition, we propose a lighter selective transfer unit for more efficient multi-scale features fusion, yielding higher fidelity facial attributes manipulation.
- Score: 84.92009553462384
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In contrast to great success of memory-consuming face editing methods at a
low resolution, to manipulate high-resolution (HR) facial images, i.e.,
typically larger than 7682 pixels, with very limited memory is still
challenging. This is due to the reasons of 1) intractable huge demand of
memory; 2) inefficient multi-scale features fusion. To address these issues, we
propose a NOVEL pixel translation framework called Cooperative GAN(CooGAN) for
HR facial image editing. This framework features a local path for fine-grained
local facial patch generation (i.e., patch-level HR, LOW memory) and a global
path for global lowresolution (LR) facial structure monitoring (i.e.,
image-level LR, LOW memory), which largely reduce memory requirements. Both
paths work in a cooperative manner under a local-to-global consistency
objective (i.e., for smooth stitching). In addition, we propose a lighter
selective transfer unit for more efficient multi-scale features fusion,
yielding higher fidelity facial attributes manipulation. Extensive experiments
on CelebAHQ well demonstrate the memory efficiency as well as the high image
generation quality of the proposed framework.
Related papers
- Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models [44.437693135170576]
We propose a new framework, LMM with Sophisticated Tasks, Local image compression, and Mixture of global Experts (SliME)
We extract contextual information from the global view using a mixture of adapters, based on the observation that different adapters excel at different tasks.
The proposed method achieves leading performance across various benchmarks with only 2 million training data.
arXiv Detail & Related papers (2024-06-12T17:59:49Z) - W-Net: A Facial Feature-Guided Face Super-Resolution Network [8.037821981254389]
Face Super-Resolution aims to recover high-resolution (HR) face images from low-resolution (LR) ones.
Existing approaches are not ideal due to their low reconstruction efficiency and insufficient utilization of prior information.
This paper proposes a novel network architecture called W-Net to address this challenge.
arXiv Detail & Related papers (2024-06-02T09:05:40Z) - Real-time 6K Image Rescaling with Rate-distortion Optimization [53.61374658914277]
We propose a novel framework for real-time 6K rate-distortion-aware image rescaling.
Our framework embeds an HR image into a JPEG LR thumbnail by an encoder.
Then, an efficient frequency-aware decoder reconstructs a high-fidelity HR image from the LR one in real time.
arXiv Detail & Related papers (2023-04-03T15:21:56Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Memory-augmented Deep Unfolding Network for Guided Image
Super-resolution [67.83489239124557]
Guided image super-resolution (GISR) aims to obtain a high-resolution (HR) target image by enhancing the spatial resolution of a low-resolution (LR) target image under the guidance of a HR image.
Previous model-based methods mainly takes the entire image as a whole, and assume the prior distribution between the HR target image and the HR guidance image.
We propose a maximal a posterior (MAP) estimation model for GISR with two types of prior on the HR target image.
arXiv Detail & Related papers (2022-02-12T15:37:13Z) - Best-Buddy GANs for Highly Detailed Image Super-Resolution [71.13466303340192]
We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input.
Most methods along this line rely on a predefined single-LR-single-HR mapping, which is not flexible enough for the SISR task.
We propose best-buddy GANs (Beby-GAN) for rich-detail SISR. Relaxing the immutable one-to-one constraint, we allow the estimated patches to dynamically seek the best supervision.
arXiv Detail & Related papers (2021-03-29T02:58:27Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z) - Joint Face Completion and Super-resolution using Multi-scale Feature
Relation Learning [26.682678558621625]
This paper proposes a multi-scale feature graph generative adversarial network (MFG-GAN) to implement the face restoration of images in which both degradation modes coexist.
Based on the GAN, the MFG-GAN integrates the graph convolution and feature pyramid network to restore occluded low-resolution face images to non-occluded high-resolution face images.
Experimental results on the public-domain CelebA and Helen databases show that the proposed approach outperforms state-of-the-art methods in performing face super-resolution (up to 4x or 8x) and face completion simultaneously.
arXiv Detail & Related papers (2020-02-29T13:31:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.