Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics
Driven Reinforcement Learning
- URL: http://arxiv.org/abs/2208.04517v1
- Date: Tue, 9 Aug 2022 03:04:10 GMT
- Title: Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics
Driven Reinforcement Learning
- Authors: Xin Jin, Shu Zhao, Le Zhang, Xin Zhao, Qiang Deng, Chaoen Xiao
- Abstract summary: We build the techniques of reinforcement learning into the generator of EigenGAN.
The agent tries to figure out how to alter the semantic attributes of the generated human faces towards more preferable ones.
We present a new variant incorporating the ingredients emerging in the reinforcement learning communities in recent years.
- Score: 21.329906392100884
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, image generation has made great strides in improving the
quality of images, producing high-fidelity ones. Also, quite recently, there
are architecture designs, which enable GAN to unsupervisedly learn the semantic
attributes represented in different layers. However, there is still a lack of
research on generating face images more consistent with human aesthetics. Based
on EigenGAN [He et al., ICCV 2021], we build the techniques of reinforcement
learning into the generator of EigenGAN. The agent tries to figure out how to
alter the semantic attributes of the generated human faces towards more
preferable ones. To accomplish this, we trained an aesthetics scoring model
that can conduct facial beauty prediction. We also can utilize this scoring
model to analyze the correlation between face attributes and aesthetics scores.
Empirically, using off-the-shelf techniques from reinforcement learning would
not work well. So instead, we present a new variant incorporating the
ingredients emerging in the reinforcement learning communities in recent years.
Compared to the original generated images, the adjusted ones show clear
distinctions concerning various attributes. Experimental results using the
MindSpore, show the effectiveness of the proposed method. Altered facial images
are commonly more attractive, with significantly improved aesthetic levels.
Related papers
- Towards the Detection of AI-Synthesized Human Face Images [12.090322373964124]
This paper presents a benchmark including human face images produced by Generative Adversarial Networks (GANs) and a variety of DMs.
Then, the forgery traces introduced by different generative models have been analyzed in the frequency domain to draw various insights.
The paper further demonstrates that a detector trained with frequency representation can generalize well to other unseen generative models.
arXiv Detail & Related papers (2024-02-13T19:37:44Z) - Detecting Generated Images by Real Images Only [64.12501227493765]
Existing generated image detection methods detect visual artifacts in generated images or learn discriminative features from both real and generated images by massive training.
This paper approaches the generated image detection problem from a new perspective: Start from real images.
By finding the commonality of real images and mapping them to a dense subspace in feature space, the goal is that generated images, regardless of their generative model, are then projected outside the subspace.
arXiv Detail & Related papers (2023-11-02T03:09:37Z) - Neural Image-based Avatars: Generalizable Radiance Fields for Human
Avatar Modeling [28.242591786838936]
We present a method that enables novel views and novel poses of arbitrary human performers from sparse multi-view images.
A key ingredient of our method is a hybrid appearance blending module that combines the advantages of the implicit body NeRF representation and image-based rendering.
arXiv Detail & Related papers (2023-04-10T23:53:28Z) - Stare at What You See: Masked Image Modeling without Reconstruction [154.74533119863864]
Masked Autoencoders (MAE) have been prevailing paradigms for large-scale vision representation pre-training.
Recent approaches apply semantic-rich teacher models to extract image features as the reconstruction target, leading to better performance.
We argue the features extracted by powerful teacher models already encode rich semantic correlation across regions in an intact image.
arXiv Detail & Related papers (2022-11-16T12:48:52Z) - A domain adaptive deep learning solution for scanpath prediction of
paintings [66.46953851227454]
This paper focuses on the eye-movement analysis of viewers during the visual experience of a certain number of paintings.
We introduce a new approach to predicting human visual attention, which impacts several cognitive functions for humans.
The proposed new architecture ingests images and returns scanpaths, a sequence of points featuring a high likelihood of catching viewers' attention.
arXiv Detail & Related papers (2022-09-22T22:27:08Z) - Detecting High-Quality GAN-Generated Face Images using Neural Networks [23.388645531702597]
We propose a new strategy to differentiate GAN-generated images from authentic images by leveraging spectral band discrepancies.
In particular, we enable the digital preservation of face images using the Cross-band co-occurrence matrix and spatial co-occurrence matrix.
We show that the performance boost is particularly significant and achieves more than 92% in different post-processing environments.
arXiv Detail & Related papers (2022-03-03T13:53:27Z) - Heredity-aware Child Face Image Generation with Latent Space
Disentanglement [96.92684978356425]
We propose a novel approach, called ChildGAN, to generate a child's image according to the images of parents with heredity prior.
The main idea is to disentangle the latent space of a pre-trained generation model and precisely control the face attributes of child images with clear semantics.
arXiv Detail & Related papers (2021-08-25T06:59:43Z) - Ensembling with Deep Generative Views [72.70801582346344]
generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
arXiv Detail & Related papers (2021-04-29T17:58:35Z) - Attributes Aware Face Generation with Generative Adversarial Networks [133.44359317633686]
We propose a novel attributes aware face image generator method with generative adversarial networks called AFGAN.
Three stacked generators generate $64 times 64$, $128 times 128$ and $256 times 256$ resolution face images respectively.
In addition, an image-attribute matching loss is proposed to enhance the correlation between the generated images and input attributes.
arXiv Detail & Related papers (2020-12-03T09:25:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.