AgeFlow: Conditional Age Progression and Regression with Normalizing
Flows
- URL: http://arxiv.org/abs/2105.07239v1
- Date: Sat, 15 May 2021 15:02:07 GMT
- Title: AgeFlow: Conditional Age Progression and Regression with Normalizing
Flows
- Authors: Zhizhong Huang, Shouzhen Chen, Junping Zhang, Hongming Shan
- Abstract summary: Age progression and regression aim to synthesize photorealistic appearance of a given face image with aging and rejuvenation effects, respectively.
Existing generative adversarial networks (GANs) based methods suffer from the following three major issues.
This paper proposes a novel framework, termed AgeFlow, to integrate the advantages of both flow-based models and GANs.
- Score: 19.45760984401544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Age progression and regression aim to synthesize photorealistic appearance of
a given face image with aging and rejuvenation effects, respectively. Existing
generative adversarial networks (GANs) based methods suffer from the following
three major issues: 1) unstable training introducing strong ghost artifacts in
the generated faces, 2) unpaired training leading to unexpected changes in
facial attributes such as genders and races, and 3) non-bijective age mappings
increasing the uncertainty in the face transformation. To overcome these
issues, this paper proposes a novel framework, termed AgeFlow, to integrate the
advantages of both flow-based models and GANs. The proposed AgeFlow contains
three parts: an encoder that maps a given face to a latent space through an
invertible neural network, a novel invertible conditional translation module
(ICTM) that translates the source latent vector to target one, and a decoder
that reconstructs the generated face from the target latent vector using the
same encoder network; all parts are invertible achieving bijective age
mappings. The novelties of ICTM are two-fold. First, we propose an
attribute-aware knowledge distillation to learn the manipulation direction of
age progression while keeping other unrelated attributes unchanged, alleviating
unexpected changes in facial attributes. Second, we propose to use GANs in the
latent space to ensure the learned latent vector indistinguishable from the
real ones, which is much easier than traditional use of GANs in the image
domain. Experimental results demonstrate superior performance over existing
GANs-based methods on two benchmarked datasets. The source code is available at
https://github.com/Hzzone/AgeFlow.
Related papers
- G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors [71.69161292330504]
Reversible face anonymization seeks to replace sensitive identity information in facial images with synthesized alternatives.
This paper introduces Gtextsuperscript2Face, which leverages both generative and geometric priors to enhance identity manipulation.
Our method outperforms existing state-of-the-art techniques in face anonymization and recovery, while preserving high data utility.
arXiv Detail & Related papers (2024-08-18T12:36:47Z) - Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers [0.7704032792820767]
Deep neural networks are applied in more and more areas of everyday life.
They still lack essential abilities, such as robustly dealing with spatially transformed input signals.
We propose a novel technique to emulate such an inference process for neural nets.
arXiv Detail & Related papers (2024-05-06T09:47:29Z) - CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using
Score-Based Diffusion Models [57.9771859175664]
Recent generative-prior-based methods have shown promising blind face restoration performance.
Generating fine-grained facial details faithful to inputs remains a challenging problem.
We introduce a diffusion-based-prior inside a VQGAN architecture that focuses on learning the distribution over uncorrupted latent embeddings.
arXiv Detail & Related papers (2024-02-08T23:51:49Z) - SARGAN: Spatial Attention-based Residuals for Facial Expression
Manipulation [1.7056768055368383]
We present a novel method named SARGAN that addresses the limitations from three perspectives.
We exploited a symmetric encoder-decoder network to attend facial features at multiple scales.
Our proposed model performs significantly better than state-of-the-art methods.
arXiv Detail & Related papers (2023-03-30T08:15:18Z) - FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer
Using Neural Generative Adversarial Networks [0.7043489166804575]
We present FaceTuneGAN, a new 3D face model representation decomposing and encoding separately facial identity and facial expression.
We propose a first adaptation of image-to-image translation networks, that have successfully been used in the 2D domain, to 3D face geometry.
arXiv Detail & Related papers (2021-12-01T14:42:03Z) - Disentangled Lifespan Face Synthesis [100.29058545878341]
A lifespan face synthesis (LFS) model aims to generate a set of photo-realistic face images of a person's whole life, given only one snapshot as reference.
The generated face image given a target age code is expected to be age-sensitive reflected by bio-plausible transformations of shape and texture.
This is achieved by extracting shape, texture and identity features separately from an encoder.
arXiv Detail & Related papers (2021-08-05T22:33:14Z) - Mitigating Generation Shifts for Generalized Zero-Shot Learning [52.98182124310114]
Generalized Zero-Shot Learning (GZSL) is the task of leveraging semantic information (e.g., attributes) to recognize the seen and unseen samples, where unseen classes are not observable during training.
We propose a novel Generation Shifts Mitigating Flow framework for learning unseen data synthesis efficiently and effectively.
Experimental results demonstrate that GSMFlow achieves state-of-the-art recognition performance in both conventional and generalized zero-shot settings.
arXiv Detail & Related papers (2021-07-07T11:43:59Z) - Do Generative Models Know Disentanglement? Contrastive Learning is All
You Need [59.033559925639075]
We propose an unsupervised and model-agnostic method: Disentanglement via Contrast (DisCo) in the Variation Space.
DisCo achieves the state-of-the-art disentanglement given pretrained non-disentangled generative models, including GAN, VAE, and Flow.
arXiv Detail & Related papers (2021-02-21T08:01:20Z) - PFA-GAN: Progressive Face Aging with Generative Adversarial Network [19.45760984401544]
This paper proposes a novel progressive face aging framework based on generative adversarial network (PFA-GAN)
The framework can be trained in an end-to-end manner to eliminate accumulative artifacts and blurriness.
Extensively experimental results demonstrate superior performance over existing (c)GANs-based methods.
arXiv Detail & Related papers (2020-12-07T05:45:13Z) - Robust Person Re-Identification through Contextual Mutual Boosting [77.1976737965566]
We propose the Contextual Mutual Boosting Network (CMBN) to localize pedestrians.
It localizes pedestrians and recalibrates features by effectively exploiting contextual information and statistical inference.
Experiments on the benchmarks demonstrate the superiority of the architecture compared the state-of-the-art.
arXiv Detail & Related papers (2020-09-16T06:33:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.