What's in a Decade? Transforming Faces Through Time
- URL: http://arxiv.org/abs/2210.06642v2
- Date: Mon, 17 Oct 2022 03:01:34 GMT
- Title: What's in a Decade? Transforming Faces Through Time
- Authors: Eric Ming Chen, Jin Sun, Apoorv Khandelwal, Dani Lischinski, Noah
Snavely, Hadar Averbuch-Elor
- Abstract summary: We assemble the Faces Through Time dataset, which contains over a thousand portrait images from each decade, spanning the 1880s to the present day.
We present a framework for resynthesizing portrait images across time, imagining how a portrait taken during a particular decade might have looked like, had it been taken in other decades.
- Score: 70.78847389726937
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: How can one visually characterize people in a decade? In this work, we
assemble the Faces Through Time dataset, which contains over a thousand
portrait images from each decade, spanning the 1880s to the present day. Using
our new dataset, we present a framework for resynthesizing portrait images
across time, imagining how a portrait taken during a particular decade might
have looked like, had it been taken in other decades. Our framework optimizes a
family of per-decade generators that reveal subtle changes that differentiate
decade--such as different hairstyles or makeup--while maintaining the identity
of the input portrait. Experiments show that our method is more effective in
resynthesizing portraits across time compared to state-of-the-art
image-to-image translation methods, as well as attribute-based and
language-guided portrait editing models. Our code and data will be available at
https://facesthroughtime.github.io
Related papers
- When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for
Personalized Image Generation [60.305112612629465]
Text-to-image diffusion models have excelled in producing diverse, high-quality, and photo-realistic images.
We present a novel use of the extended StyleGAN embedding space $mathcalW_+$ to achieve enhanced identity preservation and disentanglement for diffusion models.
Our method adeptly generates personalized text-to-image outputs that are not only compatible with prompt descriptions but also amenable to common StyleGAN editing directions.
arXiv Detail & Related papers (2023-11-29T09:05:14Z) - Enhancing the Authenticity of Rendered Portraits with
Identity-Consistent Transfer Learning [30.64677966402945]
We present a novel photo-realistic portrait generation framework that can effectively mitigate the ''uncanny valley'' effect.
Our key idea is to employ transfer learning to learn an identity-consistent mapping from the latent space of rendered portraits to that of real portraits.
arXiv Detail & Related papers (2023-10-06T12:20:40Z) - ReliTalk: Relightable Talking Portrait Generation from a Single Video [62.47116237654984]
ReliTalk is a novel framework for relightable audio-driven talking portrait generation from monocular videos.
Our key insight is to decompose the portrait's reflectance from implicitly learned audio-driven facial normals and images.
arXiv Detail & Related papers (2023-09-05T17:59:42Z) - Context-Preserving Two-Stage Video Domain Translation for Portrait
Stylization [68.10073215175055]
We propose a novel two-stage video translation framework with an objective function which enforces a model to generate a temporally coherent stylized video.
Our model runs in real-time with the latency of 0.011 seconds per frame and requires only 5.6M parameters.
arXiv Detail & Related papers (2023-05-30T15:46:25Z) - Adversarially-Guided Portrait Matting [0.0]
We present a method for generating alpha mattes using a limited data source.
We pretrain a novel transformerbased model (StyleMatte) on portrait datasets.
We utilize this model to provide image-mask pairs for the StyleGAN3-based network (StyleMatteGAN)
arXiv Detail & Related papers (2023-05-04T16:45:04Z) - Portrait Interpretation and a Benchmark [49.484161789329804]
The proposed portrait interpretation recognizes the perception of humans from a new systematic perspective.
We construct a new dataset that contains 250,000 images labeled with identity, gender, age, physique, height, expression, and posture of the whole body and arms.
Our experimental results demonstrate that combining the tasks related to portrait interpretation can yield benefits.
arXiv Detail & Related papers (2022-07-27T06:25:09Z) - Image Retrieval from Contextual Descriptions [22.084939474881796]
Image Retrieval from Contextual Descriptions (ImageCoDe)
Models tasked with retrieving the correct image from a set of 10 minimally contrastive candidates based on a contextual description.
Best variant achieves an accuracy of 20.9 on video frames and 59.4 on static pictures, compared with 90.8 in humans.
arXiv Detail & Related papers (2022-03-29T19:18:12Z) - CtlGAN: Few-shot Artistic Portraits Generation with Contrastive Transfer
Learning [77.27821665339492]
CtlGAN is a new few-shot artistic portraits generation model with a novel contrastive transfer learning strategy.
We adapt a pretrained StyleGAN in the source domain to a target artistic domain with no more than 10 artistic faces.
We propose a new encoder which embeds real faces into Z+ space and proposes a dual-path training strategy to better cope with the adapted decoder.
arXiv Detail & Related papers (2022-03-16T13:28:17Z) - Lifespan Age Transformation Synthesis [40.963816368819415]
We propose a novel image-to-image generative adversarial network architecture.
Our framework can predict a full head portrait for ages 0-70 from a single photo.
arXiv Detail & Related papers (2020-03-21T22:48:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.