DCFace: Synthetic Face Generation with Dual Condition Diffusion Model
        - URL: http://arxiv.org/abs/2304.07060v1
- Date: Fri, 14 Apr 2023 11:31:49 GMT
- Title: DCFace: Synthetic Face Generation with Dual Condition Diffusion Model
- Authors: Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu
- Abstract summary: We propose a Dual Condition Face Generator (DCFace) based on a diffusion model.
Our novel Patch-wise style extractor and Time-step dependent ID loss enables DCFace to consistently produce face images of the same subject under different styles with precise control.
- Score: 18.662943303044315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Generating synthetic datasets for training face recognition models is
challenging because dataset generation entails more than creating high fidelity
images. It involves generating multiple images of same subjects under different
factors (\textit{e.g.}, variations in pose, illumination, expression, aging and
occlusion) which follows the real image conditional distribution. Previous
works have studied the generation of synthetic datasets using GAN or 3D models.
In this work, we approach the problem from the aspect of combining subject
appearance (ID) and external factor (style) conditions. These two conditions
provide a direct way to control the inter-class and intra-class variations. To
this end, we propose a Dual Condition Face Generator (DCFace) based on a
diffusion model. Our novel Patch-wise style extractor and Time-step dependent
ID loss enables DCFace to consistently produce face images of the same subject
under different styles with precise control. Face recognition models trained on
synthetic images from the proposed DCFace provide higher verification
accuracies compared to previous works by $6.11\%$ on average in $4$ out of $5$
test datasets, LFW, CFP-FP, CPLFW, AgeDB and CALFW. Code is available at
https://github.com/mk-minchul/dcface
 
      
        Related papers
        - Towards Consistent and Controllable Image Synthesis for Face Editing [18.646961062736207]
 RigFace is a novel approach to control the lighting, facial expression and head pose of a portrait photo.
Our model achieves comparable or even superior performance in both identity preservation and photorealism compared to existing face editing models.
 arXiv  Detail & Related papers  (2025-02-04T16:36:07Z)
- OSDFace: One-Step Diffusion Model for Face Restoration [72.5045389847792]
 Diffusion models have demonstrated impressive performance in face restoration.
We propose OSDFace, a novel one-step diffusion model for face restoration.
Results demonstrate that OSDFace surpasses current state-of-the-art (SOTA) methods in both visual quality and quantitative metrics.
 arXiv  Detail & Related papers  (2024-11-26T07:07:48Z)
- Realistic and Efficient Face Swapping: A Unified Approach with Diffusion   Models [69.50286698375386]
 We propose a novel approach that better harnesses diffusion models for face-swapping.
We introduce a mask shuffling technique during inpainting training, which allows us to create a so-called universal model for swapping.
Ours is a relatively unified approach and so it is resilient to errors in other off-the-shelf models.
 arXiv  Detail & Related papers  (2024-09-11T13:43:53Z)
- TCDiff: Triple Condition Diffusion Model with 3D Constraints for   Stylizing Synthetic Faces [1.7535229154829601]
 Face recognition experiments using 1k, 2k, and 5k classes of our new dataset for training outperform state-of-the-art synthetic datasets in real face benchmarks.
 arXiv  Detail & Related papers  (2024-09-05T14:59:41Z)
- Arc2Face: A Foundation Model for ID-Consistent Human Faces [95.00331107591859]
 Arc2Face is an identity-conditioned face foundation model.
It can generate diverse photo-realistic images with an unparalleled degree of face similarity than existing models.
 arXiv  Detail & Related papers  (2024-03-18T10:32:51Z)
- Face Swap via Diffusion Model [4.026688121914668]
 This report presents a diffusion model based framework for face swapping between two portrait images.
The basic framework consists of three components, for face feature encoding, multi-conditional generation, and face inpainting respectively.
 arXiv  Detail & Related papers  (2024-03-02T07:02:17Z)
- Controllable 3D Face Generation with Conditional Style Code Diffusion [51.24656496304069]
 TEx-Face(TExt & Expression-to-Face) addresses challenges by dividing the task into three components, i.e., 3D GAN Inversion, Conditional Style Code Diffusion, and 3D Face Decoding.
Experiments conducted on FFHQ, CelebA-HQ, and CelebA-Dialog demonstrate the promising performance of our TEx-Face.
 arXiv  Detail & Related papers  (2023-12-21T15:32:49Z)
- Generated Faces in the Wild: Quantitative Comparison of Stable
  Diffusion, Midjourney and DALL-E 2 [47.64219291655723]
 We conduct a comparison of three popular systems including Stable Diffusion, Midjourney, and DALL-E 2 in their ability to generate photorealistic faces in the wild.
We find that Stable Diffusion generates better faces than the other systems, according to the FID score.
We also introduce a dataset of generated faces in the wild dubbed GFW, including a total of 15,076 faces.
 arXiv  Detail & Related papers  (2022-10-02T17:53:08Z)
- Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo
  Collection [65.92058628082322]
 Non-parametric face modeling aims to reconstruct 3D face only from images without shape assumptions.
This paper presents a novel Learning to Aggregate and Personalize framework for unsupervised robust 3D face modeling.
 arXiv  Detail & Related papers  (2021-06-15T03:10:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.