Masked Linear Regression for Learning Local Receptive Fields for Facial
Expression Synthesis
- URL: http://arxiv.org/abs/2011.09104v1
- Date: Wed, 18 Nov 2020 06:04:24 GMT
- Title: Masked Linear Regression for Learning Local Receptive Fields for Facial
Expression Synthesis
- Authors: Nazar Khan, Arbish Akram, Arif Mahmood, Sania Ashraf, Kashif Murtaza
- Abstract summary: We propose a constrained version of ridge regression that exploits the local and sparse structure of facial expressions.
In contrast to the existing approaches, our proposed model can be efficiently trained on larger image sizes.
The proposed algorithm is also compared with state-of-the-art GANs including Pix2Pix, CycleGAN, StarGAN and GANimation.
- Score: 10.28711904929932
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Compared to facial expression recognition, expression synthesis requires a
very high-dimensional mapping. This problem exacerbates with increasing image
sizes and limits existing expression synthesis approaches to relatively small
images. We observe that facial expressions often constitute sparsely
distributed and locally correlated changes from one expression to another. By
exploiting this observation, the number of parameters in an expression
synthesis model can be significantly reduced. Therefore, we propose a
constrained version of ridge regression that exploits the local and sparse
structure of facial expressions. We consider this model as masked regression
for learning local receptive fields. In contrast to the existing approaches,
our proposed model can be efficiently trained on larger image sizes.
Experiments using three publicly available datasets demonstrate that our model
is significantly better than $\ell_0, \ell_1$ and $\ell_2$-regression, SVD
based approaches, and kernelized regression in terms of mean-squared-error,
visual quality as well as computational and spatial complexities. The reduction
in the number of parameters allows our method to generalize better even after
training on smaller datasets. The proposed algorithm is also compared with
state-of-the-art GANs including Pix2Pix, CycleGAN, StarGAN and GANimation.
These GANs produce photo-realistic results as long as the testing and the
training distributions are similar. In contrast, our results demonstrate
significant generalization of the proposed algorithm over out-of-dataset human
photographs, pencil sketches and even animal faces.
Related papers
- No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images [100.80376573969045]
NoPoSplat is a feed-forward model capable of reconstructing 3D scenes parameterized by 3D Gaussians from multi-view images.
Our model achieves real-time 3D Gaussian reconstruction during inference.
This work makes significant advances in pose-free generalizable 3D reconstruction and demonstrates its applicability to real-world scenarios.
arXiv Detail & Related papers (2024-10-31T17:58:22Z) - OFER: Occluded Face Expression Reconstruction [16.06622406877353]
We introduce OFER, a novel approach for single image 3D face reconstruction that can generate plausible, diverse, and expressive 3D faces.
We propose a novel ranking mechanism that sorts the outputs of the shape diffusion network based on the predicted shape accuracy scores to select the best match.
arXiv Detail & Related papers (2024-10-29T00:21:26Z) - Fast constrained sampling in pre-trained diffusion models [77.21486516041391]
Diffusion models have dominated the field of large, generative image models.
We propose an algorithm for fast-constrained sampling in large pre-trained diffusion models.
arXiv Detail & Related papers (2024-10-24T14:52:38Z) - NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations [2.711538918087856]
We propose an optimization-based new framework called NODER, which leverages neural ordinary differential equations to capture complex underlying dynamics.
Our model needs only a couple of images in a sequence for prediction, which is practical, especially for clinical situations.
arXiv Detail & Related papers (2024-07-18T07:50:46Z) - Symbolic Expression Transformer: A Computer Vision Approach for Symbolic
Regression [9.978824294461196]
Symbolic Regression (SR) is a type of regression analysis to automatically find the mathematical expression that best fits the data.
Inspired by the fact that human beings can infer a mathematical expression based on the curve of it, we propose Symbolic Expression Transformer (SET)
SET is a sample-agnostic model from the perspective of computer vision for SR.
arXiv Detail & Related papers (2022-05-24T05:35:46Z) - Adversarial Parametric Pose Prior [106.12437086990853]
We learn a prior that restricts the SMPL parameters to values that produce realistic poses via adversarial training.
We show that our learned prior covers the diversity of the real-data distribution, facilitates optimization for 3D reconstruction from 2D keypoints, and yields better pose estimates when used for regression from images.
arXiv Detail & Related papers (2021-12-08T10:05:32Z) - Score-Based Generative Modeling through Stochastic Differential
Equations [114.39209003111723]
We present a differential equation that transforms a complex data distribution to a known prior distribution by injecting noise.
A corresponding reverse-time SDE transforms the prior distribution back into the data distribution by slowly removing the noise.
By leveraging advances in score-based generative modeling, we can accurately estimate these scores with neural networks.
We demonstrate high fidelity generation of 1024 x 1024 images for the first time from a score-based generative model.
arXiv Detail & Related papers (2020-11-26T19:39:10Z) - Pixel-based Facial Expression Synthesis [1.7056768055368383]
We propose a pixel-based facial expression synthesis method in which each output pixel observes only one input pixel.
The proposed model is two orders of magnitude smaller which makes it suitable for deployment on resource-constrained devices.
arXiv Detail & Related papers (2020-10-27T16:00:45Z) - 3D Dense Geometry-Guided Facial Expression Synthesis by Adversarial
Learning [54.24887282693925]
We propose a novel framework to exploit 3D dense (depth and surface normals) information for expression manipulation.
We use an off-the-shelf state-of-the-art 3D reconstruction model to estimate the depth and create a large-scale RGB-Depth dataset.
Our experiments demonstrate that the proposed method outperforms the competitive baseline and existing arts by a large margin.
arXiv Detail & Related papers (2020-09-30T17:12:35Z) - Deep Variational Network Toward Blind Image Restoration [60.45350399661175]
Blind image restoration is a common yet challenging problem in computer vision.
We propose a novel blind image restoration method, aiming to integrate both the advantages of them.
Experiments on two typical blind IR tasks, namely image denoising and super-resolution, demonstrate that the proposed method achieves superior performance over current state-of-the-arts.
arXiv Detail & Related papers (2020-08-25T03:30:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.