Fugu-MT 論文翻訳(概要): GIF: Generative Interpretable Faces

論文の概要: GIF: Generative Interpretable Faces

arxiv url: http://arxiv.org/abs/2009.00149v2
Date: Wed, 25 Nov 2020 13:37:01 GMT
ステータス: 翻訳完了
システム内更新日: 2022-10-23 06:36:48.830794
Title: GIF: Generative Interpretable Faces
Title（参考訳）: GIF:生成可能な解釈可能な顔
Authors: Partha Ghosh, Pravir Singh Gupta, Roy Uziel, Anurag Ranjan, Michael Black, Timo Bolkart
Abstract要約: 3次元顔モデリング法はパラメトリック制御を提供するが、非現実的な画像を生成する。最近の手法は、教師なしの方法で異なる因子をアンタングルしようとするか、事前訓練されたモデルに制御ポストホックを追加することによって、部分的な制御を得る。我々は、生成過程における乱れを促すために、事前定義された制御パラメータに生成モデルを条件付けする。
参考スコア（独自算出の注目度）: 16.573491296543196
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Photo-realistic visualization and animation of expressive human faces have been a long standing challenge. 3D face modeling methods provide parametric control but generates unrealistic images, on the other hand, generative 2D models like GANs (Generative Adversarial Networks) output photo-realistic face images, but lack explicit control. Recent methods gain partial control, either by attempting to disentangle different factors in an unsupervised manner, or by adding control post hoc to a pre-trained model. Unconditional GANs, however, may entangle factors that are hard to undo later. We condition our generative model on pre-defined control parameters to encourage disentanglement in the generation process. Specifically, we condition StyleGAN2 on FLAME, a generative 3D face model. While conditioning on FLAME parameters yields unsatisfactory results, we find that conditioning on rendered FLAME geometry and photometric details works well. This gives us a generative 2D face model named GIF (Generative Interpretable Faces) that offers FLAME's parametric control. Here, interpretable refers to the semantic meaning of different parameters. Given FLAME parameters for shape, pose, expressions, parameters for appearance, lighting, and an additional style vector, GIF outputs photo-realistic face images. We perform an AMT based perceptual study to quantitatively and qualitatively evaluate how well GIF follows its conditioning. The code, data, and trained model are publicly available for research purposes at http://gif.is.tue.mpg.de.
Abstract（参考訳）: 写実的な可視化と表現力のある人間の顔のアニメーションは、長年の課題だった。 3d顔モデリングはパラメトリックな制御を提供するが、非現実的な画像を生成する一方で、gans(generative adversarial networks)のような生成型2dモデルは、フォトリアリスティックな顔画像を出力するが、明示的な制御は欠如している。最近の手法は、教師なしの方法で異なる因子をアンタングルしようとするか、事前訓練されたモデルに制御ポストホックを追加することによって、部分的な制御を得る。しかし、無条件のGANは後に解離しにくい因子を絡み合わせることがある。我々は、生成過程における乱れを促すために、事前定義された制御パラメータに生成モデルを条件付けする。具体的には3次元顔モデルFLAMEにStyleGAN2を付加する。 FLAMEパラメータの条件付けは不満足な結果をもたらすが、レンダリングされたFLAME幾何や測光の詳細の条件付けはうまく機能する。これにより、FLAMEのパラメトリック制御を提供するGIF(Generative Interpretable Faces)という2次元顔モデルを生成することができる。ここでは、解釈とは異なるパラメータの意味をいう。形状、ポーズ、表現、外観、照明のパラメータ、および追加のスタイルベクトルのFLAMEパラメータが与えられた場合、GIFはフォトリアリスティックな顔画像を生成する。 AMTに基づく知覚的研究を行い、GIFの条件付けにどれだけの効果があるかを定量的に定性的に評価する。コード、データ、トレーニングされたモデルは、http://gif.is.tue.mpg.deで研究目的で公開されている。

論文の概要: GIF: Generative Interpretable Faces

関連論文リスト