Fugu-MT 論文翻訳(概要): PhotoApp: Photorealistic Appearance Editing of Head Portraits

論文の概要: PhotoApp: Photorealistic Appearance Editing of Head Portraits

arxiv url: http://arxiv.org/abs/2103.07658v1
Date: Sat, 13 Mar 2021 08:59:49 GMT
ステータス: 翻訳完了
システム内更新日: 2021-03-16 14:17:05.891264
Title: PhotoApp: Photorealistic Appearance Editing of Head Portraits
Title（参考訳）: PhotoApp: ヘッドポートレートのフォトリアルな外観編集
Authors: Mallikarjun B R, Ayush Tewari, Abdallah Dib, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Louis Chevallier, Mohamed Elgharib, Christian Theobalt
Abstract要約: 本稿では,ポートレート画像におけるカメラ視点とシーン照明の高品質な直感的編集手法を提案する。ほとんどの編集アプローチは、光やカメラステージといった設定でキャプチャされたトレーニングデータを使用した教師あり学習に依存している。 StyleGANの潜在空間で学習する監督問題を設計します。これは、教師付き学習と生成的敵対的モデリングのベストを組み合わせる。
参考スコア（独自算出の注目度）: 97.23638022484153
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Photorealistic editing of portraits is a challenging task as humans are very sensitive to inconsistencies in faces. We present an approach for high-quality intuitive editing of the camera viewpoint and scene illumination in a portrait image. This requires our method to capture and control the full reflectance field of the person in the image. Most editing approaches rely on supervised learning using training data captured with setups such as light and camera stages. Such datasets are expensive to acquire, not readily available and do not capture all the rich variations of in-the-wild portrait images. In addition, most supervised approaches only focus on relighting, and do not allow camera viewpoint editing. Thus, they only capture and control a subset of the reflectance field. Recently, portrait editing has been demonstrated by operating in the generative model space of StyleGAN. While such approaches do not require direct supervision, there is a significant loss of quality when compared to the supervised approaches. In this paper, we present a method which learns from limited supervised training data. The training images only include people in a fixed neutral expression with eyes closed, without much hair or background variations. Each person is captured under 150 one-light-at-a-time conditions and under 8 camera poses. Instead of training directly in the image space, we design a supervised problem which learns transformations in the latent space of StyleGAN. This combines the best of supervised learning and generative adversarial modeling. We show that the StyleGAN prior allows for generalisation to different expressions, hairstyles and backgrounds. This produces high-quality photorealistic results for in-the-wild images and significantly outperforms existing methods. Our approach can edit the illumination and pose simultaneously, and runs at interactive rates.
Abstract（参考訳）: ポートレートのフォトリアリズム編集は、人間が顔の不一致に非常に敏感であるため、難しい作業です。本稿では,ポートレート画像におけるカメラ視点とシーン照明の高品質な直感的編集手法を提案する。これにより,画像中の人物のフルリフレクタンスフィールドをキャプチャし,制御する手法が求められている。ほとんどの編集アプローチは、光やカメラステージといった設定でキャプチャされたトレーニングデータを使用した教師あり学習に依存している。このようなデータセットは、取得するのに高価であり、容易に入手できず、ワイルドポートレート画像のすべての豊富なバリエーションをキャプチャしません。加えて、ほとんどの監督されたアプローチはライトアップのみに焦点を当てており、カメラ視点の編集を許可していない。したがって、反射フィールドのサブセットのみをキャプチャして制御します。近年,StyleGAN の生成モデル空間で動作することで,肖像画編集が実証されている。このようなアプローチは直接の監視を必要としないが、監督されたアプローチと比較すると品質が大幅に低下する。本稿では,限られた教師付きトレーニングデータから学習する手法を提案する。トレーニング画像には、髪や背景のバリエーションがほとんどなく、目を閉じた固定された中性表現の人々しか含まれていない。一人ひとりが1回150回、カメラが8回撮影されます。画像空間で直接トレーニングする代わりに、StyleGANの潜在的な空間で変換を学ぶ監督された問題を設計します。これは、教師付き学習と生成的敵対的モデリングのベストを組み合わせる。 StyleGANの前には,表現,髪型,背景の一般化が可能であることを示す。これにより、インザワイルド画像のための高品質のフォトリアリズム結果が生成され、既存の方法を大幅に上回ります。本手法は照明とポーズを同時に編集し,インタラクティブな速度で動作させる。

論文の概要: PhotoApp: Photorealistic Appearance Editing of Head Portraits

関連論文リスト