Fugu-MT 論文翻訳(概要): Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning

論文の概要: Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning

arxiv url: http://arxiv.org/abs/2211.14052v1
Date: Fri, 25 Nov 2022 12:16:21 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-28 17:13:27.440517
Title: Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning
Title（参考訳）: 3D対応グローバル対応学習によるハードな仮想試行に向けて
Authors: Zaiyu Huang, Hanhui Li, Zhenyu Xie, Michael Kampffmeyer, Qingling Cai, Xiaodan Liang
Abstract要約: 3D対応のグローバルな対応は、グローバルな意味的相関、局所的な変形、および3D人体の幾何学的先行を共同でエンコードする信頼性のあるフローである。対向ジェネレータは、3D認識フローによって歪んだ衣服と、対象者の画像とを入力として、フォトリアリスティックな試着結果を合成する。
参考スコア（独自算出の注目度）: 70.75369367311897
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we target image-based person-to-person virtual try-on in the presence of diverse poses and large viewpoint variations. Existing methods are restricted in this setting as they estimate garment warping flows mainly based on 2D poses and appearance, which omits the geometric prior of the 3D human body shape. Moreover, current garment warping methods are confined to localized regions, which makes them ineffective in capturing long-range dependencies and results in inferior flows with artifacts. To tackle these issues, we present 3D-aware global correspondences, which are reliable flows that jointly encode global semantic correlations, local deformations, and geometric priors of 3D human bodies. Particularly, given an image pair depicting the source and target person, (a) we first obtain their pose-aware and high-level representations via two encoders, and introduce a coarse-to-fine decoder with multiple refinement modules to predict the pixel-wise global correspondence. (b) 3D parametric human models inferred from images are incorporated as priors to regularize the correspondence refinement process so that our flows can be 3D-aware and better handle variations of pose and viewpoint. (c) Finally, an adversarial generator takes the garment warped by the 3D-aware flow, and the image of the target person as inputs, to synthesize the photo-realistic try-on result. Extensive experiments on public benchmarks and our HardPose test set demonstrate the superiority of our method against the SOTA try-on approaches.
Abstract（参考訳）: 本稿では,多彩なポーズや視点変化の存在下で,画像に基づく仮想試行を目標とする。既存の方法は、主に2次元のポーズと外観に基づいて衣服の反りの流れを推定し、3次元の人体形状の幾何学的先行を省略するため、この設定で制限されている。また,現在の衣服の反りは局所的な領域に限定されているため,長距離の依存関係を捉えるのに効果がなく,アーティファクトによる流れが劣る。これらの課題に対処するために,グローバルな意味的相関,局所的な変形,および3次元人体の幾何学的先行を協調的にエンコードする信頼性のある3D対応のグローバル通信を提案する。特に、ソースとターゲットの人物を描いた画像対が与えられる。 (a)まず2つのエンコーダによるポーズ認識とハイレベル表現を取得し,画素ワイドな大域対応を予測するために複数の改良モジュールを備えた粗大なデコーダを導入する。 b) 画像から推定される3次元パラメトリック人間モデルを先行として組み込んで, 対応改善プロセスの規則化を行い, フローを3次元認識し, ポーズや視点のバリエーションをよりよく扱えるようにした。 (c)最後に、対向生成装置は、3D認識フローによって歪んだ衣服と、対象者の画像とを入力として、フォトリアリスティックな試着結果を合成する。公開ベンチマークとHardPoseテストセットに関する大規模な実験は、SOTA試行アプローチに対する我々の手法の優位性を実証している。

論文の概要: Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning

関連論文リスト