Progressive and Aligned Pose Attention Transfer for Person Image
Generation
- URL: http://arxiv.org/abs/2103.11622v1
- Date: Mon, 22 Mar 2021 07:24:57 GMT
- Title: Progressive and Aligned Pose Attention Transfer for Person Image
Generation
- Authors: Zhen Zhu, Tengteng Huang, Mengde Xu, Baoguang Shi, Wenqing Cheng,
Xiang Bai
- Abstract summary: This paper proposes a new generative adversarial network for pose transfer, i.e., transferring the pose of a given person to a target pose.
We use two types of blocks, namely Pose-Attentional Transfer Block (PATB) and Aligned Pose-Attentional Transfer Bloc (APATB)
We verify the efficacy of the model on the Market-1501 and DeepFashion datasets, using quantitative and qualitative measures.
- Score: 59.87492938953545
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes a new generative adversarial network for pose transfer,
i.e., transferring the pose of a given person to a target pose. We design a
progressive generator which comprises a sequence of transfer blocks. Each block
performs an intermediate transfer step by modeling the relationship between the
condition and the target poses with attention mechanism. Two types of blocks
are introduced, namely Pose-Attentional Transfer Block (PATB) and Aligned
Pose-Attentional Transfer Bloc ~(APATB). Compared with previous works, our
model generates more photorealistic person images that retain better appearance
consistency and shape consistency compared with input images. We verify the
efficacy of the model on the Market-1501 and DeepFashion datasets, using
quantitative and qualitative measures. Furthermore, we show that our method can
be used for data augmentation for the person re-identification task,
alleviating the issue of data insufficiency. Code and pretrained models are
available at https://github.com/tengteng95/Pose-Transfer.git.
Related papers
- Person Image Synthesis via Denoising Diffusion Model [116.34633988927429]
We show how denoising diffusion models can be applied for high-fidelity person image synthesis.
Our results on two large-scale benchmarks and a user study demonstrate the photorealism of our proposed approach under challenging scenarios.
arXiv Detail & Related papers (2022-11-22T18:59:50Z) - FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems.
We collect descriptions of correcting a "current" pose to look like a "target" pose.
To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z) - PoNA: Pose-guided Non-local Attention for Human Pose Transfer [105.14398322129024]
We propose a new human pose transfer method using a generative adversarial network (GAN) with simplified cascaded blocks.
Our model generates sharper and more realistic images with rich details, while having fewer parameters and faster speed.
arXiv Detail & Related papers (2020-12-13T12:38:29Z) - Human Pose Transfer by Adaptive Hierarchical Deformation [24.70009597455219]
We propose an adaptive human pose transfer network with two hierarchical deformation levels.
The first level generates human semantic parsing aligned with the target pose.
The second level generates the final textured person image in the target pose with the semantic guidance.
arXiv Detail & Related papers (2020-12-13T01:49:26Z) - MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image
Generation [13.06676286691587]
Pose-guided person image generation usually involves using paired source-target images to supervise the training.
We propose a novel multi-level statistics transfer model, which disentangles and transfers multi-level appearance features from person images.
Our approach allows for flexible manipulation of person appearance and pose properties to perform pose transfer and clothes style transfer tasks.
arXiv Detail & Related papers (2020-11-18T04:38:48Z) - Person image generation with semantic attention network for person
re-identification [9.30413920076019]
We propose a novel person pose-guided image generation method, which is called the semantic attention network.
The network consists of several semantic attention blocks, where each block attends to preserve and update the pose code and the clothing textures.
Compared with other methods, our network can characterize better body shape and keep clothing attributes, simultaneously.
arXiv Detail & Related papers (2020-08-18T12:18:51Z) - XingGAN for Person Image Generation [149.54517767056382]
We propose a novel Generative Adversarial Network (XingGAN) for person image generation tasks.
XingGAN consists of two generation branches that model the person's appearance and shape information.
We show that the proposed XingGAN advances the state-of-the-art performance in terms of objective quantitative scores and subjective visual realness.
arXiv Detail & Related papers (2020-07-17T23:40:22Z) - Neural Pose Transfer by Spatially Adaptive Instance Normalization [73.04483812364127]
We propose the first neural pose transfer model that solves the pose transfer via the latest technique for image style transfer.
Our model does not require any correspondences between the source and target meshes.
Experiments show that the proposed model can effectively transfer deformation from source to target meshes, and has good generalization ability to deal with unseen identities or poses of meshes.
arXiv Detail & Related papers (2020-03-16T14:33:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.