Structure-Aware Flow Generation for Human Body Reshaping
- URL: http://arxiv.org/abs/2203.04670v2
- Date: Fri, 11 Mar 2022 03:38:21 GMT
- Title: Structure-Aware Flow Generation for Human Body Reshaping
- Authors: Jianqiang Ren, Yuan Yao, Biwen Lei, Miaomiao Cui, Xuansong Xie
- Abstract summary: We develop an end-to-end flow generation architecture to achieve unprecedentedly controllable performance under arbitrary poses and garments.
For a comprehensive evaluation, we construct the first large-scale body reshaping dataset, namely BR-5K.
Our approach significantly outperforms existing state-of-the-art methods in terms of visual performance, controllability, and efficiency.
- Score: 15.365236395118982
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Body reshaping is an important procedure in portrait photo retouching. Due to
the complicated structure and multifarious appearance of human bodies, existing
methods either fall back on the 3D domain via body morphable model or resort to
keypoint-based image deformation, leading to inefficiency and unsatisfied
visual quality. In this paper, we address these limitations by formulating an
end-to-end flow generation architecture under the guidance of body structural
priors, including skeletons and Part Affinity Fields, and achieve
unprecedentedly controllable performance under arbitrary poses and garments. A
compositional attention mechanism is introduced for capturing both visual
perceptual correlations and structural associations of the human body to
reinforce the manipulation consistency among related parts. For a comprehensive
evaluation, we construct the first large-scale body reshaping dataset, namely
BR-5K, which contains 5,000 portrait photos as well as professionally retouched
targets. Extensive experiments demonstrate that our approach significantly
outperforms existing state-of-the-art methods in terms of visual performance,
controllability, and efficiency. The dataset is available at our website:
https://github.com/JianqiangRen/FlowBasedBodyReshaping.
Related papers
- Reconstructing People, Places, and Cameras [57.81696692335401]
"Humans and Structure from Motion" (HSfM) is a method for jointly reconstructing multiple human meshes, scene point clouds, and camera parameters in a metric world coordinate system.
Our results show that incorporating human data into the SfM pipeline improves camera pose estimation.
arXiv Detail & Related papers (2024-12-23T18:58:34Z) - BodyMetric: Evaluating the Realism of Human Bodies in Text-to-Image Generation [9.85749440360125]
BodyMetric is a learnable metric that predicts body realism in images.
We demonstrate BodyMetric through applications that were previously infeasible at scale.
arXiv Detail & Related papers (2024-12-05T11:48:54Z) - EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild [79.71523320368388]
Our work aims to reconstruct hand-object interactions from a single-view image.
We first design a novel pipeline to estimate the underlying hand pose and object shape.
With the initial reconstruction, we employ a prior-guided optimization scheme.
arXiv Detail & Related papers (2024-11-21T16:33:35Z) - SSP-IR: Semantic and Structure Priors for Diffusion-based Realistic Image Restoration [20.873676111265656]
SSP-IR aims to fully exploit semantic and structure priors from low-quality images.
Our method outperforms other state-of-the-art methods overall on both synthetic and real-world datasets.
arXiv Detail & Related papers (2024-07-04T04:55:14Z) - 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information [2.457872341625575]
A novel Semantic Graph Attention Network can benefit from the ability of self-attention to capture global context.
A Body Part Decoder assists in extracting and refining the information related to specific segments of the body.
A Geometry Loss makes a critical constraint on the structural skeleton of the body, ensuring that the model's predictions adhere to the natural limits of human posture.
arXiv Detail & Related papers (2024-06-03T10:59:00Z) - Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network [14.361677329761672]
We propose a novel Adaptive Affinity-Graph Network (AAGN), which extracts the global affinity between different body parts.
For high-frequency details, a Body Shape Discriminator (BSD) is designed to extract information from both high-frequency and spatial domain.
Our framework significantly enhances the aesthetic appeal of photos, surpassing all previous work to achieve state-of-the-art in all evaluation metrics.
arXiv Detail & Related papers (2024-04-22T08:44:10Z) - Towards Robust and Expressive Whole-body Human Pose and Shape Estimation [51.457517178632756]
Whole-body pose and shape estimation aims to jointly predict different behaviors of the entire human body from a monocular image.
Existing methods often exhibit degraded performance under the complexity of in-the-wild scenarios.
We propose a novel framework to enhance the robustness of whole-body pose and shape estimation.
arXiv Detail & Related papers (2023-12-14T08:17:42Z) - Human as Points: Explicit Point-based 3D Human Reconstruction from Single-view RGB Images [71.91424164693422]
We introduce an explicit point-based human reconstruction framework called HaP.
Our approach is featured by fully-explicit point cloud estimation, manipulation, generation, and refinement in the 3D geometric space.
Our results may indicate a paradigm rollback to the fully-explicit and geometry-centric algorithm design.
arXiv Detail & Related papers (2023-11-06T05:52:29Z) - Generalizable Neural Performer: Learning Robust Radiance Fields for
Human Novel View Synthesis [52.720314035084215]
This work targets at using a general deep learning framework to synthesize free-viewpoint images of arbitrary human performers.
We present a simple yet powerful framework, named Generalizable Neural Performer (GNR), that learns a generalizable and robust neural body representation.
Experiments on GeneBody-1.0 and ZJU-Mocap show better robustness of our methods than recent state-of-the-art generalizable methods.
arXiv Detail & Related papers (2022-04-25T17:14:22Z) - Structure-aware Person Image Generation with Pose Decomposition and
Semantic Correlation [29.727033198797518]
We propose a structure-aware flow based method for high-quality person image generation.
We decompose the human body into different semantic parts and apply different networks to predict the flow fields for these parts separately.
Our method can generate high-quality results under large pose discrepancy and outperforms state-of-the-art methods in both qualitative and quantitative comparisons.
arXiv Detail & Related papers (2021-02-05T03:07:57Z) - Kinematic-Structure-Preserved Representation for Unsupervised 3D Human
Pose Estimation [58.72192168935338]
Generalizability of human pose estimation models developed using supervision on large-scale in-studio datasets remains questionable.
We propose a novel kinematic-structure-preserved unsupervised 3D pose estimation framework, which is not restrained by any paired or unpaired weak supervisions.
Our proposed model employs three consecutive differentiable transformations named as forward-kinematics, camera-projection and spatial-map transformation.
arXiv Detail & Related papers (2020-06-24T23:56:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.