GC-VTON: Predicting Globally Consistent and Occlusion Aware Local Flows
with Neighborhood Integrity Preservation for Virtual Try-on
- URL: http://arxiv.org/abs/2311.04932v1
- Date: Tue, 7 Nov 2023 10:09:49 GMT
- Title: GC-VTON: Predicting Globally Consistent and Occlusion Aware Local Flows
with Neighborhood Integrity Preservation for Virtual Try-on
- Authors: Hamza Rawal, Muhammad Junaid Ahmad, Farooq Zaman
- Abstract summary: Flow based garment warping is an integral part of image-based virtual try-on networks.
We propose a novel approach where we disentangle the global boundary alignment and local texture preserving tasks.
A consistency loss is then employed between the two modules which harmonizes the local flows with the global boundary alignment.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Flow based garment warping is an integral part of image-based virtual try-on
networks. However, optimizing a single flow predicting network for simultaneous
global boundary alignment and local texture preservation results in sub-optimal
flow fields. Moreover, dense flows are inherently not suited to handle
intricate conditions like garment occlusion by body parts or by other garments.
Forcing flows to handle the above issues results in various distortions like
texture squeezing, and stretching. In this work, we propose a novel approach
where we disentangle the global boundary alignment and local texture preserving
tasks via our GlobalNet and LocalNet modules. A consistency loss is then
employed between the two modules which harmonizes the local flows with the
global boundary alignment. Additionally, we explicitly handle occlusions by
predicting body-parts visibility mask, which is used to mask out the occluded
regions in the warped garment. The masking prevents the LocalNet from
predicting flows that distort texture to compensate for occlusions. We also
introduce a novel regularization loss (NIPR), that defines a criteria to
identify the regions in the warped garment where texture integrity is violated
(squeezed or stretched). NIPR subsequently penalizes the flow in those regions
to ensure regular and coherent warps that preserve the texture in local
neighborhoods. Evaluation on a widely used virtual try-on dataset demonstrates
strong performance of our network compared to the current SOTA methods.
Related papers
- LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior [22.948360879064758]
LCP-Fusion is a neural implicit SLAM system with enhanced local constraints and computable prior.
We show that our method achieve better localization accuracy and reconstruction consistency than existing RGB-D implicit SLAM.
arXiv Detail & Related papers (2024-11-06T02:05:44Z) - GP-VTON: Towards General Purpose Virtual Try-on via Collaborative
Local-Flow Global-Parsing Learning [63.8668179362151]
Virtual Try-ON aims to transfer an in-shop garment onto a specific person.
Existing methods employ a global warping module to model the anisotropic deformation for different garment parts.
We propose an innovative Local-Flow Global-Parsing (LFGP) warping module and a Dynamic Gradient Truncation (DGT) training strategy.
arXiv Detail & Related papers (2023-03-24T02:12:29Z) - Push-the-Boundary: Boundary-aware Feature Propagation for Semantic
Segmentation of 3D Point Clouds [0.5249805590164901]
We propose a boundary-aware feature propagation mechanism to improve semantic segmentation near object boundaries.
With one shared encoder, our network outputs (i) boundary localization, (ii) prediction of directions pointing to the object's interior, and (iii) semantic segmentation, in three parallel streams.
Our proposed approach yields consistent improvements by reducing boundary errors.
arXiv Detail & Related papers (2022-12-23T15:42:01Z) - Global Aggregation then Local Distribution for Scene Parsing [99.1095068574454]
We show that our approach can be modularized as an end-to-end trainable block and easily plugged into existing semantic segmentation networks.
Our approach allows us to build new state of the art on major semantic segmentation benchmarks including Cityscapes, ADE20K, Pascal Context, Camvid and COCO-stuff.
arXiv Detail & Related papers (2021-07-28T03:46:57Z) - Sign-Agnostic CONet: Learning Implicit Surface Reconstructions by
Sign-Agnostic Optimization of Convolutional Occupancy Networks [39.65056638604885]
We learn implicit surface reconstruction by sign-agnostic optimization of convolutional occupancy networks.
We show this goal can be effectively achieved by a simple yet effective design.
arXiv Detail & Related papers (2021-05-08T03:35:32Z) - Unsupervised Metric Relocalization Using Transform Consistency Loss [66.19479868638925]
Training networks to perform metric relocalization traditionally requires accurate image correspondences.
We propose a self-supervised solution, which exploits a key insight: localizing a query image within a map should yield the same absolute pose, regardless of the reference image used for registration.
We evaluate our framework on synthetic and real-world data, showing our approach outperforms other supervised methods when a limited amount of ground-truth information is available.
arXiv Detail & Related papers (2020-11-01T19:24:27Z) - LRC-Net: Learning Discriminative Features on Point Clouds by Encoding
Local Region Contexts [65.79931333193016]
We present a novel Local-Region-Context Network (LRC-Net) to learn discriminative features on point clouds.
LRC-Net encodes fine-grained contexts inside and among local regions simultaneously.
Results show LRC-Net is competitive with state-of-the-art methods in shape classification and shape segmentation applications.
arXiv Detail & Related papers (2020-03-18T14:34:08Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z) - Dense Residual Network: Enhancing Global Dense Feature Flow for
Character Recognition [75.4027660840568]
This paper explores how to enhance the local and global dense feature flow by exploiting hierarchical features fully from all the convolution layers.
Technically, we propose an efficient and effective CNN framework, i.e., Fast Dense Residual Network (FDRN) for text recognition.
arXiv Detail & Related papers (2020-01-23T06:55:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.