Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
- URL: http://arxiv.org/abs/2103.16449v1
- Date: Tue, 30 Mar 2021 15:47:58 GMT
- Title: Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
- Authors: Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
- Abstract summary: This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos.
We propose Bilevel Online Adaptation, which divides the optimization process of overall multi-objective into two steps of weight probe and weight update in a training.
We demonstrate that BOA leads to state-of-the-art results on two human mesh reconstruction benchmarks.
- Score: 94.25865526414717
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper considers a new problem of adapting a pre-trained model of human
mesh reconstruction to out-of-domain streaming videos. However, most previous
methods based on the parametric SMPL model \cite{loper2015smpl} underperform in
new domains with unexpected, domain-specific attributes, such as camera
parameters, lengths of bones, backgrounds, and occlusions. Our general idea is
to dynamically fine-tune the source model on test video streams with additional
temporal constraints, such that it can mitigate the domain gaps without
over-fitting the 2D information of individual test frames. A subsequent
challenge is how to avoid conflicts between the 2D and temporal constraints. We
propose to tackle this problem using a new training algorithm named Bilevel
Online Adaptation (BOA), which divides the optimization process of overall
multi-objective into two steps of weight probe and weight update in a training
iteration. We demonstrate that BOA leads to state-of-the-art results on two
human mesh reconstruction benchmarks.
Related papers
- Backpropagation-free Network for 3D Test-time Adaptation [42.469853469556966]
Test-Time Adaptation (TTA) methods tend to apply computationally heavy and memory-intensive backpropagation-based approaches.
Here, we propose a novel method that uses a backpropagation-free approach for TTA for the specific case of 3D data.
arXiv Detail & Related papers (2024-03-27T10:50:24Z) - Source-Guided Similarity Preservation for Online Person
Re-Identification [3.655597435084387]
Online Unsupervised Domain Adaptation (OUDA) is the task of continuously adapting a model trained on a well-annotated source domain dataset to a target domain observed as a data stream.
In OUDA, person Re-ID models face two main challenges: catastrophic forgetting and domain shift.
We propose a new Source-guided Similarity Preservation (S2P) framework to alleviate these two problems.
arXiv Detail & Related papers (2024-02-23T09:07:20Z) - SPDE priors for uncertainty quantification of end-to-end neural data
assimilation schemes [4.213142548113385]
Recent advances in the deep learning community enables to adress this problem as neural architecture embedding data assimilation variational framework.
In this work, we draw from SPDE-based Processes to estimate prior models able to handle non-stationary covariances in both space and time.
Our neural variational scheme is modified to embed an augmented state formulation with both state SPDE parametrization to estimate.
arXiv Detail & Related papers (2024-02-02T19:18:12Z) - Multi-domain Learning for Updating Face Anti-spoofing Models [17.506385040102213]
We present a new model for MD-FAS, which addresses the forgetting issue when learning new domain data.
First, we devise a simple yet effective module, called spoof region estimator(SRE), to identify spoof traces in the spoof image.
Unlike prior works that estimate spoof traces which generate multiple outputs or a low-resolution binary mask, SRE produces one single, detailed pixel-wise estimate in an unsupervised manner.
arXiv Detail & Related papers (2022-08-23T18:28:34Z) - Back to MLP: A Simple Baseline for Human Motion Prediction [59.18776744541904]
This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences.
We show that the performance of these approaches can be surpassed by a light-weight and purely architectural architecture with only 0.14M parameters.
An exhaustive evaluation on Human3.6M, AMASS and 3DPW datasets shows that our method, which we dub siMLPe, consistently outperforms all other approaches.
arXiv Detail & Related papers (2022-07-04T16:35:58Z) - Adapting the Mean Teacher for keypoint-based lung registration under
geometric domain shifts [75.51482952586773]
deep neural networks generally require plenty of labeled training data and are vulnerable to domain shifts between training and test data.
We present a novel approach to geometric domain adaptation for image registration, adapting a model from a labeled source to an unlabeled target domain.
Our method consistently improves on the baseline model by 50%/47% while even matching the accuracy of models trained on target data.
arXiv Detail & Related papers (2022-07-01T12:16:42Z) - Decoupled Multi-task Learning with Cyclical Self-Regulation for Face
Parsing [71.19528222206088]
We propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation for face parsing.
Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection.
Our method achieves the new state-of-the-art performance on the Helen, CelebA-HQ, and LapaMask datasets.
arXiv Detail & Related papers (2022-03-28T02:12:30Z) - Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online
Adaptation [87.85851771425325]
We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos.
We tackle this problem through online adaptation, gradually correcting the model bias during testing.
We propose the Dynamic Bilevel Online Adaptation algorithm (DynaBOA)
arXiv Detail & Related papers (2021-11-07T07:23:24Z) - Learning Nonparametric Human Mesh Reconstruction from a Single Image
without Ground Truth Meshes [56.27436157101251]
We propose a novel approach to learn human mesh reconstruction without any ground truth meshes.
This is made possible by introducing two new terms into the loss function of a graph convolutional neural network (Graph CNN)
arXiv Detail & Related papers (2020-02-28T20:30:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.