Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
- URL: http://arxiv.org/abs/2103.16449v1
- Date: Tue, 30 Mar 2021 15:47:58 GMT
- Title: Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
- Authors: Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
- Abstract summary: This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos.
We propose Bilevel Online Adaptation, which divides the optimization process of overall multi-objective into two steps of weight probe and weight update in a training.
We demonstrate that BOA leads to state-of-the-art results on two human mesh reconstruction benchmarks.
- Score: 94.25865526414717
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper considers a new problem of adapting a pre-trained model of human
mesh reconstruction to out-of-domain streaming videos. However, most previous
methods based on the parametric SMPL model \cite{loper2015smpl} underperform in
new domains with unexpected, domain-specific attributes, such as camera
parameters, lengths of bones, backgrounds, and occlusions. Our general idea is
to dynamically fine-tune the source model on test video streams with additional
temporal constraints, such that it can mitigate the domain gaps without
over-fitting the 2D information of individual test frames. A subsequent
challenge is how to avoid conflicts between the 2D and temporal constraints. We
propose to tackle this problem using a new training algorithm named Bilevel
Online Adaptation (BOA), which divides the optimization process of overall
multi-objective into two steps of weight probe and weight update in a training
iteration. We demonstrate that BOA leads to state-of-the-art results on two
human mesh reconstruction benchmarks.
Related papers
- Towards Robust and Realistic Human Pose Estimation via WiFi Signals [85.60557095666934]
WiFi-based human pose estimation is a challenging task that bridges discrete and subtle WiFi signals to human skeletons.
This paper revisits this problem and reveals two critical yet overlooked issues: 1) cross-domain gap, i.e., due to significant variations between source-target domain pose distributions; and 2) structural fidelity gap, i.e., predicted skeletal poses manifest distorted topology.
This paper fills these gaps by reformulating the task into a novel two-phase framework dubbed DT-Pose: Domain-consistent representation learning and Topology-constrained Pose decoding.
arXiv Detail & Related papers (2025-01-16T09:38:22Z) - Backpropagation-free Network for 3D Test-time Adaptation [42.469853469556966]
Test-Time Adaptation (TTA) methods tend to apply computationally heavy and memory-intensive backpropagation-based approaches.
Here, we propose a novel method that uses a backpropagation-free approach for TTA for the specific case of 3D data.
arXiv Detail & Related papers (2024-03-27T10:50:24Z) - Solving Inverse Problems with Model Mismatch using Untrained Neural Networks within Model-based Architectures [14.551812310439004]
We introduce an untrained forward model residual block within the model-based architecture to match the data consistency in the measurement domain for each instance.
Our approach offers a unified solution that is less parameter-sensitive, requires no additional data, and enables simultaneous fitting of the forward model and reconstruction in a single pass.
arXiv Detail & Related papers (2024-03-07T19:02:13Z) - Source-Guided Similarity Preservation for Online Person
Re-Identification [3.655597435084387]
Online Unsupervised Domain Adaptation (OUDA) is the task of continuously adapting a model trained on a well-annotated source domain dataset to a target domain observed as a data stream.
In OUDA, person Re-ID models face two main challenges: catastrophic forgetting and domain shift.
We propose a new Source-guided Similarity Preservation (S2P) framework to alleviate these two problems.
arXiv Detail & Related papers (2024-02-23T09:07:20Z) - Multi-domain Learning for Updating Face Anti-spoofing Models [17.506385040102213]
We present a new model for MD-FAS, which addresses the forgetting issue when learning new domain data.
First, we devise a simple yet effective module, called spoof region estimator(SRE), to identify spoof traces in the spoof image.
Unlike prior works that estimate spoof traces which generate multiple outputs or a low-resolution binary mask, SRE produces one single, detailed pixel-wise estimate in an unsupervised manner.
arXiv Detail & Related papers (2022-08-23T18:28:34Z) - Back to MLP: A Simple Baseline for Human Motion Prediction [59.18776744541904]
This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences.
We show that the performance of these approaches can be surpassed by a light-weight and purely architectural architecture with only 0.14M parameters.
An exhaustive evaluation on Human3.6M, AMASS and 3DPW datasets shows that our method, which we dub siMLPe, consistently outperforms all other approaches.
arXiv Detail & Related papers (2022-07-04T16:35:58Z) - Adapting the Mean Teacher for keypoint-based lung registration under
geometric domain shifts [75.51482952586773]
deep neural networks generally require plenty of labeled training data and are vulnerable to domain shifts between training and test data.
We present a novel approach to geometric domain adaptation for image registration, adapting a model from a labeled source to an unlabeled target domain.
Our method consistently improves on the baseline model by 50%/47% while even matching the accuracy of models trained on target data.
arXiv Detail & Related papers (2022-07-01T12:16:42Z) - Decoupled Multi-task Learning with Cyclical Self-Regulation for Face
Parsing [71.19528222206088]
We propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation for face parsing.
Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection.
Our method achieves the new state-of-the-art performance on the Helen, CelebA-HQ, and LapaMask datasets.
arXiv Detail & Related papers (2022-03-28T02:12:30Z) - Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online
Adaptation [87.85851771425325]
We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos.
We tackle this problem through online adaptation, gradually correcting the model bias during testing.
We propose the Dynamic Bilevel Online Adaptation algorithm (DynaBOA)
arXiv Detail & Related papers (2021-11-07T07:23:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.