UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose
Estimation
- URL: http://arxiv.org/abs/2111.12580v1
- Date: Wed, 24 Nov 2021 16:00:48 GMT
- Title: UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose
Estimation
- Authors: Taeyeop Lee, Byeong-Uk Lee, Inkyu Shin, Jaesung Choe, Ukcheol Shin, In
So Kweon, Kuk-Jin Yoon
- Abstract summary: We propose an unsupervised domain adaptation (UDA) for category-level object pose estimation, called textbfUDA-COPE.
Inspired by the recent multi-modal UDA techniques, the proposed method exploits a teacher-student self-supervised learning scheme to train a pose estimation network without using target domain labels.
- Score: 84.16372642822495
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning to estimate object pose often requires ground-truth (GT) labels,
such as CAD model and absolute-scale object pose, which is expensive and
laborious to obtain in the real world. To tackle this problem, we propose an
unsupervised domain adaptation (UDA) for category-level object pose estimation,
called \textbf{UDA-COPE}. Inspired by the recent multi-modal UDA techniques,
the proposed method exploits a teacher-student self-supervised learning scheme
to train a pose estimation network without using target domain labels. We also
introduce a bidirectional filtering method between predicted normalized object
coordinate space (NOCS) map and observed point cloud, to not only make our
teacher network more robust to the target domain but also to provide more
reliable pseudo labels for the student network training. Extensive experimental
results demonstrate the effectiveness of our proposed method both
quantitatively and qualitatively. Notably, without leveraging target-domain GT
labels, our proposed method achieves comparable or sometimes superior
performance to existing methods that depend on the GT labels.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - Enhancing Domain Adaptation through Prompt Gradient Alignment [16.618313165111793]
We develop a line of works based on prompt learning to learn both domain-invariant and specific features.
We cast UDA as a multiple-objective optimization problem in which each objective is represented by a domain loss.
Our method consistently surpasses other prompt-based baselines by a large margin on different UDA benchmarks.
arXiv Detail & Related papers (2024-06-13T17:40:15Z) - WLST: Weak Labels Guided Self-training for Weakly-supervised Domain
Adaptation on 3D Object Detection [22.835487211419483]
weakly-supervised domain adaptation (WDA) is an underexplored yet practical task that only requires few labeling effort on the target domain.
We propose a general weak labels guided self-training framework, WLST, designed for WDA on 3D object detection.
Our method is able to generate more robust and consistent pseudo labels that would benefit the training process on the target domain.
arXiv Detail & Related papers (2023-10-05T18:17:07Z) - Contrastive Mean Teacher for Domain Adaptive Object Detectors [20.06919799819326]
Mean-teacher self-training is a powerful paradigm in unsupervised domain adaptation for object detection, but it struggles with low-quality pseudo-labels.
We propose Contrastive Mean Teacher (CMT) -- a unified, general-purpose framework with the two paradigms naturally integrated to maximize beneficial learning signals.
CMT leads to new state-of-the-art target-domain performance: 51.9% mAP on Foggy Cityscapes, outperforming the previously best by 2.1% mAP.
arXiv Detail & Related papers (2023-05-04T17:55:17Z) - Self-training through Classifier Disagreement for Cross-Domain Opinion
Target Extraction [62.41511766918932]
Opinion target extraction (OTE) or aspect extraction (AE) is a fundamental task in opinion mining.
Recent work focus on cross-domain OTE, which is typically encountered in real-world scenarios.
We propose a new SSL approach that opts for selecting target samples whose model output from a domain-specific teacher and student network disagrees on the unlabelled target data.
arXiv Detail & Related papers (2023-02-28T16:31:17Z) - Unsupervised Domain Adaptation for Monocular 3D Object Detection via
Self-Training [57.25828870799331]
We propose STMono3D, a new self-teaching framework for unsupervised domain adaptation on Mono3D.
We develop a teacher-student paradigm to generate adaptive pseudo labels on the target domain.
STMono3D achieves remarkable performance on all evaluated datasets and even surpasses fully supervised results on the KITTI 3D object detection dataset.
arXiv Detail & Related papers (2022-04-25T12:23:07Z) - RefRec: Pseudo-labels Refinement via Shape Reconstruction for
Unsupervised 3D Domain Adaptation [23.275638060852067]
RefRec is the first approach to investigate pseudo-labels and self-training in UDA for point clouds.
We present two main innovations to make self-training effective on 3D data.
arXiv Detail & Related papers (2021-10-21T10:24:05Z) - Aligning Pretraining for Detection via Object-Level Contrastive Learning [57.845286545603415]
Image-level contrastive representation learning has proven to be highly effective as a generic model for transfer learning.
We argue that this could be sub-optimal and thus advocate a design principle which encourages alignment between the self-supervised pretext task and the downstream task.
Our method, called Selective Object COntrastive learning (SoCo), achieves state-of-the-art results for transfer performance on COCO detection.
arXiv Detail & Related papers (2021-06-04T17:59:52Z) - Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic
Segmentation [63.75774438196315]
Unsupervised domain adaptation (UDA) aims to adapt existing models of the source domain to a new target domain with only unlabeled data.
Most existing methods suffer from noticeable negative transfer resulting from either the error-prone discriminator network or the unreasonable teacher model.
We propose an uncertainty-aware consistency regularization method for cross-domain semantic segmentation.
arXiv Detail & Related papers (2020-04-19T15:30:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.