Unsupervised domain adaptation for clinician pose estimation and
instance segmentation in the OR
- URL: http://arxiv.org/abs/2108.11801v1
- Date: Thu, 26 Aug 2021 14:07:43 GMT
- Title: Unsupervised domain adaptation for clinician pose estimation and
instance segmentation in the OR
- Authors: Vinkle Srivastav, Afshin Gangi, Nicolas Padoy
- Abstract summary: We study how joint person pose estimation and segmentation instance can be performed on low resolutions images from 1x to 12x.
We propose a novel unsupervised domain adaptation method, called emphAdaptOR, to adapt a model from an emphin-the-wild labeled source domain to a statistically different unlabeled target domain.
We show the generality of our method as a semi-supervised learning (SSL) method on the large-scale emphCOCO dataset.
- Score: 4.024513066910992
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The fine-grained localization of clinicians in the operating room (OR) is a
key component to design the new generation of OR support systems. Computer
vision models for person pixel-based segmentation and body-keypoints detection
are needed to better understand the clinical activities and the spatial layout
of the OR. This is challenging, not only because OR images are very different
from traditional vision datasets, but also because data and annotations are
hard to collect and generate in the OR due to privacy concerns. To address
these concerns, we first study how joint person pose estimation and instance
segmentation can be performed on low resolutions images from 1x to 12x. Second,
to address the domain shift and the lack of annotations, we propose a novel
unsupervised domain adaptation method, called \emph{AdaptOR}, to adapt a model
from an \emph{in-the-wild} labeled source domain to a statistically different
unlabeled target domain. We propose to exploit explicit geometric constraints
on the different augmentations of the unlabeled target domain image to generate
accurate pseudo labels, and using these pseudo labels to train the model on
high- and low-resolution OR images in a \emph{self-training} framework.
Furthermore, we propose \emph{disentangled feature normalization} to handle the
statistically different source and target domain data. Extensive experimental
results with detailed ablation studies on the two OR datasets \emph{MVOR+} and
\emph{TUM-OR-test} show the effectiveness of our approach against strongly
constructed baselines, especially on the low-resolution privacy-preserving OR
images. Finally, we show the generality of our method as a semi-supervised
learning (SSL) method on the large-scale \emph{COCO} dataset, where we achieve
comparable results with as few as \textbf{1\%} of labeled supervision against a
model trained with 100\% labeled supervision.
Related papers
- Unsupervised Camouflaged Object Segmentation as Domain Adaptation [5.304455190825436]
In this work, we investigate a new task, namely unsupervised camouflaged object segmentation (UCOS), where the target objects own a common attribute, i.e., camouflage.
We find that the state-of-the-art unsupervised models struggle in adapting UCOS, due to the domain gap between the properties of generic and camouflaged objects.
We formulate the UCOS as a source-free unsupervised domain adaptation task (UCOS-DA), where both source labels and target labels are absent during the whole model training process.
arXiv Detail & Related papers (2023-08-08T18:46:16Z) - Adaptive Face Recognition Using Adversarial Information Network [57.29464116557734]
Face recognition models often degenerate when training data are different from testing data.
We propose a novel adversarial information network (AIN) to address it.
arXiv Detail & Related papers (2023-05-23T02:14:11Z) - Deep face recognition with clustering based domain adaptation [57.29464116557734]
We propose a new clustering-based domain adaptation method designed for face recognition task in which the source and target domain do not share any classes.
Our method effectively learns the discriminative target feature by aligning the feature domain globally, and, at the meantime, distinguishing the target clusters locally.
arXiv Detail & Related papers (2022-05-27T12:29:11Z) - Source-Free Domain Adaptive Fundus Image Segmentation with Denoised
Pseudo-Labeling [56.98020855107174]
Domain adaptation typically requires to access source domain data to utilize their distribution information for domain alignment with the target data.
In many real-world scenarios, the source data may not be accessible during the model adaptation in the target domain due to privacy issue.
We present a novel denoised pseudo-labeling method for this problem, which effectively makes use of the source model and unlabeled target data.
arXiv Detail & Related papers (2021-09-19T06:38:21Z) - Semi-Supervised Domain Adaptation with Prototypical Alignment and
Consistency Learning [86.6929930921905]
This paper studies how much it can help address domain shifts if we further have a few target samples labeled.
To explore the full potential of landmarks, we incorporate a prototypical alignment (PA) module which calculates a target prototype for each class from the landmarks.
Specifically, we severely perturb the labeled images, making PA non-trivial to achieve and thus promoting model generalizability.
arXiv Detail & Related papers (2021-04-19T08:46:08Z) - Semantic Segmentation with Generative Models: Semi-Supervised Learning
and Strong Out-of-Domain Generalization [112.68171734288237]
We propose a novel framework for discriminative pixel-level tasks using a generative model of both images and labels.
We learn a generative adversarial network that captures the joint image-label distribution and is trained efficiently using a large set of unlabeled images.
We demonstrate strong in-domain performance compared to several baselines, and are the first to showcase extreme out-of-domain generalization.
arXiv Detail & Related papers (2021-04-12T21:41:25Z) - Consistency Regularization with High-dimensional Non-adversarial
Source-guided Perturbation for Unsupervised Domain Adaptation in Segmentation [15.428323201750144]
BiSIDA employs consistency regularization to efficiently exploit information from the unlabeled target dataset.
BiSIDA achieves new state-of-the-art on two commonly-used synthetic-to-real domain adaptation benchmarks.
arXiv Detail & Related papers (2020-09-18T03:26:44Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z) - Domain Adaptive Medical Image Segmentation via Adversarial Learning of
Disease-Specific Spatial Patterns [6.298270929323396]
We propose an unsupervised domain adaptation framework for boosting image segmentation performance across multiple domains.
We enforce architectures to be adaptive to new data by rejecting improbable segmentation patterns and implicitly learning through semantic and boundary information.
We demonstrate that recalibrating the deep networks on a few unlabeled images from the target domain improves the segmentation accuracy significantly.
arXiv Detail & Related papers (2020-01-25T13:48:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.