Domain Adaptive Semantic Segmentation with Self-Supervised Depth
Estimation
- URL: http://arxiv.org/abs/2104.13613v1
- Date: Wed, 28 Apr 2021 07:47:36 GMT
- Title: Domain Adaptive Semantic Segmentation with Self-Supervised Depth
Estimation
- Authors: Qin Wang, Dengxin Dai, Lukas Hoyer, Olga Fink, Luc Van Gool
- Abstract summary: Domain adaptation for semantic segmentation aims to improve the model performance in the presence of a distribution shift between source and target domain.
We leverage the guidance from self-supervised depth estimation, which is available on both domains, to bridge the domain gap.
We demonstrate the effectiveness of our proposed approach on the benchmark tasks SYNTHIA-to-Cityscapes and GTA-to-Cityscapes.
- Score: 84.34227665232281
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Domain adaptation for semantic segmentation aims to improve the model
performance in the presence of a distribution shift between source and target
domain. Leveraging the supervision from auxiliary tasks~(such as depth
estimation) has the potential to heal this shift because many visual tasks are
closely related to each other. However, such a supervision is not always
available. In this work, we leverage the guidance from self-supervised depth
estimation, which is available on both domains, to bridge the domain gap. On
the one hand, we propose to explicitly learn the task feature correlation to
strengthen the target semantic predictions with the help of target depth
estimation. On the other hand, we use the depth prediction discrepancy from
source and target depth decoders to approximate the pixel-wise adaptation
difficulty. The adaptation difficulty, inferred from depth, is then used to
refine the target semantic segmentation pseudo-labels. The proposed method can
be easily implemented into existing segmentation frameworks. We demonstrate the
effectiveness of our proposed approach on the benchmark tasks
SYNTHIA-to-Cityscapes and GTA-to-Cityscapes, on which we achieve the new
state-of-the-art performance of $55.0\%$ and $56.6\%$, respectively. Our code
is available at \url{https://github.com/qinenergy/corda}.
Related papers
- MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation [155.0797148367653]
Unsupervised Domain Adaptation (UDA) is the task of bridging the domain gap between a labeled source domain and an unlabeled target domain.
We propose to leverage geometric information, i.e., depth predictions, as depth discontinuities often coincide with segmentation boundaries.
We show that our method can be plugged into various recent UDA methods and consistently improve results across standard UDA benchmarks.
arXiv Detail & Related papers (2024-08-29T12:15:10Z) - SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised
Monocular Depth Estimation [19.229255297016635]
Self-supervised depth estimation can be trapped in a local minimum due to the gradient-locality issue of the photometric loss.
We present a framework to enhance depth by leveraging semantic segmentation to guide the network to jump out of the local minimum.
arXiv Detail & Related papers (2023-03-31T17:20:27Z) - Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of
Semantics and Depth [83.94528876742096]
We tackle the MTL problem of two dense tasks, ie, semantic segmentation and depth estimation, and present a novel attention module called Cross-Channel Attention Module (CCAM)
In a true symbiotic spirit, we then formulate a novel data augmentation for the semantic segmentation task using predicted depth called AffineMix, and a simple depth augmentation using predicted semantics called ColorAug.
Finally, we validate the performance gain of the proposed method on the Cityscapes dataset, which helps us achieve state-of-the-art results for a semi-supervised joint model based on depth and semantic
arXiv Detail & Related papers (2022-06-21T17:40:55Z) - X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task
Distillation [69.9604394044652]
We propose a novel method to improve the self-supervised training of monocular depth via cross-task knowledge distillation.
During training, we utilize a pretrained semantic segmentation teacher network and transfer its semantic knowledge to the depth network.
We extensively evaluate the efficacy of our proposed approach on the KITTI benchmark and compare it with the latest state of the art.
arXiv Detail & Related papers (2021-10-24T19:47:14Z) - Learning to Relate Depth and Semantics for Unsupervised Domain
Adaptation [87.1188556802942]
We present an approach for encoding visual task relationships to improve model performance in an Unsupervised Domain Adaptation (UDA) setting.
We propose a novel Cross-Task Relation Layer (CTRL), which encodes task dependencies between the semantic and depth predictions.
Furthermore, we propose an Iterative Self-Learning (ISL) training scheme, which exploits semantic pseudo-labels to provide extra supervision on the target domain.
arXiv Detail & Related papers (2021-05-17T13:42:09Z) - Latent Space Regularization for Unsupervised Domain Adaptation in
Semantic Segmentation [14.050836886292869]
We introduce feature-level space-shaping regularization strategies to reduce the domain discrepancy in semantic segmentation.
We verify the effectiveness of such methods in the autonomous driving setting.
arXiv Detail & Related papers (2021-04-06T16:07:22Z) - Pixel-Level Cycle Association: A New Perspective for Domain Adaptive
Semantic Segmentation [169.82760468633236]
We propose to build the pixel-level cycle association between source and target pixel pairs.
Our method can be trained end-to-end in one stage and introduces no additional parameters.
arXiv Detail & Related papers (2020-10-31T00:11:36Z) - DESC: Domain Adaptation for Depth Estimation via Semantic Consistency [24.13837264978472]
We propose a domain adaptation approach to train a monocular depth estimation model.
We bridge the domain gap by leveraging semantic predictions and low-level edge features.
Our approach is evaluated on standard domain adaptation benchmarks for monocular depth estimation.
arXiv Detail & Related papers (2020-09-03T10:54:05Z) - Shape Consistent 2D Keypoint Estimation under Domain Shift [35.15266729401601]
We present a novel deep adaptation framework for estimating keypoints under domain shift.
Our method seamlessly combines three different components: feature alignment, adversarial training and self-supervision.
Our approach outperforms state-of-the-art domain adaptation methods in the 2D keypoint prediction task.
arXiv Detail & Related papers (2020-08-04T14:32:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.