Domain Adaptive Semantic Segmentation by Optimal Transport
- URL: http://arxiv.org/abs/2303.16435v1
- Date: Wed, 29 Mar 2023 03:33:54 GMT
- Title: Domain Adaptive Semantic Segmentation by Optimal Transport
- Authors: Yaqian Guo, Xin Wang, Ce Li, Shihui Ying
- Abstract summary: semantic scene segmentation has received a great deal of attention due to the richness of the semantic information it contains.
Current approaches are mainly based on convolutional neural networks (CNN), but they rely on a large number of labels.
We propose a domain adaptation (DA) framework based on optimal transport (OT) and attention mechanism to address this issue.
- Score: 13.133890240271308
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Scene segmentation is widely used in the field of autonomous driving for
environment perception, and semantic scene segmentation (3S) has received a
great deal of attention due to the richness of the semantic information it
contains. It aims to assign labels to pixels in an image, thus enabling
automatic image labeling. Current approaches are mainly based on convolutional
neural networks (CNN), but they rely on a large number of labels. Therefore,
how to use a small size of labeled data to achieve semantic segmentation
becomes more and more important. In this paper, we propose a domain adaptation
(DA) framework based on optimal transport (OT) and attention mechanism to
address this issue. Concretely, first we generate the output space via CNN due
to its superiority of feature representation. Second, we utilize OT to achieve
a more robust alignment of source and target domains in output space, where the
OT plan defines a well attention mechanism to improve the adaptation of the
model. In particular, with OT, the number of network parameters has been
reduced and the network has been better interpretable. Third, to better
describe the multi-scale property of features, we construct a multi-scale
segmentation network to perform domain adaptation. Finally, in order to verify
the performance of our proposed method, we conduct experimental comparison with
three benchmark and four SOTA methods on three scene datasets, and the mean
intersection-over-union (mIOU) has been significant improved, and visualization
results under multiple domain adaptation scenarios also show that our proposed
method has better performance than compared semantic segmentation methods.
Related papers
- UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer
via Hierarchical Mask Calibration [49.16591283724376]
We design UniDAformer, a unified domain adaptive panoptic segmentation transformer that is simple but can achieve domain adaptive instance segmentation and semantic segmentation simultaneously within a single network.
UniDAformer introduces Hierarchical Mask (HMC) that rectifies inaccurate predictions at the level of regions, superpixels and annotated pixels via online self-training on the fly.
It has three unique features: 1) it enables unified domain adaptive panoptic adaptation; 2) it mitigates false predictions and improves domain adaptive panoptic segmentation effectively; 3) it is end-to-end trainable with a much simpler training and inference pipeline.
arXiv Detail & Related papers (2022-06-30T07:32:23Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with
Self-Supervised Depth Estimation [94.16816278191477]
We present a framework for semi-adaptive and domain-supervised semantic segmentation.
It is enhanced by self-supervised monocular depth estimation trained only on unlabeled image sequences.
We validate the proposed model on the Cityscapes dataset.
arXiv Detail & Related papers (2021-08-28T01:33:38Z) - Latent Space Regularization for Unsupervised Domain Adaptation in
Semantic Segmentation [14.050836886292869]
We introduce feature-level space-shaping regularization strategies to reduce the domain discrepancy in semantic segmentation.
We verify the effectiveness of such methods in the autonomous driving setting.
arXiv Detail & Related papers (2021-04-06T16:07:22Z) - Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation [11.895722159139108]
Domain adaptation is an important task to enable learning when labels are scarce.
We propose cross-modal learning, where we enforce consistency between the predictions of two modalities via mutual mimicking.
We constrain our network to make correct predictions on labeled data and consistent predictions across modalities on unlabeled target-domain data.
arXiv Detail & Related papers (2021-01-18T18:59:21Z) - Towards Adaptive Semantic Segmentation by Progressive Feature Refinement [16.40758125170239]
We propose an innovative progressive feature refinement framework, along with domain adversarial learning to boost the transferability of segmentation networks.
As a result, the segmentation models trained with source domain images can be transferred to a target domain without significant performance degradation.
arXiv Detail & Related papers (2020-09-30T04:17:48Z) - Affinity Space Adaptation for Semantic Segmentation Across Domains [57.31113934195595]
In this paper, we address the problem of unsupervised domain adaptation (UDA) in semantic segmentation.
Motivated by the fact that source and target domain have invariant semantic structures, we propose to exploit such invariance across domains.
We develop two affinity space adaptation strategies: affinity space cleaning and adversarial affinity space alignment.
arXiv Detail & Related papers (2020-09-26T10:28:11Z) - Spatial Attention Pyramid Network for Unsupervised Domain Adaptation [66.75008386980869]
Unsupervised domain adaptation is critical in various computer vision tasks.
We design a new spatial attention pyramid network for unsupervised domain adaptation.
Our method performs favorably against the state-of-the-art methods by a large margin.
arXiv Detail & Related papers (2020-03-29T09:03:23Z) - Gated Path Selection Network for Semantic Segmentation [72.44994579325822]
We develop a novel network named Gated Path Selection Network (GPSNet), which aims to learn adaptive receptive fields.
In GPSNet, we first design a two-dimensional multi-scale network - SuperNet, which densely incorporates features from growing receptive fields.
To dynamically select desirable semantic context, a gate prediction module is further introduced.
arXiv Detail & Related papers (2020-01-19T12:32:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.