AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
- URL: http://arxiv.org/abs/2112.04974v1
- Date: Thu, 9 Dec 2021 15:10:47 GMT
- Title: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
- Authors: Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Yuexin Ma, Zhe Wang,
Jianping Shi
- Abstract summary: We present a novel domain-adaptive approach called AdaStereo to align multi-level representations for deep stereo matching networks.
Our models achieve state-of-the-art cross-domain performance on multiple benchmarks, including KITTI, Middlebury, ETH3D and DrivingStereo.
Our method is robust to various domain adaptation settings, and can be easily integrated into quick adaptation application scenarios and real-world deployments.
- Score: 50.855679274530615
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, records on stereo matching benchmarks are constantly broken by
end-to-end disparity networks. However, the domain adaptation ability of these
deep models is quite limited. Addressing such problem, we present a novel
domain-adaptive approach called AdaStereo that aims to align multi-level
representations for deep stereo matching networks. Compared to previous
methods, our AdaStereo realizes a more standard, complete and effective domain
adaptation pipeline. Firstly, we propose a non-adversarial progressive color
transfer algorithm for input image-level alignment. Secondly, we design an
efficient parameter-free cost normalization layer for internal feature-level
alignment. Lastly, a highly related auxiliary task, self-supervised
occlusion-aware reconstruction is presented to narrow the gaps in output space.
We perform intensive ablation studies and break-down comparisons to validate
the effectiveness of each proposed module. With no extra inference overhead and
only a slight increase in training complexity, our AdaStereo models achieve
state-of-the-art cross-domain performance on multiple benchmarks, including
KITTI, Middlebury, ETH3D and DrivingStereo, even outperforming some
state-of-the-art disparity networks finetuned with target-domain ground-truths.
Moreover, based on two additional evaluation metrics, the superiority of our
domain-adaptive stereo matching pipeline is further uncovered from more
perspectives. Finally, we demonstrate that our method is robust to various
domain adaptation settings, and can be easily integrated into quick adaptation
application scenarios and real-world deployments.
Related papers
- SR-Stereo & DAPE: Stepwise Regression and Pre-trained Edges for Practical Stereo Matching [2.8908326904081334]
We propose a novel stepwise regression architecture to overcome domain discrepancies.
To enhance the edge awareness of models adapting new domains with sparse ground truth, we propose Domain Adaptation based on Pre-trained Edges (DAPE)
The proposed SR-Stereo and DAPE are extensively evaluated on SceneFlow, KITTI, Middbury 2014 and ETH3D.
arXiv Detail & Related papers (2024-06-11T05:25:25Z) - Multi-scale Alternated Attention Transformer for Generalized Stereo
Matching [7.493797166406228]
We present a simple but highly effective network called Alternated Attention U-shaped Transformer (AAUformer) to balance the impact of epipolar line in dual and single view.
Compared to other models, our model has several main designs.
We performed a series of both comparative studies and ablation studies on several mainstream stereo matching datasets.
arXiv Detail & Related papers (2023-08-06T08:22:39Z) - Uncertainty Guided Adaptive Warping for Robust and Efficient Stereo
Matching [77.133400999703]
Correlation based stereo matching has achieved outstanding performance.
Current methods with a fixed model do not work uniformly well across various datasets.
This paper proposes a new perspective to dynamically calculate correlation for robust stereo matching.
arXiv Detail & Related papers (2023-07-26T09:47:37Z) - PointFix: Learning to Fix Domain Bias for Robust Online Stereo
Adaptation [67.41325356479229]
We propose to incorporate an auxiliary point-selective network into a meta-learning framework, called PointFix.
In a nutshell, our auxiliary network learns to fix local variants intensively by effectively back-propagating local information through the meta-gradient.
This network is model-agnostic, so can be used in any kind of architectures in a plug-and-play manner.
arXiv Detail & Related papers (2022-07-27T07:48:29Z) - Dynamic Feature Alignment for Semi-supervised Domain Adaptation [23.67093835143]
We propose to use dynamic feature alignment to address both inter- and intra-domain discrepancy.
Our approach, which doesn't require extensive tuning or adversarial training, significantly improves the state of the art for semi-supervised domain adaptation.
arXiv Detail & Related papers (2021-10-18T22:26:27Z) - Seeking Similarities over Differences: Similarity-based Domain Alignment
for Adaptive Object Detection [86.98573522894961]
We propose a framework that generalizes the components commonly used by Unsupervised Domain Adaptation (UDA) algorithms for detection.
Specifically, we propose a novel UDA algorithm, ViSGA, that leverages the best design choices and introduces a simple but effective method to aggregate features at instance-level.
We show that both similarity-based grouping and adversarial training allows our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains.
arXiv Detail & Related papers (2021-10-04T13:09:56Z) - Continual Adaptation for Deep Stereo [52.181067640300014]
We propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments.
In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms.
Our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system.
arXiv Detail & Related papers (2020-07-10T08:15:58Z) - AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching [50.06646151004375]
A novel domain-adaptive pipeline called AdaStereo aims to align multi-level representations for deep stereo matching networks.
Our AdaStereo models achieve state-of-the-art cross-domain performance on multiple stereo benchmarks, including KITTI, Middlebury, ETH3D, and DrivingStereo.
arXiv Detail & Related papers (2020-04-09T16:15:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.