Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered
by Multiple Disparity Consistency
- URL: http://arxiv.org/abs/2401.12019v2
- Date: Tue, 23 Jan 2024 03:16:43 GMT
- Title: Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered
by Multiple Disparity Consistency
- Authors: Woonghyun Ka, Jae Young Lee, Jaehyun Choi, Junmo Kim
- Abstract summary: We propose a method to identify and filter errors in the pseudo-depth map using multiple disparity maps.
Experimental results show that the proposed method outperforms the previous methods.
- Score: 31.261772846687297
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In stereo-matching knowledge distillation methods of the self-supervised
monocular depth estimation, the stereo-matching network's knowledge is
distilled into a monocular depth network through pseudo-depth maps. In these
methods, the learning-based stereo-confidence network is generally utilized to
identify errors in the pseudo-depth maps to prevent transferring the errors.
However, the learning-based stereo-confidence networks should be trained with
ground truth (GT), which is not feasible in a self-supervised setting. In this
paper, we propose a method to identify and filter errors in the pseudo-depth
map using multiple disparity maps by checking their consistency without the
need for GT and a training process. Experimental results show that the proposed
method outperforms the previous methods and works well on various
configurations by filtering out erroneous areas where the stereo-matching is
vulnerable, especially such as textureless regions, occlusion boundaries, and
reflective surfaces.
Related papers
- Depth-aware Volume Attention for Texture-less Stereo Matching [67.46404479356896]
We propose a lightweight volume refinement scheme to tackle the texture deterioration in practical outdoor scenarios.
We introduce a depth volume supervised by the ground-truth depth map, capturing the relative hierarchy of image texture.
Local fine structure and context are emphasized to mitigate ambiguity and redundancy during volume aggregation.
arXiv Detail & Related papers (2024-02-14T04:07:44Z) - Unsupervised Light Field Depth Estimation via Multi-view Feature
Matching with Occlusion Prediction [15.421219881815956]
It is costly to obtain sufficient depth labels for supervised training.
In this paper, we propose an unsupervised framework to estimate depth from LF images.
arXiv Detail & Related papers (2023-01-20T06:11:17Z) - Learning Monocular Depth Estimation via Selective Distillation of Stereo
Knowledge [34.380048111601894]
We design a decoder (MaskDecoder) that learns two binary masks which are trained to choose optimally between the proxy disparity maps and the estimated depth maps for each pixel.
The learned masks are then fed to another decoder (DepthDecoder) to enforce the estimated depths.
Experiments validate our methods achieve state-of-the-art performance for self- and proxy-supervised monocular depth estimation.
arXiv Detail & Related papers (2022-05-18T00:34:28Z) - Learning Occlusion-Aware Coarse-to-Fine Depth Map for Self-supervised
Monocular Depth Estimation [11.929584800629673]
We propose a novel network to learn an Occlusion-aware Coarse-to-Fine Depth map for self-supervised monocular depth estimation.
The proposed OCFD-Net does not only employ a discrete depth constraint for learning a coarse-level depth map, but also employ a continuous depth constraint for learning a scene depth residual.
arXiv Detail & Related papers (2022-03-21T12:43:42Z) - Towards Unpaired Depth Enhancement and Super-Resolution in the Wild [121.96527719530305]
State-of-the-art data-driven methods of depth map super-resolution rely on registered pairs of low- and high-resolution depth maps of the same scenes.
We consider an approach to depth map enhancement based on learning from unpaired data.
arXiv Detail & Related papers (2021-05-25T16:19:16Z) - H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging
Epipolar Geometry [4.968452390132676]
We introduce the H-Net, a deep-learning framework for unsupervised stereo depth estimation.
For the first time, a Siamese autoencoder architecture is used for depth estimation.
Our method outperforms the state-ofthe-art unsupervised stereo depth estimation methods.
arXiv Detail & Related papers (2021-04-22T19:16:35Z) - On the confidence of stereo matching in a deep-learning era: a
quantitative evaluation [124.09613797008099]
We review more than ten years of developments in the field of confidence estimation for stereo matching.
We study the different behaviors of each measure when applied to a pool of different stereo algorithms and, for the first time in literature, when paired with a state-of-the-art deep stereo network.
arXiv Detail & Related papers (2021-01-02T11:40:17Z) - Adaptive confidence thresholding for monocular depth estimation [83.06265443599521]
We propose a new approach to leverage pseudo ground truth depth maps of stereo images generated from self-supervised stereo matching methods.
The confidence map of the pseudo ground truth depth map is estimated to mitigate performance degeneration by inaccurate pseudo depth maps.
Experimental results demonstrate superior performance to state-of-the-art monocular depth estimation methods.
arXiv Detail & Related papers (2020-09-27T13:26:16Z) - Reversing the cycle: self-supervised deep stereo through enhanced
monocular distillation [51.714092199995044]
In many fields, self-supervised learning solutions are rapidly evolving and filling the gap with supervised approaches.
We propose a novel self-supervised paradigm reversing the link between the two.
In order to train deep stereo networks, we distill knowledge through a monocular completion network.
arXiv Detail & Related papers (2020-08-17T07:40:22Z) - Single Image Depth Estimation Trained via Depth from Defocus Cues [105.67073923825842]
Estimating depth from a single RGB image is a fundamental task in computer vision.
In this work, we rely, instead of different views, on depth from focus cues.
We present results that are on par with supervised methods on KITTI and Make3D datasets and outperform unsupervised learning approaches.
arXiv Detail & Related papers (2020-01-14T20:22:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.