Depth Refinement for Improved Stereo Reconstruction
- URL: http://arxiv.org/abs/2112.08070v1
- Date: Wed, 15 Dec 2021 12:21:08 GMT
- Title: Depth Refinement for Improved Stereo Reconstruction
- Authors: Amit Bracha, Noam Rotstein, David Bensa\"id, Ron Slossberg and Ron
Kimmel
- Abstract summary: Current techniques for depth estimation from stereoscopic images still suffer from a built-in drawback.
A simple analysis reveals that the depth error is quadratically proportional to the object's distance.
We propose a simple but effective method that uses a refinement network for depth estimation.
- Score: 13.941756438712382
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Depth estimation is a cornerstone of a vast number of applications requiring
3D assessment of the environment, such as robotics, augmented reality, and
autonomous driving to name a few. One prominent technique for depth estimation
is stereo matching which has several advantages: it is considered more
accessible than other depth-sensing technologies, can produce dense depth
estimates in real-time, and has benefited greatly from the advances of deep
learning in recent years. However, current techniques for depth estimation from
stereoscopic images still suffer from a built-in drawback. To reconstruct
depth, a stereo matching algorithm first estimates the disparity map between
the left and right images before applying a geometric triangulation. A simple
analysis reveals that the depth error is quadratically proportional to the
object's distance. Therefore, constant disparity errors are translated to large
depth errors for objects far from the camera. To mitigate this quadratic
relation, we propose a simple but effective method that uses a refinement
network for depth estimation. We show analytical and empirical results
suggesting that the proposed learning procedure reduces this quadratic
relation. We evaluate the proposed refinement procedure on well-known
benchmarks and datasets, like Sceneflow and KITTI datasets, and demonstrate
significant improvements in the depth accuracy metric.
Related papers
- FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen
Indoor Scene [57.26600120397529]
It has long been an ill-posed problem to predict absolute depth maps from single images in real (unseen) indoor scenes.
We develop a focal-and-scale depth estimation model to well learn absolute depth maps from single images in unseen indoor scenes.
arXiv Detail & Related papers (2023-07-27T04:49:36Z) - Probabilistic and Geometric Depth: Detecting Objects in Perspective [78.00922683083776]
3D object detection is an important capability needed in various practical applications such as driver assistance systems.
Monocular 3D detection, as an economical solution compared to conventional settings relying on binocular vision or LiDAR, has drawn increasing attention recently but still yields unsatisfactory results.
This paper first presents a systematic study on this problem and observes that the current monocular 3D detection problem can be simplified as an instance depth estimation problem.
arXiv Detail & Related papers (2021-07-29T16:30:33Z) - Geometry Uncertainty Projection Network for Monocular 3D Object
Detection [138.24798140338095]
We propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages.
Specifically, a GUP module is proposed to obtains the geometry-guided uncertainty of the inferred depth.
At the training stage, we propose a Hierarchical Task Learning strategy to reduce the instability caused by error amplification.
arXiv Detail & Related papers (2021-07-29T06:59:07Z) - Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust
Depth Prediction [87.08227378010874]
We show the importance of the high-order 3D geometric constraints for depth prediction.
By designing a loss term that enforces a simple geometric constraint, we significantly improve the accuracy and robustness of monocular depth estimation.
We show state-of-the-art results of learning metric depth on NYU Depth-V2 and KITTI.
arXiv Detail & Related papers (2021-03-07T00:08:21Z) - Direct Depth Learning Network for Stereo Matching [79.3665881702387]
A novel Direct Depth Learning Network (DDL-Net) is designed for stereo matching.
DDL-Net consists of two stages: the Coarse Depth Estimation stage and the Adaptive-Grained Depth Refinement stage.
We show that DDL-Net achieves an average improvement of 25% on the SceneFlow dataset and $12%$ on the DrivingStereo dataset.
arXiv Detail & Related papers (2020-12-10T10:33:57Z) - Variational Monocular Depth Estimation for Reliability Prediction [12.951621755732544]
Self-supervised learning for monocular depth estimation is widely investigated as an alternative to supervised learning approach.
Previous works have successfully improved the accuracy of depth estimation by modifying the model structure.
In this paper, we theoretically formulate a variational model for the monocular depth estimation to predict the reliability of the estimated depth image.
arXiv Detail & Related papers (2020-11-24T06:23:51Z) - Deep Multi-view Depth Estimation with Predicted Uncertainty [11.012201499666503]
We employ a dense-optical-flow network to compute correspondences and then triangulate the point cloud to obtain an initial depth map.
To further increase the triangulation accuracy, we introduce a depth-refinement network (DRN) that optimize the initial depth map based on the image's contextual cues.
arXiv Detail & Related papers (2020-11-19T00:22:09Z) - Adjusting Bias in Long Range Stereo Matching: A semantics guided
approach [14.306250516592305]
We propose a pair of novel depth-based loss functions for foreground and background, separately.
These loss functions are tunable and can balance the inherent bias of the stereo learning algorithms.
Our solution yields substantial improvements in disparity and depth estimation, particularly for objects located at distances beyond 50 meters.
arXiv Detail & Related papers (2020-09-10T01:47:53Z) - Occlusion-Aware Depth Estimation with Adaptive Normal Constraints [85.44842683936471]
We present a new learning-based method for multi-frame depth estimation from a color video.
Our method outperforms the state-of-the-art in terms of depth estimation accuracy.
arXiv Detail & Related papers (2020-04-02T07:10:45Z) - DELTAS: Depth Estimation by Learning Triangulation And densification of
Sparse points [14.254472131009653]
Multi-view stereo (MVS) is the golden mean between the accuracy of active depth sensing and the practicality of monocular depth estimation.
Cost volume based approaches employing 3D convolutional neural networks (CNNs) have considerably improved the accuracy of MVS systems.
We propose an efficient depth estimation approach by first (a) detecting and evaluating descriptors for interest points, then (b) learning to match and triangulate a small set of interest points, and finally (c) densifying this sparse set of 3D points using CNNs.
arXiv Detail & Related papers (2020-03-19T17:56:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.