Bayesian Learning for Disparity Map Refinement for Semi-Dense Active
Stereo Vision
- URL: http://arxiv.org/abs/2209.05082v1
- Date: Mon, 12 Sep 2022 08:33:40 GMT
- Title: Bayesian Learning for Disparity Map Refinement for Semi-Dense Active
Stereo Vision
- Authors: Laurent Valentin Jospin, Hamid Laga, Farid Boussaid, Mohammed
Bennamoun
- Abstract summary: We propose a new learning strategy to train neural networks to estimate high-quality subpixel disparity maps for semi-dense active stereo vision.
We demonstrate that the proposed method outperforms the current state-of-the-art active stereo models.
- Score: 30.330599857204344
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A major focus of recent developments in stereo vision has been on how to
obtain accurate dense disparity maps in passive stereo vision. Active vision
systems enable more accurate estimations of dense disparity compared to passive
stereo. However, subpixel-accurate disparity estimation remains an open problem
that has received little attention. In this paper, we propose a new learning
strategy to train neural networks to estimate high-quality subpixel disparity
maps for semi-dense active stereo vision. The key insight is that neural
networks can double their accuracy if they are able to jointly learn how to
refine the disparity map while invalidating the pixels where there is
insufficient information to correct the disparity estimate. Our approach is
based on Bayesian modeling where validated and invalidated pixels are defined
by their stochastic properties, allowing the model to learn how to choose by
itself which pixels are worth its attention. Using active stereo datasets such
as Active-Passive SimStereo, we demonstrate that the proposed method
outperforms the current state-of-the-art active stereo models. We also
demonstrate that the proposed approach compares favorably with state-of-the-art
passive stereo models on the Middlebury dataset.
Related papers
- MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling [18.02254687807291]
Transformer-based stereo models have been studied recently, their performance still lags behind CNN-based stereo models due to the inherent data scarcity issue in the stereo matching task.
We propose Masked Image Modeling Distilled Stereo matching model, termed MaDis-Stereo, that enhances locality inductive bias by leveraging Masked Image Modeling (MIM) in training Transformer-based stereo model.
arXiv Detail & Related papers (2024-09-04T16:17:45Z) - Left-right Discrepancy for Adversarial Attack on Stereo Networks [8.420135490466851]
We introduce a novel adversarial attack approach that generates perturbation noise specifically designed to maximize the discrepancy between left and right image features.
Experiments demonstrate the superior capability of our method to induce larger prediction errors in stereo neural networks.
arXiv Detail & Related papers (2024-01-14T02:30:38Z) - Active-Passive SimStereo -- Benchmarking the Cross-Generalization
Capabilities of Deep Learning-based Stereo Methods [26.662129158141763]
Self-similar or bland regions can make it difficult to match patches between two images.
Active stereo-based methods mitigate this problem by projecting a pseudo-random pattern on the scene.
If this pattern acts as a form of adversarial noise, it could negatively impact the performance of deep learning-based methods.
arXiv Detail & Related papers (2022-09-17T10:30:32Z) - Self-Supervised Depth Completion for Active Stereo [55.79929735390945]
Active stereo systems are widely used in the robotics industry due to their low cost and high quality depth maps.
These depth sensors suffer from stereo artefacts and do not provide dense depth estimates.
We present the first self-supervised depth completion method for active stereo systems that predicts accurate dense depth maps.
arXiv Detail & Related papers (2021-10-07T07:33:52Z) - CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - SMD-Nets: Stereo Mixture Density Networks [68.56947049719936]
We propose Stereo Mixture Density Networks (SMD-Nets), a simple yet effective learning framework compatible with a wide class of 2D and 3D architectures.
Specifically, we exploit bimodal mixture densities as output representation and show that this allows for sharp and precise disparity estimates near discontinuities.
We carry out comprehensive experiments on a new high-resolution and highly realistic synthetic stereo dataset, consisting of stereo pairs at 8Mpx resolution, as well as on real-world stereo datasets.
arXiv Detail & Related papers (2021-04-08T16:15:46Z) - Stereopagnosia: Fooling Stereo Networks with Adversarial Perturbations [71.00754846434744]
We show that imperceptible additive perturbations can significantly alter the disparity map.
We show that, when used for adversarial data augmentation, our perturbations result in trained models that are more robust.
arXiv Detail & Related papers (2020-09-21T19:20:09Z) - Reversing the cycle: self-supervised deep stereo through enhanced
monocular distillation [51.714092199995044]
In many fields, self-supervised learning solutions are rapidly evolving and filling the gap with supervised approaches.
We propose a novel self-supervised paradigm reversing the link between the two.
In order to train deep stereo networks, we distill knowledge through a monocular completion network.
arXiv Detail & Related papers (2020-08-17T07:40:22Z) - Expanding Sparse Guidance for Stereo Matching [24.74333370941674]
We propose a novel sparsity expansion technique to expand the sparse cues concerning RGB images for local feature enhancement.
Our approach significantly boosts the existing state-of-the-art stereo algorithms with extremely sparse cues.
arXiv Detail & Related papers (2020-04-24T06:41:11Z) - Distilling Localization for Self-Supervised Representation Learning [82.79808902674282]
Contrastive learning has revolutionized unsupervised representation learning.
Current contrastive models are ineffective at localizing the foreground object.
We propose a data-driven approach for learning in variance to backgrounds.
arXiv Detail & Related papers (2020-04-14T16:29:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.