ADCPNet: Adaptive Disparity Candidates Prediction Network for Efficient
Real-Time Stereo Matching
- URL: http://arxiv.org/abs/2011.09023v1
- Date: Wed, 18 Nov 2020 01:18:52 GMT
- Title: ADCPNet: Adaptive Disparity Candidates Prediction Network for Efficient
Real-Time Stereo Matching
- Authors: He Dai, Xuchong Zhang, Yongli Zhao, Hongbin Sun
- Abstract summary: coarse-to-fine method has largely relieved the memory constraints and speed limitations of large-scale network models.
Previous coarse-to-fine designs employ constant offsets and three or more stages to progressively refine the coarse disparity map.
This paper claims that the coarse matching errors can be corrected efficiently with fewer stages as long as more accurate disparity candidates can be provided.
- Score: 8.046317778069325
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Efficient real-time disparity estimation is critical for the application of
stereo vision systems in various areas. Recently, stereo network based on
coarse-to-fine method has largely relieved the memory constraints and speed
limitations of large-scale network models. Nevertheless, all of the previous
coarse-to-fine designs employ constant offsets and three or more stages to
progressively refine the coarse disparity map, still resulting in
unsatisfactory computation accuracy and inference time when deployed on mobile
devices. This paper claims that the coarse matching errors can be corrected
efficiently with fewer stages as long as more accurate disparity candidates can
be provided. Therefore, we propose a dynamic offset prediction module to meet
different correction requirements of diverse objects and design an efficient
two-stage framework. Besides, we propose a disparity-independent convolution to
further improve the performance since it is more consistent with the local
statistical characteristics of the compact cost volume. The evaluation results
on multiple datasets and platforms clearly demonstrate that, the proposed
network outperforms the state-of-the-art lightweight models especially for
mobile devices in terms of accuracy and speed. Code will be made available.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Multi-scale Iterative Residuals for Fast and Scalable Stereo Matching [13.76996108304056]
This paper presents an iterative multi-scale coarse-to-fine refinement (iCFR) framework to bridge this gap.
We use multi-scale warped features to estimate disparity residuals and push the disparity search range in the cost volume to a minimum limit.
Finally, we apply a refinement network to recover the loss of precision which is inherent in multi-scale approaches.
arXiv Detail & Related papers (2021-10-25T09:54:17Z) - FADNet++: Real-Time and Accurate Disparity Estimation with Configurable
Networks [19.29846600092521]
FADNet++ is an efficient deep network for disparity estimation.
It can boost its accuracy with a fast model inference speed for real-time applications.
It achieves a new state-of-the-art result for the SceneFlow dataset.
arXiv Detail & Related papers (2021-10-06T08:50:33Z) - Multi-Exit Semantic Segmentation Networks [78.44441236864057]
We propose a framework for converting state-of-the-art segmentation models to MESS networks.
specially trained CNNs that employ parametrised early exits along their depth to save during inference on easier samples.
We co-optimise the number, placement and architecture of the attached segmentation heads, along with the exit policy, to adapt to the device capabilities and application-specific requirements.
arXiv Detail & Related papers (2021-06-07T11:37:03Z) - CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching [27.313740022587442]
We propose CFNet, a Cascade and Fused cost volume based network to improve the robustness of the stereo matching network.
We employ a variance-based uncertainty estimation to adaptively adjust the next stage disparity search space.
Our proposed method achieves the state-of-the-art overall performance and obtains the 1st place on the stereo task of Robust Vision Challenge 2020.
arXiv Detail & Related papers (2021-04-09T11:38:59Z) - Continual Adaptation for Deep Stereo [52.181067640300014]
We propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments.
In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms.
Our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system.
arXiv Detail & Related papers (2020-07-10T08:15:58Z) - FADNet: A Fast and Accurate Network for Disparity Estimation [18.05392578461659]
We propose an efficient and accurate deep network for disparity estimation named FADNet.
It exploits efficient 2D based correlation layers with stacked blocks to preserve fast computation.
It contains multi-scale predictions so as to exploit a multi-scale weight scheduling training technique to improve the accuracy.
arXiv Detail & Related papers (2020-03-24T10:27:11Z) - Resolution Adaptive Networks for Efficient Inference [53.04907454606711]
We propose a novel Resolution Adaptive Network (RANet), which is inspired by the intuition that low-resolution representations are sufficient for classifying "easy" inputs.
In RANet, the input images are first routed to a lightweight sub-network that efficiently extracts low-resolution representations.
High-resolution paths in the network maintain the capability to recognize the "hard" samples.
arXiv Detail & Related papers (2020-03-16T16:54:36Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z) - Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by
Enabling Input-Adaptive Inference [119.19779637025444]
Deep networks were recently suggested to face the odds between accuracy (on clean natural images) and robustness (on adversarially perturbed images)
This paper studies multi-exit networks associated with input-adaptive inference, showing their strong promise in achieving a "sweet point" in cooptimizing model accuracy, robustness and efficiency.
arXiv Detail & Related papers (2020-02-24T00:40:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.