Content-Aware Inter-Scale Cost Aggregation for Stereo Matching
- URL: http://arxiv.org/abs/2006.03209v1
- Date: Fri, 5 Jun 2020 02:38:34 GMT
- Title: Content-Aware Inter-Scale Cost Aggregation for Stereo Matching
- Authors: Chengtang Yao, Yunde Jia, Huijun Di, Yuwei Wu, Lidong Yu
- Abstract summary: Our method achieves reliable detail recovery when upsampling through the aggregation of information across different scales.
A novel decomposition strategy is proposed to efficiently construct the 3D filter weights and aggregate the 3D cost volume.
Experiment results on Scene Flow dataset, KITTI2015 and Middlebury demonstrate the effectiveness of our method.
- Score: 42.02981855948903
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cost aggregation is a key component of stereo matching for high-quality depth
estimation. Most methods use multi-scale processing to downsample cost volume
for proper context information, but will cause loss of details when upsampling.
In this paper, we present a content-aware inter-scale cost aggregation method
that adaptively aggregates and upsamples the cost volume from coarse-scale to
fine-scale by learning dynamic filter weights according to the content of the
left and right views on the two scales. Our method achieves reliable detail
recovery when upsampling through the aggregation of information across
different scales. Furthermore, a novel decomposition strategy is proposed to
efficiently construct the 3D filter weights and aggregate the 3D cost volume,
which greatly reduces the computation cost. We first learn the 2D similarities
via the feature maps on the two scales, and then build the 3D filter weights
based on the 2D similarities from the left and right views. After that, we
split the aggregation in a full 3D spatial-disparity space into the aggregation
in 1D disparity space and 2D spatial space. Experiment results on Scene Flow
dataset, KITTI2015 and Middlebury demonstrate the effectiveness of our method.
Related papers
- Mip-Splatting: Alias-free 3D Gaussian Splatting [52.366815964832426]
3D Gaussian Splatting has demonstrated impressive novel view synthesis results, reaching high fidelity and efficiency.
Strong artifacts can be observed when changing the sampling rate, eg, by changing focal length or camera distance.
We find that the source for this phenomenon can be attributed to the lack of 3D frequency constraints and the usage of a 2D dilation filter.
arXiv Detail & Related papers (2023-11-27T13:03:09Z) - Image-Coupled Volume Propagation for Stereo Matching [0.24366811507669117]
We propose a new way to process the 4D cost volume where we merge two different concepts in one framework to achieve a symbiotic relationship.
A feature matching part is responsible for identifying matching pixels pairs along the baseline while a concurrent image volume part is inspired by depth-from-mono CNNs.
Our end-to-end trained CNN is ranked 2nd on KITTI2012 and ETH3D benchmarks while being significantly faster than the 1st-ranked method.
arXiv Detail & Related papers (2022-12-30T13:23:25Z) - Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume
Excitation [65.83008812026635]
We construct Guided Cost volume Excitation (GCE) and show that simple channel excitation of cost volume guided by image can improve performance considerably.
We present an end-to-end network that we call Correlate-and-Excite (CoEx)
arXiv Detail & Related papers (2021-08-12T14:32:26Z) - Stereo Object Matching Network [78.35697025102334]
This paper presents a stereo object matching method that exploits both 2D contextual information from images and 3D object-level information.
We present two novel strategies to handle 3D objectness in the cost volume space: selective sampling (RoISelect) and 2D-3D fusion.
arXiv Detail & Related papers (2021-03-23T12:54:43Z) - Displacement-Invariant Matching Cost Learning for Accurate Optical Flow
Estimation [109.64756528516631]
Learning matching costs have been shown to be critical to the success of the state-of-the-art deep stereo matching methods.
This paper proposes a novel solution that is able to bypass the requirement of building a 5D feature volume.
Our approach achieves state-of-the-art accuracy on various datasets, and outperforms all published optical flow methods on the Sintel benchmark.
arXiv Detail & Related papers (2020-10-28T09:57:00Z) - Do End-to-end Stereo Algorithms Under-utilize Information? [7.538482310185133]
We show how deep adaptive filtering and differentiable semi-global aggregation can be integrated in 2D and 3D convolutional networks for end-to-end stereo matching.
The improvements are due to utilizing RGB information from the images as a signal to dynamically guide the matching process.
arXiv Detail & Related papers (2020-10-14T18:32:39Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.