Continuous Cost Aggregation for Dual-Pixel Disparity Extraction
- URL: http://arxiv.org/abs/2306.07921v1
- Date: Tue, 13 Jun 2023 17:26:50 GMT
- Title: Continuous Cost Aggregation for Dual-Pixel Disparity Extraction
- Authors: Sagi Monin, Sagi Katz and Georgios Evangelidis
- Abstract summary: We propose a continuous cost aggregation scheme for Dual-Pixel (DP) images.
The proposed algorithm fits parabolas to matching costs and aggregates parabola coefficients along image paths.
Experiments on DP data from both DSLR and phone cameras show that the proposed scheme attains state-of-the-art performance in DP disparity estimation.
- Score: 3.1153758106426603
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent works have shown that depth information can be obtained from
Dual-Pixel (DP) sensors. A DP arrangement provides two views in a single shot,
thus resembling a stereo image pair with a tiny baseline. However, the
different point spread function (PSF) per view, as well as the small disparity
range, makes the use of typical stereo matching algorithms problematic. To
address the above shortcomings, we propose a Continuous Cost Aggregation (CCA)
scheme within a semi-global matching framework that is able to provide accurate
continuous disparities from DP images. The proposed algorithm fits parabolas to
matching costs and aggregates parabola coefficients along image paths. The
aggregation step is performed subject to a quadratic constraint that not only
enforces the disparity smoothness but also maintains the quadratic form of the
total costs. This gives rise to an inherently efficient disparity propagation
scheme with a pixel-wise minimization in closed-form. Furthermore, the
continuous form allows for a robust multi-scale aggregation that better
compensates for the varying PSF. Experiments on DP data from both DSLR and
phone cameras show that the proposed scheme attains state-of-the-art
performance in DP disparity estimation.
Related papers
- Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast [7.092718945468069]
Domain adaptation aims to reduce the model degradation on the target domain caused by the domain shift between the source and target domains.
Probabilistic proto-typical pixel contrast (PPPC) is a universal adaptation framework that models each pixel embedding as a probability.
PPPC not only helps to address ambiguity at the pixel level, yielding discriminative representations but also significant improvements in both synthetic-to-real and day-to-night adaptation tasks.
arXiv Detail & Related papers (2024-09-27T08:25:03Z) - Improved Communication-Privacy Trade-offs in $L_2$ Mean Estimation under Streaming Differential Privacy [47.997934291881414]
Existing mean estimation schemes are usually optimized for $L_infty$ geometry and rely on random rotation or Kashin's representation to adapt to $L$ geometry.
We introduce a novel privacy accounting method for the sparsified Gaussian mechanism that incorporates the randomness inherent in sparsification into the DP.
Unlike previous approaches, our accounting algorithm directly operates in $L$ geometry, yielding MSEs that fast converge to those of the Gaussian mechanism.
arXiv Detail & Related papers (2024-05-02T03:48:47Z) - Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging [25.851398356458425]
Single-shot 3D sensing is useful in many application areas such as microscopy, medical imaging, surgical navigation, and autonomous driving.
We propose CADS (Coded Aperture Dual-Pixel Sensing), in which we use a coded aperture in the imaging lens along with a DP sensor.
Our resulting CADS imaging system demonstrates improvement of >1.5dB PSNR in all-in-focus (AIF) estimates and 5-6% in depth estimation quality over naive DP sensing.
arXiv Detail & Related papers (2024-02-28T06:45:47Z) - Semi-Supervised Coupled Thin-Plate Spline Model for Rotation Correction and Beyond [84.56978780892783]
We propose CoupledTPS, which iteratively couples multiple TPS with limited control points into a more flexible and powerful transformation.
In light of the laborious annotation cost, we develop a semi-supervised learning scheme to improve warping quality by exploiting unlabeled data.
Experiments demonstrate the superiority and universality of CoupledTPS over the existing state-of-the-art solutions for rotation correction.
arXiv Detail & Related papers (2024-01-24T13:03:28Z) - Video Frame Interpolation with Many-to-many Splatting and Spatial
Selective Refinement [83.60486465697318]
We propose a fully differentiable Many-to-Many (M2M) splatting framework to interpolate frames efficiently.
For each input frame pair, M2M has a minuscule computational overhead when interpolating an arbitrary number of in-between frames.
We extend an M2M++ framework by introducing a flexible Spatial Selective Refinement component, which allows for trading computational efficiency for quality and vice versa.
arXiv Detail & Related papers (2023-10-29T09:09:32Z) - Improving Misaligned Multi-modality Image Fusion with One-stage
Progressive Dense Registration [67.23451452670282]
Misalignments between multi-modality images pose challenges in image fusion.
We propose a Cross-modality Multi-scale Progressive Dense Registration scheme.
This scheme accomplishes the coarse-to-fine registration exclusively using a one-stage optimization.
arXiv Detail & Related papers (2023-08-22T03:46:24Z) - Spatially Varying Exposure with 2-by-2 Multiplexing: Optimality and
Universality [15.525314212209564]
We propose a new concept known as the spatially varying exposure risk (SVE-Risk) which is a pseudo-idealistic enumeration of the amount of recoverable pixels.
We show that given a multiplex pattern, the conventional optimality criteria based on the input/output-referred signal-to-noise ratio can lead to flawed decisions.
We report a design observation that the design universality pattern can be decoupled from the image reconstruction algorithm.
arXiv Detail & Related papers (2023-06-30T02:08:25Z) - Many-to-many Splatting for Efficient Video Frame Interpolation [80.10804399840927]
Motion-based video frame relies on optical flow to warp pixels from inputs to desired instant.
Many-to-Many (M2M) splatting framework to interpolate frames efficiently.
M2M has minuscule computational overhead when interpolating arbitrary number of in-between frames.
arXiv Detail & Related papers (2022-04-07T15:29:42Z) - Dual Pixel Exploration: Simultaneous Depth Estimation and Image
Restoration [77.1056200937214]
We study the formation of the DP pair which links the blur and the depth information.
We propose an end-to-end DDDNet (DP-based Depth and De Network) to jointly estimate the depth and restore the image.
arXiv Detail & Related papers (2020-12-01T06:53:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.