Scale-Consistent Fusion: from Heterogeneous Local Sampling to Global
Immersive Rendering
- URL: http://arxiv.org/abs/2106.09548v1
- Date: Thu, 17 Jun 2021 14:27:08 GMT
- Title: Scale-Consistent Fusion: from Heterogeneous Local Sampling to Global
Immersive Rendering
- Authors: Wenpeng Xing, Jie Chen, Zaifeng Yang and Qiang Wang
- Abstract summary: Image-based geometric modeling and novel view synthesis based on sparse, large-baseline samplings are challenging but important tasks for emerging multimedia applications such as virtual reality and immersive telepresence.
With the popularization of commercial light field (LF) cameras, capturing LF images (LFIs) is as convenient as taking regular photos, and geometry information can be reliably inferred.
We propose a novel scale-consistent volume rescaling algorithm that robustly aligns the disparity probability volumes (DPV) among different captures for scale-consistent global geometry fusion.
- Score: 9.893045525907219
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image-based geometric modeling and novel view synthesis based on sparse,
large-baseline samplings are challenging but important tasks for emerging
multimedia applications such as virtual reality and immersive telepresence.
Existing methods fail to produce satisfactory results due to the limitation on
inferring reliable depth information over such challenging reference
conditions. With the popularization of commercial light field (LF) cameras,
capturing LF images (LFIs) is as convenient as taking regular photos, and
geometry information can be reliably inferred. This inspires us to use a sparse
set of LF captures to render high-quality novel views globally. However, fusion
of LF captures from multiple angles is challenging due to the scale
inconsistency caused by various capture settings. To overcome this challenge,
we propose a novel scale-consistent volume rescaling algorithm that robustly
aligns the disparity probability volumes (DPV) among different captures for
scale-consistent global geometry fusion. Based on the fused DPV projected to
the target camera frustum, novel learning-based modules have been proposed
(i.e., the attention-guided multi-scale residual fusion module, and the
disparity field guided deep re-regularization module) which comprehensively
regularize noisy observations from heterogeneous captures for high-quality
rendering of novel LFIs. Both quantitative and qualitative experiments over the
Stanford Lytro Multi-view LF dataset show that the proposed method outperforms
state-of-the-art methods significantly under different experiment settings for
disparity inference and LF synthesis.
Related papers
- MultiDiff: Consistent Novel View Synthesis from a Single Image [60.04215655745264]
MultiDiff is a novel approach for consistent novel view synthesis of scenes from a single RGB image.
Our results demonstrate that MultiDiff outperforms state-of-the-art methods on the challenging, real-world datasets RealEstate10K and ScanNet.
arXiv Detail & Related papers (2024-06-26T17:53:51Z) - Diffusion-based Light Field Synthesis [50.24624071354433]
LFdiff is a diffusion-based generative framework tailored for LF synthesis.
We propose DistgUnet, a disentanglement-based noise estimation network, to harness comprehensive LF representations.
Extensive experiments demonstrate that LFdiff excels in synthesizing visually pleasing and disparity-controllable light fields.
arXiv Detail & Related papers (2024-02-01T13:13:16Z) - LFSRDiff: Light Field Image Super-Resolution via Diffusion Models [18.20217829625834]
Light field (LF) image super-resolution (SR) is a challenging problem due to its inherent ill-posed nature.
mainstream LF image SR methods typically adopt a deterministic approach, generating only a single output supervised by pixel-wise loss functions.
We introduce LFSRDiff, the first diffusion-based LF image SR model, by incorporating the LF disentanglement mechanism.
arXiv Detail & Related papers (2023-11-27T07:31:12Z) - Light Field Diffusion for Single-View Novel View Synthesis [32.59286750410843]
Single-view novel view synthesis (NVS) is important but challenging in computer vision.
Recent advancements in NVS have leveraged Denoising Diffusion Probabilistic Models (DDPMs) for their exceptional ability to produce high-fidelity images.
We present Light Field Diffusion (LFD), a novel conditional diffusion-based approach that transcends the conventional reliance on camera pose matrices.
arXiv Detail & Related papers (2023-09-20T03:27:06Z) - Enhancing Low-light Light Field Images with A Deep Compensation Unfolding Network [52.77569396659629]
This paper presents the deep compensation network unfolding (DCUNet) for restoring light field (LF) images captured under low-light conditions.
The framework uses the intermediate enhanced result to estimate the illumination map, which is then employed in the unfolding process to produce a new enhanced result.
To properly leverage the unique characteristics of LF images, this paper proposes a pseudo-explicit feature interaction module.
arXiv Detail & Related papers (2023-08-10T07:53:06Z) - Light Field Reconstruction via Deep Adaptive Fusion of Hybrid Lenses [67.01164492518481]
This paper explores the problem of reconstructing high-resolution light field (LF) images from hybrid lenses.
We propose a novel end-to-end learning-based approach, which can comprehensively utilize the specific characteristics of the input.
Our framework could potentially decrease the cost of high-resolution LF data acquisition and benefit LF data storage and transmission.
arXiv Detail & Related papers (2021-02-14T06:44:47Z) - Deep Selective Combinatorial Embedding and Consistency Regularization
for Light Field Super-resolution [93.95828097088608]
Light field (LF) images acquired by hand-held devices usually suffer from low spatial resolution.
The high-dimensionality characteristic and complex geometrical structure of LF images make the problem more challenging than traditional single-image SR.
We propose a novel learning-based LF spatial SR framework to explore the coherence among LF sub-aperture images.
Experimental results over both synthetic and real-world LF datasets demonstrate the significant advantage of our approach over state-of-the-art methods.
arXiv Detail & Related papers (2020-09-26T08:34:37Z) - Light Field Spatial Super-resolution via Deep Combinatorial Geometry
Embedding and Structural Consistency Regularization [99.96632216070718]
Light field (LF) images acquired by hand-held devices usually suffer from low spatial resolution.
The high-dimensional spatiality characteristic and complex geometrical structure of LF images make the problem more challenging than traditional single-image SR.
We propose a novel learning-based LF framework, in which each view of an LF image is first individually super-resolved.
arXiv Detail & Related papers (2020-04-05T14:39:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.