A Novel Light Field Coding Scheme Based on Deep Belief Network &
Weighted Binary Images for Additive Layered Displays
- URL: http://arxiv.org/abs/2210.01447v2
- Date: Fri, 21 Apr 2023 14:59:26 GMT
- Title: A Novel Light Field Coding Scheme Based on Deep Belief Network &
Weighted Binary Images for Additive Layered Displays
- Authors: Sally Khaidem and Mansi Sharma
- Abstract summary: Stacking light attenuating layers is one approach to implement a light field display with a broader depth of field, wide viewing angles and high resolution.
This paper proposes a novel framework for light field representation and coding that utilizes Deep Belief Network (DBN) and weighted binary images.
- Score: 0.30458514384586394
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Light-field displays create an immersive experience by providing binocular
depth sensation and motion parallax. Stacking light attenuating layers is one
approach to implement a light field display with a broader depth of field, wide
viewing angles and high resolution. Due to the transparent holographic optical
element (HOE) layers, additive layered displays can be integrated into
augmented reality (AR) wearables to overlay virtual objects onto the real
world, creating a seamless mixed reality (XR) experience. This paper proposes a
novel framework for light field representation and coding that utilizes Deep
Belief Network (DBN) and weighted binary images suitable for additive layered
displays. The weighted binary representation of layers makes the framework more
flexible for adaptive bitrate encoding. The framework effectively captures
intrinsic redundancies in the light field data, and thus provides a scalable
solution for light field coding suitable for XR display applications. The
latent code is encoded by H.265 codec generating a rate-scalable bit-stream. We
achieve adaptive bitrate decoding by varying the number of weighted binary
images and the H.265 quantization parameter, while maintaining an optimal
reconstruction quality. The framework is tested on real and synthetic benchmark
datasets, and the results validate the rate-scalable property of the proposed
scheme.
Related papers
- CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement [97.95330185793358]
Low-light image enhancement (LLIE) aims to improve low-illumination images.
Existing methods face two challenges: uncertainty in restoration from diverse brightness degradations and loss of texture and color information.
We propose a novel enhancement approach, CodeEnhance, by leveraging quantized priors and image refinement.
arXiv Detail & Related papers (2024-04-08T07:34:39Z) - Enhancing Low-light Light Field Images with A Deep Compensation Unfolding Network [52.77569396659629]
This paper presents the deep compensation network unfolding (DCUNet) for restoring light field (LF) images captured under low-light conditions.
The framework uses the intermediate enhanced result to estimate the illumination map, which is then employed in the unfolding process to produce a new enhanced result.
To properly leverage the unique characteristics of LF images, this paper proposes a pseudo-explicit feature interaction module.
arXiv Detail & Related papers (2023-08-10T07:53:06Z) - Learning Kernel-Modulated Neural Representation for Efficient Light
Field Compression [41.24757573290883]
We design a compact neural network representation for the light field compression task.
It is composed of two types of complementary kernels: descriptive kernels (descriptors) that store scene description information learned during training, and modulatory kernels (modulators) that control the rendering of different SAIs from the queried perspectives.
arXiv Detail & Related papers (2023-07-12T12:58:03Z) - Spatiotemporally Consistent HDR Indoor Lighting Estimation [66.26786775252592]
We propose a physically-motivated deep learning framework to solve the indoor lighting estimation problem.
Given a single LDR image with a depth map, our method predicts spatially consistent lighting at any given image position.
Our framework achieves photorealistic lighting prediction with higher quality compared to state-of-the-art single-image or video-based methods.
arXiv Detail & Related papers (2023-05-07T20:36:29Z) - Scattering-induced entropy boost for highly-compressed optical sensing and encryption [7.502671257653539]
Image sensing often relies on a high-quality machine vision system with a large field of view and high resolution.
We propose a novel image-free sensing framework for resource-efficient image classification.
The proposed framework is shown to obtain over a 95% accuracy at sampling rates of 1% and 5% for classification on the MNIST dataset.
arXiv Detail & Related papers (2022-12-16T09:00:42Z) - RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis [104.53930611219654]
We present a large-scale synthetic dataset for novel view synthesis consisting of 300k images rendered from nearly 2000 complex scenes.
The dataset is orders of magnitude larger than existing synthetic datasets for novel view synthesis.
Using 4 distinct sources of high-quality 3D meshes, the scenes of our dataset exhibit challenging variations in camera views, lighting, shape, materials, and textures.
arXiv Detail & Related papers (2022-05-14T13:15:32Z) - Neural Étendue Expander for Ultra-Wide-Angle High-Fidelity Holographic Display [51.399291206537384]
Modern holographic displays possess low 'etendue, which is the product of the display area and the maximum solid angle of diffracted light.
We present neural 'etendue expanders, which are learned from a natural image dataset.
With neural 'etendue expanders, we experimentally achieve 64$times$ 'etendue expansion of natural images in full color, expanding the FOV by an order of magnitude horizontally and vertically.
arXiv Detail & Related papers (2021-09-16T17:21:52Z) - A Novel Hierarchical Light Field Coding Scheme Based on Hybrid Stacked
Multiplicative Layers and Fourier Disparity Layers for Glasses-Free 3D
Displays [0.6091702876917279]
We present a novel hierarchical coding scheme for light fields based on transmittance patterns of low-rank multiplicative layers and Fourier disparity layers.
The proposed scheme identifies multiplicative layers of light field view subsets optimized using a convolutional neural network for different scanning orders.
arXiv Detail & Related papers (2021-08-27T17:09:29Z) - DeepLight: Robust & Unobtrusive Real-time Screen-Camera Communication
for Real-World Displays [4.632704227272501]
DeepLight is a system that incorporates machine learning (ML) models in the decoding pipeline to achieve humanly-imperceptible, moderately high SCC rates.
DeepLight's key innovation is the design of a Deep Neural Network (DNN) based decoder that collectively decodes all the bits spatially encoded in a display frame.
arXiv Detail & Related papers (2021-05-11T14:44:12Z) - Generative Modelling of BRDF Textures from Flash Images [50.660026124025265]
We learn a latent space for easy capture, semantic editing, consistent, and efficient reproduction of visual material appearance.
In a second step, conditioned on the material code, our method produces an infinite and diverse spatial field of BRDF model parameters.
arXiv Detail & Related papers (2021-02-23T18:45:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.