SphereSR: 360{\deg} Image Super-Resolution with Arbitrary Projection via
Continuous Spherical Image Representation
- URL: http://arxiv.org/abs/2112.06536v2
- Date: Tue, 14 Dec 2021 04:40:16 GMT
- Title: SphereSR: 360{\deg} Image Super-Resolution with Arbitrary Projection via
Continuous Spherical Image Representation
- Authors: Youngho Yoon, Inchul Chung, Lin Wang, and Kuk-Jin Yoon
- Abstract summary: We propose a novel framework to generate a continuous spherical image representation from an LR 360degimage.
Specifically, we first propose a feature extraction module that represents the spherical data based on icosahedron.
We then propose a spherical local implicit image function (SLIIF) to predict RGB values at the spherical coordinates.
- Score: 27.10716804733828
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The 360{\deg}imaging has recently gained great attention; however, its
angular resolution is relatively lower than that of a narrow field-of-view
(FOV) perspective image as it is captured by using fisheye lenses with the same
sensor size. Therefore, it is beneficial to super-resolve a 360{\deg}image.
Some attempts have been made but mostly considered the equirectangular
projection (ERP) as one of the way for 360{\deg}image representation despite of
latitude-dependent distortions. In that case, as the output high-resolution(HR)
image is always in the same ERP format as the low-resolution (LR) input,
another information loss may occur when transforming the HR image to other
projection types. In this paper, we propose SphereSR, a novel framework to
generate a continuous spherical image representation from an LR 360{\deg}image,
aiming at predicting the RGB values at given spherical coordinates for
super-resolution with an arbitrary 360{\deg}image projection. Specifically, we
first propose a feature extraction module that represents the spherical data
based on icosahedron and efficiently extracts features on the spherical
surface. We then propose a spherical local implicit image function (SLIIF) to
predict RGB values at the spherical coordinates. As such, SphereSR flexibly
reconstructs an HR image under an arbitrary projection type. Experiments on
various benchmark datasets show that our method significantly surpasses
existing methods.
Related papers
- GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views [67.34073368933814]
We propose a generalizable Gaussian Splatting approach for high-resolution image rendering under a sparse-view camera setting.
We train our Gaussian parameter regression module on human-only data or human-scene data, jointly with a depth estimation module to lift 2D parameter maps to 3D space.
Experiments on several datasets demonstrate that our method outperforms state-of-the-art methods while achieving an exceeding rendering speed.
arXiv Detail & Related papers (2024-11-18T08:18:44Z) - ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings [48.72040500647568]
We present ODGS, a novelization pipeline for omnidirectional images, with geometric interpretation.
The entire pipeline is parallelized using, achieving optimization and speeds 100 times faster than NeRF-based methods.
Results show ODGS restores fine details effectively, even when reconstructing large 3D scenes.
arXiv Detail & Related papers (2024-10-28T02:45:13Z) - Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective [33.85582959047852]
We propose an oriented distortion-aware Gabor Fusion framework (PGFuse) to address the above challenges.
To address the reintroduced distortions, we design a linear latitude-aware distortion representation method to generate customized, distortion-aware Gabor filters.
Considering the orientation sensitivity of the Gabor transform, we introduce a spherical gradient constraint to stabilize this sensitivity.
arXiv Detail & Related papers (2024-08-29T02:58:35Z) - Generative Multiplane Neural Radiance for 3D-Aware Image Generation [102.15322193381617]
We present a method to efficiently generate 3D-aware high-resolution images that are view-consistent across multiple target views.
Our GMNR model generates 3D-aware images of 1024 X 1024 pixels with 17.6 FPS on a single V100.
arXiv Detail & Related papers (2023-04-03T17:41:20Z) - $PC^2$: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D
Reconstruction [97.06927852165464]
Reconstructing the 3D shape of an object from a single RGB image is a long-standing and highly challenging problem in computer vision.
We propose a novel method for single-image 3D reconstruction which generates a sparse point cloud via a conditional denoising diffusion process.
arXiv Detail & Related papers (2023-02-21T13:37:07Z) - VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for
Analysis-by-Synthesis [62.47221232706105]
We propose VoGE, which utilizes the Gaussian reconstruction kernels as volumetric primitives.
To efficiently render via VoGE, we propose an approximate closeform solution for the volume density aggregation and a coarse-to-fine rendering strategy.
VoGE outperforms SoTA when applied to various vision tasks, e.g., object pose estimation, shape/texture fitting, and reasoning.
arXiv Detail & Related papers (2022-05-30T19:52:11Z) - Field-of-View IoU for Object Detection in 360{\deg} Images [36.72543749626039]
We propose two fundamental techniques -- Field-of-View IoU (FoV-IoU) and 360Augmentation for object detection in 360deg images.
FoV-IoU computes the intersection-over-union of two Field-of-View bounding boxes in a spherical image which could be used for training, inference, and evaluation.
360Augmentation is a data augmentation technique specific to 360deg object detection task which randomly rotates a spherical image and solves the bias due to the sphere-to-plane projection.
arXiv Detail & Related papers (2022-02-07T14:01:59Z) - 360{\deg} Optical Flow using Tangent Images [18.146747748702513]
equirectangular projection (ERP) is the most common format for storing, processing and visualising 360deg images.
We propose a 360deg optical flow method based on tangent images.
arXiv Detail & Related papers (2021-12-28T23:50:46Z) - Applying VertexShuffle Toward 360-Degree Video Super-Resolution on
Focused-Icosahedral-Mesh [10.29596292902288]
We exploit Focused Icosahedral Mesh to represent a small area and construct matrices to rotate spherical content to the focused mesh area.
We also proposed a novel VertexShuffle operation that can significantly improve both the performance and the efficiency.
Our proposed spherical super-resolution model achieves significant benefits in terms of both performance and inference time.
arXiv Detail & Related papers (2021-06-21T16:53:57Z) - Extreme Rotation Estimation using Dense Correlation Volumes [73.35119461422153]
We present a technique for estimating the relative 3D rotation of an RGB image pair in an extreme setting.
We observe that, even when images do not overlap, there may be rich hidden cues as to their geometric relationship.
We propose a network design that can automatically learn such implicit cues by comparing all pairs of points between the two input images.
arXiv Detail & Related papers (2021-04-28T02:00:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.