Large-scale Global Low-rank Optimization for Computational Compressed
Imaging
- URL: http://arxiv.org/abs/2301.03047v1
- Date: Sun, 8 Jan 2023 14:12:51 GMT
- Title: Large-scale Global Low-rank Optimization for Computational Compressed
Imaging
- Authors: Daoyu Li, Hanwen Xu, Miao Cao, Xin Yuan, David J. Brady, and Liheng
Bian
- Abstract summary: We present the global low-rank (GLR) optimization technique, realizing highly-efficient large-scale reconstruction with global self-similarity.
Inspired by the self-attention mechanism in deep learning, GLR extracts image patches by feature detection instead of conventional uniform selection.
We experimentally demonstrate GLR's effectiveness on temporal, frequency, and spectral dimensions.
- Score: 8.594666859332124
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Computational reconstruction plays a vital role in computer vision and
computational photography. Most of the conventional optimization and deep
learning techniques explore local information for reconstruction. Recently,
nonlocal low-rank (NLR) reconstruction has achieved remarkable success in
improving accuracy and generalization. However, the computational cost has
inhibited NLR from seeking global structural similarity, which consequentially
keeps it trapped in the tradeoff between accuracy and efficiency and prevents
it from high-dimensional large-scale tasks. To address this challenge, we
report here the global low-rank (GLR) optimization technique, realizing
highly-efficient large-scale reconstruction with global self-similarity.
Inspired by the self-attention mechanism in deep learning, GLR extracts
exemplar image patches by feature detection instead of conventional uniform
selection. This directly produces key patches using structural features to
avoid burdensome computational redundancy. Further, it performs patch matching
across the entire image via neural-based convolution, which produces the global
similarity heat map in parallel, rather than conventional sequential block-wise
matching. As such, GLR improves patch grouping efficiency by more than one
order of magnitude. We experimentally demonstrate GLR's effectiveness on
temporal, frequency, and spectral dimensions, including different computational
imaging modalities of compressive temporal imaging, magnetic resonance imaging,
and multispectral filter array demosaicing. This work presents the superiority
of inherent fusion of deep learning strategies and iterative optimization, and
breaks the persistent dilemma of the tradeoff between accuracy and efficiency
for various large-scale reconstruction tasks.
Related papers
- LeRF: Learning Resampling Function for Adaptive and Efficient Image Interpolation [64.34935748707673]
Recent deep neural networks (DNNs) have made impressive progress in performance by introducing learned data priors.
We propose a novel method of Learning Resampling (termed LeRF) which takes advantage of both the structural priors learned by DNNs and the locally continuous assumption.
LeRF assigns spatially varying resampling functions to input image pixels and learns to predict the shapes of these resampling functions with a neural network.
arXiv Detail & Related papers (2024-07-13T16:09:45Z) - Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution [80.85121353651554]
We introduce kernel-wise differential operations within the convolutional kernel and develop several learnable directional gradient convolutions.
These convolutions are integrated in parallel with a novel linear weighting mechanism to form an Adaptive Directional Gradient Convolution (DGConv)
We further devise an Adaptive Information Interaction Block (AIIBlock) to adeptly balance the enhancement of texture and contrast while meticulously investigating the interdependencies, culminating in the creation of a DGPNet for Real-SR through simple stacking.
arXiv Detail & Related papers (2024-05-11T14:21:40Z) - Efficient Model Agnostic Approach for Implicit Neural Representation
Based Arbitrary-Scale Image Super-Resolution [5.704360536038803]
Single image super-resolution (SISR) has experienced significant advancements, primarily driven by deep convolutional networks.
Traditional networks are limited to upscaling images to a fixed scale, leading to the utilization of implicit neural functions for generating arbitrarily scaled images.
We introduce a novel and efficient framework, the Mixture of Experts Implicit Super-Resolution (MoEISR), which enables super-resolution at arbitrary scales.
arXiv Detail & Related papers (2023-11-20T05:34:36Z) - DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral
Diffusion Model [18.25548360119976]
This paper endeavors to advance the precision of snapshot compressive imaging (SCI) reconstruction for multispectral image (MSI)
We propose a novel structured zero-shot diffusion model, dubbed DiffSCI.
We present extensive testing to show that DiffSCI exhibits discernible performance enhancements over prevailing self-supervised and zero-shot approaches.
arXiv Detail & Related papers (2023-11-19T20:27:14Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - SST-ReversibleNet: Reversible-prior-based Spectral-Spatial Transformer
for Efficient Hyperspectral Image Reconstruction [15.233185887461826]
A novel framework called the reversible-prior-based method is proposed.
ReversibleNet significantly outperforms state-of-the-art methods on simulated and real HSI datasets.
arXiv Detail & Related papers (2023-05-06T14:01:02Z) - Reconstruction of compressed spectral imaging based on global structure
and spectral correlation [17.35611893815407]
The proposed method uses the convolution kernel to operate the global image.
To solve the problem that convolutional sparse coding is insensitive to low frequency, the global total-variation (TV) constraint is added.
The proposed method improves the reconstruction quality by up to 7 dB in PSNR and 10% in SSIM.
arXiv Detail & Related papers (2022-10-27T14:31:02Z) - Loop Unrolled Shallow Equilibrium Regularizer (LUSER) -- A
Memory-Efficient Inverse Problem Solver [26.87738024952936]
In inverse problems we aim to reconstruct some underlying signal of interest from potentially corrupted and often ill-posed measurements.
We propose an LU algorithm with shallow equilibrium regularizers (L)
These implicit models are as expressive as deeper convolutional networks, but far more memory efficient during training.
arXiv Detail & Related papers (2022-10-10T19:50:37Z) - Effective Invertible Arbitrary Image Rescaling [77.46732646918936]
Invertible Neural Networks (INN) are able to increase upscaling accuracy significantly by optimizing the downscaling and upscaling cycle jointly.
A simple and effective invertible arbitrary rescaling network (IARN) is proposed to achieve arbitrary image rescaling by training only one model in this work.
It is shown to achieve a state-of-the-art (SOTA) performance in bidirectional arbitrary rescaling without compromising perceptual quality in LR outputs.
arXiv Detail & Related papers (2022-09-26T22:22:30Z) - FasterPose: A Faster Simple Baseline for Human Pose Estimation [65.8413964785972]
We propose a design paradigm for cost-effective network with LR representation for efficient pose estimation, named FasterPose.
We study the training behavior of FasterPose, and formulate a novel regressive cross-entropy (RCE) loss function for accelerating the convergence.
Compared with the previously dominant network of pose estimation, our method reduces 58% of the FLOPs and simultaneously gains 1.3% improvement of accuracy.
arXiv Detail & Related papers (2021-07-07T13:39:08Z) - Kullback-Leibler Divergence-Based Fuzzy $C$-Means Clustering
Incorporating Morphological Reconstruction and Wavelet Frames for Image
Segmentation [152.609322951917]
We come up with a Kullback-Leibler (KL) divergence-based Fuzzy C-Means (FCM) algorithm by incorporating a tight wavelet frame transform and a morphological reconstruction operation.
The proposed algorithm works well and comes with better segmentation performance than other comparative algorithms.
arXiv Detail & Related papers (2020-02-21T05:19:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.