When to Use Convolutional Neural Networks for Inverse Problems
- URL: http://arxiv.org/abs/2003.13820v1
- Date: Mon, 30 Mar 2020 21:08:14 GMT
- Title: When to Use Convolutional Neural Networks for Inverse Problems
- Authors: Nathaniel Chodosh, Simon Lucey
- Abstract summary: We show how a convolutional neural network can be viewed as an approximate solution to a convolutional sparse coding problem.
We argue that for some types of inverse problems the CNN approximation breaks down leading to poor performance.
Specifically we identify JPEG artifact reduction and non-rigid trajectory reconstruction as challenging inverse problems for CNNs.
- Score: 40.60063929073102
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reconstruction tasks in computer vision aim fundamentally to recover an
undetermined signal from a set of noisy measurements. Examples include
super-resolution, image denoising, and non-rigid structure from motion, all of
which have seen recent advancements through deep learning. However, earlier
work made extensive use of sparse signal reconstruction frameworks (e.g
convolutional sparse coding). While this work was ultimately surpassed by deep
learning, it rested on a much more developed theoretical framework. Recent work
by Papyan et. al provides a bridge between the two approaches by showing how a
convolutional neural network (CNN) can be viewed as an approximate solution to
a convolutional sparse coding (CSC) problem. In this work we argue that for
some types of inverse problems the CNN approximation breaks down leading to
poor performance. We argue that for these types of problems the CSC approach
should be used instead and validate this argument with empirical evidence.
Specifically we identify JPEG artifact reduction and non-rigid trajectory
reconstruction as challenging inverse problems for CNNs and demonstrate state
of the art performance on them using a CSC method. Furthermore, we offer some
practical improvements to this model and its application, and also show how
insights from the CSC model can be used to make CNNs effective in tasks where
their naive application fails.
Related papers
- Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Transferability of Convolutional Neural Networks in Stationary Learning
Tasks [96.00428692404354]
We introduce a novel framework for efficient training of convolutional neural networks (CNNs) for large-scale spatial problems.
We show that a CNN trained on small windows of such signals achieves a nearly performance on much larger windows without retraining.
Our results show that the CNN is able to tackle problems with many hundreds of agents after being trained with fewer than ten.
arXiv Detail & Related papers (2023-07-21T13:51:45Z) - Solving Large-scale Spatial Problems with Convolutional Neural Networks [88.31876586547848]
We employ transfer learning to improve training efficiency for large-scale spatial problems.
We propose that a convolutional neural network (CNN) can be trained on small windows of signals, but evaluated on arbitrarily large signals with little to no performance degradation.
arXiv Detail & Related papers (2023-06-14T01:24:42Z) - FrequencyLowCut Pooling -- Plug & Play against Catastrophic Overfitting [12.062691258844628]
This paper introduces an aliasing free down-sampling operation which can easily be plugged into any CNN architecture.
Our experiments show, that in combination with simple and fast FGSM adversarial training, our hyper- parameter free operator significantly improves model robustness.
arXiv Detail & Related papers (2022-04-01T14:51:28Z) - Image Restoration by Deep Projected GSURE [115.57142046076164]
Ill-posed inverse problems appear in many image processing applications, such as deblurring and super-resolution.
We propose a new image restoration framework that is based on minimizing a loss function that includes a "projected-version" of the Generalized SteinUnbiased Risk Estimator (GSURE) and parameterization of the latent image by a CNN.
arXiv Detail & Related papers (2021-02-04T08:52:46Z) - An End-To-End-Trainable Iterative Network Architecture for Accelerated
Radial Multi-Coil 2D Cine MR Image Reconstruction [4.233498905999929]
We propose a CNN-architecture for image reconstruction of accelerated 2D radial cine MRI with multiple receiver coils.
We investigate the proposed training-strategy and compare our method to other well-known reconstruction techniques with learned and non-learned regularization methods.
arXiv Detail & Related papers (2021-02-01T11:42:04Z) - Spatio-Temporal Inception Graph Convolutional Networks for
Skeleton-Based Action Recognition [126.51241919472356]
We design a simple and highly modularized graph convolutional network architecture for skeleton-based action recognition.
Our network is constructed by repeating a building block that aggregates multi-granularity information from both the spatial and temporal paths.
arXiv Detail & Related papers (2020-11-26T14:43:04Z) - Learning Multiscale Convolutional Dictionaries for Image Reconstruction [28.27195303823472]
Convolutional neural networks (CNNs) have been tremendously successful in solving imaging inverse problems.
Existing convolutional sparse coding (CSC) models underperform leading CNNs in challenging inverse problems.
We propose a multiscale convolutional dictionary structure to close the performance gap.
We show that incorporating the proposed multiscale dictionary in an otherwise standard CSC framework yields performance competitive with state-of-the-art CNNs.
arXiv Detail & Related papers (2020-11-25T15:18:00Z) - On the unreasonable effectiveness of CNNs [7.673853485227739]
Deep learning methods using convolutional neural networks (CNN) have been successfully applied to virtually all imaging problems.
In an attempt to put upper bounds on the capability of baseline CNNs for solving image-to-image problems we applied a widely used standard off-the-shelf network architecture (U-Net) to the "inverse problem" of XOR decryption from noisy data and show acceptable results.
arXiv Detail & Related papers (2020-07-29T11:16:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.