Efficient High-Resolution Deep Learning: A Survey
- URL: http://arxiv.org/abs/2207.13050v1
- Date: Tue, 26 Jul 2022 17:13:53 GMT
- Title: Efficient High-Resolution Deep Learning: A Survey
- Authors: Arian Bakhtiarnia, Qi Zhang and Alexandros Iosifidis
- Abstract summary: Cameras in modern devices such as smartphones, satellites and medical equipment are capable of capturing very high resolution images and videos.
Such high-resolution data often need to be processed by deep learning models for cancer detection, automated road navigation, weather prediction, surveillance, optimizing agricultural processes and many other applications.
Using high-resolution images and videos as direct inputs for deep learning models creates many challenges due to their high number of parameters, computation cost, inference latency and GPU memory consumption.
Several works in the literature propose better alternatives in order to deal with the challenges of high-resolution data and improve accuracy and speed while complying with hardware limitations
- Score: 90.76576712433595
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cameras in modern devices such as smartphones, satellites and medical
equipment are capable of capturing very high resolution images and videos. Such
high-resolution data often need to be processed by deep learning models for
cancer detection, automated road navigation, weather prediction, surveillance,
optimizing agricultural processes and many other applications. Using
high-resolution images and videos as direct inputs for deep learning models
creates many challenges due to their high number of parameters, computation
cost, inference latency and GPU memory consumption. Simple approaches such as
resizing the images to a lower resolution are common in the literature,
however, they typically significantly decrease accuracy. Several works in the
literature propose better alternatives in order to deal with the challenges of
high-resolution data and improve accuracy and speed while complying with
hardware limitations and time restrictions. This survey describes such
efficient high-resolution deep learning methods, summarizes real-world
applications of high-resolution deep learning, and provides comprehensive
information about available high-resolution datasets.
Related papers
- Recurrent Multi-scale Transformer for High-Resolution Salient Object
Detection [68.65338791283298]
Salient Object Detection (SOD) aims to identify and segment the most conspicuous objects in an image or video.
Traditional SOD methods are largely limited to low-resolution images, making them difficult to adapt to the development of High-Resolution SOD.
In this work, we first propose a new HRS10K dataset, which contains 10,500 high-quality annotated images at 2K-8K resolution.
arXiv Detail & Related papers (2023-08-07T17:49:04Z) - Visual Place Recognition with Low-Resolution Images [17.847661026367767]
We analyse the effects of image resolution on the accuracy and robustness of handcrafted Visual Place Recognition pipelines.
This paper aims to help academic researchers and companies in the hardware and software industry co-design VPR solutions and expand the use of VPR algorithms in commercial products.
arXiv Detail & Related papers (2023-05-09T21:34:38Z) - Semi-Supervised Super-Resolution [30.872605139672086]
Super-Resolution is the technique to improve the quality of a low-resolution photo by boosting its plausible resolution.
Previous Super-Resolution methods require vast amounts of data for training which becomes problematic in domains where very few low-resolution, high-resolution pairs might be available.
This paper introduces a semi-supervised way to perform super-resolution that can generate sharp, high-resolution images with as few as 500 paired examples.
arXiv Detail & Related papers (2022-04-18T07:10:29Z) - Towards Unpaired Depth Enhancement and Super-Resolution in the Wild [121.96527719530305]
State-of-the-art data-driven methods of depth map super-resolution rely on registered pairs of low- and high-resolution depth maps of the same scenes.
We consider an approach to depth map enhancement based on learning from unpaired data.
arXiv Detail & Related papers (2021-05-25T16:19:16Z) - A new public Alsat-2B dataset for single-image super-resolution [1.284647943889634]
The paper introduces a novel public remote sensing dataset (Alsat2B) of low and high spatial resolution images (10m and 2.5m respectively) for the single-image super-resolution task.
The high-resolution images are obtained through pan-sharpening.
The obtained results reveal that the proposed scheme is promising and highlight the challenges in the dataset.
arXiv Detail & Related papers (2021-03-21T10:47:38Z) - Interpretable Hyperspectral AI: When Non-Convex Modeling meets
Hyperspectral Remote Sensing [57.52865154829273]
Hyperspectral imaging, also known as image spectrometry, is a landmark technique in geoscience remote sensing (RS)
In the past decade efforts have been made to process analyze these hyperspectral (HS) products mainly by means of seasoned experts.
For this reason, it is urgent to develop more intelligent and automatic approaches for various HS RS applications.
arXiv Detail & Related papers (2021-03-02T03:32:10Z) - Exploiting Raw Images for Real-Scene Super-Resolution [105.18021110372133]
We study the problem of real-scene single image super-resolution to bridge the gap between synthetic data and real captured images.
We propose a method to generate more realistic training data by mimicking the imaging process of digital cameras.
We also develop a two-branch convolutional neural network to exploit the radiance information originally-recorded in raw images.
arXiv Detail & Related papers (2021-02-02T16:10:15Z) - ImagePairs: Realistic Super Resolution Dataset via Beam Splitter Camera
Rig [13.925480922578869]
We propose a new data acquisition technique for gathering real image data set.
We use a beam-splitter to capture the same scene by a low resolution camera and a high resolution camera.
Unlike current small-scale dataset used for these tasks, our proposed dataset includes 11,421 pairs of low-resolution high-resolution images.
arXiv Detail & Related papers (2020-04-18T03:06:41Z) - Learning When and Where to Zoom with Deep Reinforcement Learning [101.79271767464947]
We propose a reinforcement learning approach to identify when and where to use/acquire high resolution data conditioned on paired, cheap, low resolution images.
We conduct experiments on CIFAR10, CIFAR100, ImageNet and fMoW datasets where we use significantly less high resolution data while maintaining similar accuracy to models which use full high resolution images.
arXiv Detail & Related papers (2020-03-01T07:16:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.