Learning When and Where to Zoom with Deep Reinforcement Learning
- URL: http://arxiv.org/abs/2003.00425v2
- Date: Mon, 20 Apr 2020 18:25:16 GMT
- Title: Learning When and Where to Zoom with Deep Reinforcement Learning
- Authors: Burak Uzkent, Stefano Ermon
- Abstract summary: We propose a reinforcement learning approach to identify when and where to use/acquire high resolution data conditioned on paired, cheap, low resolution images.
We conduct experiments on CIFAR10, CIFAR100, ImageNet and fMoW datasets where we use significantly less high resolution data while maintaining similar accuracy to models which use full high resolution images.
- Score: 101.79271767464947
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While high resolution images contain semantically more useful information
than their lower resolution counterparts, processing them is computationally
more expensive, and in some applications, e.g. remote sensing, they can be much
more expensive to acquire. For these reasons, it is desirable to develop an
automatic method to selectively use high resolution data when necessary while
maintaining accuracy and reducing acquisition/run-time cost. In this direction,
we propose PatchDrop, a reinforcement learning approach to dynamically identify
when and where to use/acquire high resolution data conditioned on the paired,
cheap, low resolution images. We conduct experiments on CIFAR10, CIFAR100,
ImageNet and fMoW datasets where we use significantly less high resolution data
while maintaining similar accuracy to models which use full high resolution
images.
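The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a policy has already turned the low resolution image into one keep probability per high resolution patch, samples a binary keep/drop mask from those probabilities, and blanks the dropped patches. The function names, the square patch grid, and the reward shape (including `cost_weight`) are hypothetical and chosen only for illustration.

```python
import random


def sample_patch_mask(keep_probs, rng):
    """Sample one keep (1) / drop (0) decision per patch from Bernoulli probabilities."""
    return [1 if rng.random() < p else 0 for p in keep_probs]


def apply_patch_mask(image, mask, grid):
    """Return a copy of a square image (list of rows) with dropped patches set to 0.

    The image is split into grid x grid equally sized patches, indexed row by row.
    """
    patch = len(image) // grid
    out = [row[:] for row in image]  # leave the input image untouched
    for idx, keep in enumerate(mask):
        if keep:
            continue
        r0, c0 = (idx // grid) * patch, (idx % grid) * patch
        for r in range(r0, r0 + patch):
            for c in range(c0, c0 + patch):
                out[r][c] = 0
    return out


def reward(correct, mask, cost_weight=0.5):
    """Hypothetical reward: 1 for a correct prediction, minus a cost for patches used."""
    used_fraction = sum(mask) / len(mask)
    return (1.0 if correct else 0.0) - cost_weight * used_fraction


if __name__ == "__main__":
    rng = random.Random(0)
    high_res = [[1] * 4 for _ in range(4)]   # toy 4x4 "high resolution" image
    keep_probs = [0.9, 0.1, 0.1, 0.9]        # assumed output of a policy network
    mask = sample_patch_mask(keep_probs, rng)
    masked = apply_patch_mask(high_res, mask, grid=2)
    print(mask, masked, reward(True, mask))
```

In a reinforcement learning setup, a reward of this kind would be fed to a policy-gradient update so that the policy learns to keep only the patches the classifier needs; that training loop is omitted here.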
Related papers
- Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation [53.95204595640208]
Data-Free Knowledge Distillation (DFKD) is an advanced technique that enables knowledge transfer from a teacher model to a student model without relying on original training data.
Previous approaches have generated synthetic images at high resolutions without leveraging information from real images.
MUSE generates images at lower resolutions while using Class Activation Maps (CAMs) to ensure that the generated images retain critical, class-specific features.
arXiv Detail & Related papers (2024-11-26T02:23:31Z)
- Supersampling of Data from Structured-light Scanner with Deep Learning [1.6385815610837167]
Two deep learning models FDSR and DKN are modified to work with high-resolution data.
The resulting high-resolution depth maps are evaluated using qualitative and quantitative metrics.
arXiv Detail & Related papers (2023-11-13T16:04:41Z)
- Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection [68.65338791283298]
Salient Object Detection (SOD) aims to identify and segment the most conspicuous objects in an image or video.
Traditional SOD methods are largely limited to low-resolution images, making them difficult to adapt to the development of High-Resolution SOD.
In this work, we first propose a new HRS10K dataset, which contains 10,500 high-quality annotated images at 2K-8K resolution.
arXiv Detail & Related papers (2023-08-07T17:49:04Z)
- Efficient High-Resolution Deep Learning: A Survey [90.76576712433595]
Cameras in modern devices such as smartphones, satellites and medical equipment are capable of capturing very high resolution images and videos.
Such high-resolution data often need to be processed by deep learning models for cancer detection, automated road navigation, weather prediction, surveillance, optimizing agricultural processes and many other applications.
Using high-resolution images and videos as direct inputs for deep learning models creates many challenges due to their high number of parameters, computation cost, inference latency and GPU memory consumption.
Several works in the literature propose better alternatives to deal with the challenges of high-resolution data and improve accuracy and speed while complying with hardware limitations.
arXiv Detail & Related papers (2022-07-26T17:13:53Z)
- Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting [49.33891486324731]
We propose a novel cost-efficient Dynamic Low-resolution Distillation (DLD) text spotting framework.
It aims to infer images in different small but recognizable resolutions and achieve a better balance between accuracy and efficiency.
The proposed method can be optimized end-to-end and adopted in any current text spotting framework to improve the practicability.
arXiv Detail & Related papers (2022-07-14T06:49:59Z)
- A new public Alsat-2B dataset for single-image super-resolution [1.284647943889634]
The paper introduces a novel public remote sensing dataset (Alsat2B) of low and high spatial resolution images (10m and 2.5m respectively) for the single-image super-resolution task.
The high-resolution images are obtained through pan-sharpening.
The obtained results reveal that the proposed scheme is promising and highlight the challenges in the dataset.
arXiv Detail & Related papers (2021-03-21T10:47:38Z)
- Efficient Poverty Mapping using Deep Reinforcement Learning [75.6332944247741]
High-resolution satellite imagery and machine learning have proven useful in many sustainability-related tasks.
The accuracy afforded by high-resolution imagery comes at a cost, as such imagery is extremely expensive to purchase at scale.
We propose a reinforcement learning approach in which free low-resolution imagery is used to dynamically identify where to acquire costly high-resolution images.
arXiv Detail & Related papers (2020-06-07T18:30:57Z)
- ImagePairs: Realistic Super Resolution Dataset via Beam Splitter Camera Rig [13.925480922578869]
We propose a new data acquisition technique for gathering a real image dataset.
We use a beam-splitter to capture the same scene with a low resolution camera and a high resolution camera.
Unlike the current small-scale datasets used for these tasks, our proposed dataset includes 11,421 pairs of low-resolution and high-resolution images.
arXiv Detail & Related papers (2020-04-18T03:06:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.