Resolution Invariant Autoencoder
- URL: http://arxiv.org/abs/2503.09828v1
- Date: Wed, 12 Mar 2025 20:47:19 GMT
- Title: Resolution Invariant Autoencoder
- Authors: Ashay Patel, Michela Antonelli, Sebastien Ourselin, M. Jorge Cardoso,
- Abstract summary: We introduce a resolution-invariant autoencoder that adapts spatial resizing at each layer in the network via a learned variable resizing process.<n>This ensures a consistent latent space resolution, regardless of input or output resolution.<n>We demonstrate its effectiveness in uncertainty-aware super-resolution, classification, and generative modelling tasks.
- Score: 1.149124112513873
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning has significantly advanced medical imaging analysis, yet variations in image resolution remain an overlooked challenge. Most methods address this by resampling images, leading to either information loss or computational inefficiencies. While solutions exist for specific tasks, no unified approach has been proposed. We introduce a resolution-invariant autoencoder that adapts spatial resizing at each layer in the network via a learned variable resizing process, replacing fixed spatial down/upsampling at the traditional factor of 2. This ensures a consistent latent space resolution, regardless of input or output resolution. Our model enables various downstream tasks to be performed on an image latent whilst maintaining performance across different resolutions, overcoming the shortfalls of traditional methods. We demonstrate its effectiveness in uncertainty-aware super-resolution, classification, and generative modelling tasks and show how our method outperforms conventional baselines with minimal performance loss across resolutions.
Related papers
- Elastic-DETR: Making Image Resolution Learnable with Content-Specific Network Prediction [0.612477318852572]
We introduce a novel strategy for learnable resolution, called Elastic-DETR, enabling elastic utilization of multiple image resolutions.<n>Our network provides an adaptive scale factor based on the content of the image with a compact scale prediction module.<n>By leveraging the resolution's flexibility, we can demonstrate various models that exhibit varying trade-offs between accuracy and computational complexity.
arXiv Detail & Related papers (2024-12-09T09:46:21Z) - Adaptive Resolution Residual Networks -- Generalizing Across Resolutions Easily and Efficiently [7.087237546722617]
We introduce Adaptive Resolution Residual Networks (ARRNs)<n>ARRNs inherit the advantages of adaptive-resolution methods and the ease of use of fixed-resolution methods.<n>We show that ARRNs embrace the challenge posed by diverse resolutions with greater flexibility, robustness, and computational efficiency.
arXiv Detail & Related papers (2024-12-09T04:25:37Z) - Cross-resolution Face Recognition via Identity-Preserving Network and
Knowledge Distillation [12.090322373964124]
Cross-resolution face recognition is a challenging problem for modern deep face recognition systems.
This paper proposes a new approach that enforces the network to focus on the discriminative information stored in the low-frequency components of a low-resolution image.
arXiv Detail & Related papers (2023-03-15T14:52:46Z) - Learning Resolution-Adaptive Representations for Cross-Resolution Person
Re-Identification [49.57112924976762]
Cross-resolution person re-identification problem aims to match low-resolution (LR) query identity images against high resolution (HR) gallery images.
It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.
This paper explores an alternative SR-free paradigm to directly compare HR and LR images via a dynamic metric, which is adaptive to the resolution of a query image.
arXiv Detail & Related papers (2022-07-09T03:49:51Z) - Resolution based Feature Distillation for Cross Resolution Person
Re-Identification [17.86505685442293]
Person re-identification (re-id) aims to retrieve images of same identities across different camera views.
Resolution mismatch occurs due to varying distances between person of interest and cameras.
We propose a Resolution based Feature Distillation (RFD) approach to overcome the problem of multiple resolutions.
arXiv Detail & Related papers (2021-09-16T11:07:59Z) - Spatially-Adaptive Image Restoration using Distortion-Guided Networks [51.89245800461537]
We present a learning-based solution for restoring images suffering from spatially-varying degradations.
We propose SPAIR, a network design that harnesses distortion-localization information and dynamically adjusts to difficult regions in the image.
arXiv Detail & Related papers (2021-08-19T11:02:25Z) - Image Matching with Scale Adjustment [57.18604132027697]
We show how to represent and extract interest points at variable scales.
We devise a method allowing the comparison of two images at two different resolutions.
arXiv Detail & Related papers (2020-12-10T11:03:25Z) - 3D Human Shape and Pose from a Single Low-Resolution Image with
Self-Supervised Learning [105.49950571267715]
Existing deep learning methods for 3D human shape and pose estimation rely on relatively high-resolution input images.
We propose RSC-Net, which consists of a Resolution-aware network, a Self-supervision loss, and a Contrastive learning scheme.
We show that both these new training losses provide robustness when learning 3D shape and pose in a weakly-supervised manner.
arXiv Detail & Related papers (2020-07-27T16:19:52Z) - Invertible Image Rescaling [118.2653765756915]
We develop an Invertible Rescaling Net (IRN) to produce visually-pleasing low-resolution images.
We capture the distribution of the lost information using a latent variable following a specified distribution in the downscaling process.
arXiv Detail & Related papers (2020-05-12T09:55:53Z) - Gated Fusion Network for Degraded Image Super Resolution [78.67168802945069]
We propose a dual-branch convolutional neural network to extract base features and recovered features separately.
By decomposing the feature extraction step into two task-independent streams, the dual-branch model can facilitate the training process.
arXiv Detail & Related papers (2020-03-02T13:28:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.