Impact of Clinical Image Quality on Efficient Foundation Model Finetuning
- URL: http://arxiv.org/abs/2508.11864v2
- Date: Wed, 20 Aug 2025 08:42:48 GMT
- Title: Impact of Clinical Image Quality on Efficient Foundation Model Finetuning
- Authors: Yucheng Tang, Pawel Rajwa, Alexander Ng, Yipei Wang, Wen Yan, Natasha Thorley, Aqua Asif, Clare Allen, Louise Dickinson, Francesco Giganti, Shonit Punwani, Daniel C. Alexander, Veeru Kasivisvanathan, Yipeng Hu
- Abstract summary: Foundation models in medical imaging have shown promising label efficiency, achieving high performance on downstream tasks. We investigate the impact of variable image quality on the label-efficient finetuning, by quantifying the generalisability of the finetuned models. Our findings indicate that image quality distribution and its finetune-and-test mismatch significantly affect model performance.
- Score: 37.729881862462925
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Foundation models in medical imaging have shown promising label efficiency, achieving high performance on downstream tasks using only a fraction of the annotated data otherwise required. In this study, we evaluate this potential in the context of prostate multiparametric MRI using ProFound, a recently developed domain-specific vision foundation model pretrained on large-scale prostate MRI datasets. We investigate the impact of variable image quality on label-efficient finetuning by quantifying the generalisability of the finetuned models. We conduct a comprehensive set of experiments by systematically varying the ratios of high- and low-quality images in the finetuning and evaluation sets. Our findings indicate that image quality distribution and its finetune-and-test mismatch significantly affect model performance. In particular: a) Varying the ratio of high- to low-quality images between finetuning and test sets leads to notable differences in downstream performance; and b) The presence of sufficient high-quality images in the finetuning set is critical for maintaining strong performance, whilst the importance of matched finetuning and testing distribution varies between different downstream tasks, such as automated radiology reporting and prostate cancer detection. Importantly, experimental results also show that, although finetuning requires significantly less labeled data than training from scratch when the quality ratio is consistent, this label efficiency is not independent of the image quality distribution. For example, we show cases in which, without sufficient high-quality images in finetuning, finetuned models may fail to outperform those without pretraining.
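The experimental design described in the abstract (varying the ratio of high- to low-quality images in the finetuning and evaluation sets) can be illustrated with a minimal sketch. This is not the authors' code: the case representation, the function names `sample_by_quality_ratio` and `ratio_grid`, the subset size, and the ratio values are all hypothetical, chosen only to show how matched and mismatched quality conditions could be constructed.

```python
import random
from typing import List, Tuple

# Hypothetical case record: (case_id, is_high_quality).
Case = Tuple[str, bool]


def sample_by_quality_ratio(
    cases: List[Case], n: int, high_ratio: float, seed: int = 0
) -> List[Case]:
    """Draw n cases so that round(n * high_ratio) of them are high quality."""
    if not 0.0 <= high_ratio <= 1.0:
        raise ValueError("high_ratio must be in [0, 1]")
    rng = random.Random(seed)
    high = [c for c in cases if c[1]]
    low = [c for c in cases if not c[1]]
    n_high = round(n * high_ratio)
    n_low = n - n_high
    if n_high > len(high) or n_low > len(low):
        raise ValueError("not enough cases to satisfy the requested ratio")
    # Sample without replacement from each quality group, then mix.
    subset = rng.sample(high, n_high) + rng.sample(low, n_low)
    rng.shuffle(subset)
    return subset


def ratio_grid(ratios: List[float]) -> List[Tuple[float, float]]:
    """All (finetune_ratio, test_ratio) pairs, matched and mismatched."""
    return [(ft, te) for ft in ratios for te in ratios]


if __name__ == "__main__":
    # Synthetic pool (assumption): 200 cases, half labelled high quality.
    pool = [(f"case_{i:03d}", i % 2 == 0) for i in range(200)]
    # Keep finetuning and test pools disjoint to avoid leakage.
    finetune_pool, test_pool = pool[:100], pool[100:]
    for ft_ratio, te_ratio in ratio_grid([0.0, 0.5, 1.0]):
        finetune_set = sample_by_quality_ratio(finetune_pool, 40, ft_ratio, seed=1)
        test_set = sample_by_quality_ratio(test_pool, 40, te_ratio, seed=2)
        matched = "matched" if ft_ratio == te_ratio else "mismatched"
        print(f"finetune={ft_ratio:.1f} test={te_ratio:.1f} ({matched}): "
              f"{len(finetune_set)} finetune cases, {len(test_set)} test cases")
```

In a setup like this, each grid cell would correspond to one finetune-and-evaluate run, so that performance can be compared across matched and mismatched quality distributions.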
Related papers
- X-Mark: Saliency-Guided Robust Dataset Ownership Verification for Medical Imaging [67.85884025186755]
High-quality medical imaging datasets are essential for training deep learning models, but their unauthorized use raises serious copyright and ethical concerns. Medical imaging presents a unique challenge for existing dataset ownership verification methods designed for natural images. We propose X-Mark, a sample-specific clean-label watermarking method for chest x-ray copyright protection.
arXiv Detail & Related papers (2026-02-10T00:03:43Z)
- Beyond Pixels: Medical Image Quality Assessment with Implicit Neural Representations [2.0934875997852096]
Artifacts pose a significant challenge in medical imaging, impacting diagnostic accuracy and downstream analysis. We propose the use of implicit neural representations (INRs) for image quality assessment. Our method is evaluated on the ACDC dataset with synthetically generated artifact patterns.
arXiv Detail & Related papers (2025-08-07T09:00:06Z)
- Metrics that matter: Evaluating image quality metrics for medical image generation [48.85783422900129]
This study comprehensively assesses commonly used no-reference image quality metrics using brain MRI data. We evaluate metric sensitivity to a range of challenges, including noise, distribution shifts, and, critically, morphological alterations designed to mimic clinically relevant inaccuracies.
arXiv Detail & Related papers (2025-05-12T01:57:25Z)
- Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis [55.959002385347645]
Latent Drifting enables diffusion models to be conditioned for medical images fitted for the complex task of counterfactual image generation. We evaluate our method on three public longitudinal benchmark datasets of brain MRI and chest X-rays for counterfactual image generation.
arXiv Detail & Related papers (2024-12-30T01:59:34Z)
- DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild [73.6767681305851]
Blind image quality assessment (IQA) in the wild presents significant challenges. Given the difficulty in collecting large-scale training data, leveraging limited data to develop a model with strong generalization remains an open problem. Motivated by the robust image perception capabilities of pre-trained text-to-image (T2I) diffusion models, we propose a novel IQA method, diffusion priors-based IQA.
arXiv Detail & Related papers (2024-05-30T12:32:35Z)
- Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics [54.08757792080732]
We propose integrating deep features from pre-trained visual models with a statistical analysis model to achieve opinion-unaware BIQA (OU-BIQA).
Our proposed model exhibits superior consistency with human visual perception compared to state-of-the-art BIQA models.
arXiv Detail & Related papers (2024-05-29T06:09:34Z)
- On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation [47.95611203419802]
Foundation models for vision and language, pre-trained on extensive sets of natural image and text data, have emerged as a promising approach.
We compare the generalization performance to unseen domains of various pre-trained models after being fine-tuned on the same in-distribution dataset.
We further developed a new Bayesian uncertainty estimation for frozen models and used them as an indicator to characterize the model's performance on out-of-distribution data.
arXiv Detail & Related papers (2023-11-18T14:52:10Z)
- Robustness Stress Testing in Medical Image Classification [26.094688963784254]
We employ stress testing to assess model robustness and subgroup performance disparities in disease detection models.
We apply stress tests to measure the robustness of disease detection models for chest X-ray and skin lesion images.
Our experiments indicate that some models may yield more robust and equitable performance than others.
arXiv Detail & Related papers (2023-08-14T02:02:56Z)
- Test Time Adaptation for Blind Image Quality Assessment [20.50795362928567]
We introduce two novel quality-relevant auxiliary tasks at the batch and sample levels to enable TTA for blind IQA.
Our experiments reveal that even using a small batch of images from the test distribution helps achieve significant improvement in performance.
arXiv Detail & Related papers (2023-07-27T09:43:06Z)
- OTRE: Where Optimal Transport Guided Unpaired Image-to-Image Translation Meets Regularization by Enhancing [4.951748109810726]
Optimal retinal image quality is mandated for accurate medical diagnoses and automated analyses.
We propose an unpaired image-to-image translation scheme for mapping low-quality retinal CFPs to high-quality counterparts.
We validated the integrated framework, OTRE, on three publicly available retinal image datasets.
arXiv Detail & Related papers (2023-02-06T18:39:40Z)
- Self-supervised Domain Adaptation for Breaking the Limits of Low-quality Fundus Image Quality Enhancement [14.677912534121273]
Low-quality fundus images and style inconsistency potentially increase uncertainty in the diagnosis of fundus disease.
We formulate two self-supervised domain adaptation tasks to disentangle the features of image content, low-quality factor and style information.
Our DASQE method achieves new state-of-the-art performance when only low-quality images are available.
arXiv Detail & Related papers (2023-01-17T15:07:20Z)
- A Decoupled Uncertainty Model for MRI Segmentation Quality Estimation [4.104181348044472]
We propose a novel CNN architecture to decouple sources of uncertainty related to the task and different k-space artefacts.
We show that our uncertainty predictions provide a better estimate of MRI quality from the point of view of the task.
arXiv Detail & Related papers (2021-09-06T12:54:44Z)