A study on the adequacy of common IQA measures for medical images
- URL: http://arxiv.org/abs/2405.19224v4
- Date: Fri, 20 Dec 2024 16:04:16 GMT
- Title: A study on the adequacy of common IQA measures for medical images
- Authors: Anna Breger, Clemens Karner, Ian Selby, Janek Gröhl, Sören Dittmer, Edward Lilley, Judith Babar, Jake Beckford, Thomas R Else, Timothy J Sadler, Shahab Shahipasand, Arthikkaa Thavakumar, Michael Roberts, Carola-Bibiane Schönlieb,
- Abstract summary: Reported inconsistencies arising in medical images are not surprising, as they have different properties than natural images.
In this study, we test the applicability of common IQA measures for medical image data by comparing their assessment to manually rated chest X-ray (5 experts) and photoacoustic image data (2 experts)
- Score: 6.580928439802918
- License:
- Abstract: Image quality assessment (IQA) is standard practice in the development stage of novel machine learning algorithms that operate on images. The most commonly used IQA measures have been developed and tested for natural images, but not in the medical setting. Reported inconsistencies arising in medical images are not surprising, as they have different properties than natural images. In this study, we test the applicability of common IQA measures for medical image data by comparing their assessment to manually rated chest X-ray (5 experts) and photoacoustic image data (2 experts). Moreover, we include supplementary studies on grayscale natural images and accelerated brain MRI data. The results of all experiments show a similar outcome in line with previous findings for medical images: PSNR and SSIM in the default setting are in the lower range of the result list and HaarPSI outperforms the other tested measures in the overall performance. Also among the top performers in our experiments are the full reference measures FSIM, LPIPS and MS-SSIM. Generally, the results on natural images yield considerably higher correlations, suggesting that additional employment of tailored IQA measures for medical imaging algorithms is needed.
Related papers
- HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment [66.17085272972885]
We introduce the first Image Quality Assessment Database for image Harmony evaluation (HarmonyIQAD)
Based on this database, we propose a Harmony Image Quality Assessment (HarmonyIQA) to predict human visual preference for harmonized images.
Experiments show that HarmonyIQA achieves state-of-the-art performance on human visual preference evaluation for harmonized images.
arXiv Detail & Related papers (2025-01-02T07:30:17Z) - Parameter choices in HaarPSI for IQA with medical images [6.133660772208096]
We optimize parameters for two annotated medical data sets, a photoacoustic and a chest X-Ray data set.
We denote the optimized setting, which improves the performance for the medical images notably, by HaarPSI$_MED$.
The results suggest that adapting common IQA measures within their frameworks for medical images can provide a valuable, generalizable addition to the employment of more specific task-based measures.
arXiv Detail & Related papers (2024-10-31T16:28:49Z) - A study of why we need to reassess full reference image quality assessment with medical images [7.018256825895632]
PSNR and SSIM are known and tested for working successfully in many natural imaging tasks.
discrepancies in medical scenarios have been reported, highlighting the gap between development and actual clinical application.
This paper provides a structured and comprehensive overview of examples where PSNR and SSIM prove to be unsuitable for the assessment of novel algorithms.
arXiv Detail & Related papers (2024-05-29T14:01:40Z) - AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images [70.42666704072964]
We establish a large-scale AI generated omnidirectional image IQA database named AIGCOIQA2024.
A subjective IQA experiment is conducted to assess human visual preferences from three perspectives.
We conduct a benchmark experiment to evaluate the performance of state-of-the-art IQA models on our database.
arXiv Detail & Related papers (2024-04-01T10:08:23Z) - Comparing Results of Thermographic Images Based Diagnosis for Breast
Diseases [0.0]
This paper examines the potential contribution of infrared (IR) imaging in breast diseases detection.
We used lO2 IR single breast images from the Pro Engenharia (PROENG) public database.
These images were collected from Universidade Federal de Pernambuco (UFPE) Hospital.
arXiv Detail & Related papers (2022-08-30T17:22:52Z) - Automated SSIM Regression for Detection and Quantification of Motion
Artefacts in Brain MR Images [54.739076152240024]
Motion artefacts in magnetic resonance brain images are a crucial issue.
The assessment of MR image quality is fundamental before proceeding with the clinical diagnosis.
An automated image quality assessment based on the structural similarity index (SSIM) regression has been proposed here.
arXiv Detail & Related papers (2022-06-14T10:16:54Z) - Image Quality Assessment for Magnetic Resonance Imaging [4.05136808278614]
Image quality assessment (IQA) algorithms aim to reproduce the human's perception of the image quality.
We use outputs of neural network models trained to solve problems relevant to MRI.
Seven trained radiologists assess distorted images, with their verdicts then correlated with 35 different image quality metrics.
arXiv Detail & Related papers (2022-03-15T11:52:29Z) - Solving Inverse Problems in Medical Imaging with Score-Based Generative
Models [87.48867245544106]
Reconstructing medical images from partial measurements is an important inverse problem in Computed Tomography (CT) and Magnetic Resonance Imaging (MRI)
Existing solutions based on machine learning typically train a model to directly map measurements to medical images.
We propose a fully unsupervised technique for inverse problem solving, leveraging the recently introduced score-based generative models.
arXiv Detail & Related papers (2021-11-15T05:41:12Z) - Artifact- and content-specific quality assessment for MRI with image
rulers [11.551528894727573]
In clinical practice MR images are often first seen by radiologists long after the scan.
If image quality is inadequate either patients have to return for an additional scan, or a suboptimal interpretation is rendered.
We propose a framework with multi-task CNN model trained with calibrated labels and inferenced with image rulers.
arXiv Detail & Related papers (2021-11-06T02:17:12Z) - Learning Conditional Knowledge Distillation for Degraded-Reference Image
Quality Assessment [157.1292674649519]
We propose a practical solution named degraded-reference IQA (DR-IQA)
DR-IQA exploits the inputs of IR models, degraded images, as references.
Our results can even be close to the performance of full-reference settings.
arXiv Detail & Related papers (2021-08-18T02:35:08Z) - Evaluation of Complexity Measures for Deep Learning Generalization in
Medical Image Analysis [77.34726150561087]
PAC-Bayes flatness-based and path norm-based measures produce the most consistent explanation for the combination of models and data.
We also investigate the use of multi-task classification and segmentation approach for breast images.
arXiv Detail & Related papers (2021-03-04T20:58:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.