Improving Nonalcoholic Fatty Liver Disease Classification Performance
With Latent Diffusion Models
- URL: http://arxiv.org/abs/2307.06507v2
- Date: Wed, 15 Nov 2023 02:24:21 GMT
- Title: Improving Nonalcoholic Fatty Liver Disease Classification Performance
With Latent Diffusion Models
- Authors: Romain Hardy, Joe Klepich, Ryan Mitchell, Steve Hall, Jericho
Villareal, Cornelia Ilin
- Abstract summary: We show that by combining synthetic images, generated using diffusion models, with real images, we can enhance nonalcoholic fatty liver disease classification performance.
Our results show superior performance for the diffusion-generated images, with a maximum IS score of $1.90$ compared to $1.67$ for GANs, and a minimum FID score of $69.45$ compared to $100.05$ for GANs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Integrating deep learning with clinical expertise holds great potential for
addressing healthcare challenges and empowering medical professionals with
improved diagnostic tools. However, the need for annotated medical images is
often an obstacle to leveraging the full power of machine learning models. Our
research demonstrates that by combining synthetic images, generated using
diffusion models, with real images, we can enhance nonalcoholic fatty liver
disease (NAFLD) classification performance even in low-data regime settings. We
evaluate the quality of the synthetic images by comparing two metrics:
Inception Score (IS) and Fr\'{e}chet Inception Distance (FID), computed on
diffusion- and generative adversarial network (GAN)-generated images. Our
results show superior performance for the diffusion-generated images, with a
maximum IS score of $1.90$ compared to $1.67$ for GANs, and a minimum FID score
of $69.45$ compared to $100.05$ for GANs. Utilizing a partially frozen CNN
backbone (EfficientNet v1), our synthetic augmentation method achieves a
maximum image-level ROC AUC of $0.904$ on a NAFLD prediction task.
Related papers
- Ultrasound Image Enhancement using CycleGAN and Perceptual Loss [4.428854369140015]
This work introduces an advanced framework designed to enhance ultrasound images, especially those captured by portable hand-held devices.
We utilize an enhanced generative adversarial network (CycleGAN) model for ultrasound image enhancement across five organ systems.
arXiv Detail & Related papers (2023-12-18T23:21:00Z) - DiffMIC: Dual-Guidance Diffusion Network for Medical Image
Classification [32.67098520984195]
We propose the first diffusion-based model (named DiffMIC) to address general medical image classification.
Our experimental results demonstrate that DiffMIC outperforms state-of-the-art methods by a significant margin.
arXiv Detail & Related papers (2023-03-19T09:15:45Z) - Semi-supervised GAN for Bladder Tissue Classification in Multi-Domain
Endoscopic Images [10.48945682277992]
We propose a semi-surprised Generative Adrial Network (GAN)-based method composed of three main components.
The overall average classification accuracy, precision, and recall obtained with the proposed method for tissue classification are 0.90, 0.88, and 0.89 respectively.
arXiv Detail & Related papers (2022-12-21T21:32:36Z) - Preservation of High Frequency Content for Deep Learning-Based Medical
Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists.
We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - Deep AUC Maximization for Medical Image Classification: Challenges and
Opportunities [60.079782224958414]
We will present and discuss opportunities and challenges brought by a new deep learning method by AUC (aka underlinebf Deep underlinebf AUC classification)
arXiv Detail & Related papers (2021-11-01T15:31:32Z) - Vision Transformers for femur fracture classification [59.99241204074268]
The Vision Transformer (ViT) was able to correctly predict 83% of the test images.
Good results were obtained in sub-fractures with the largest and richest dataset ever.
arXiv Detail & Related papers (2021-08-07T10:12:42Z) - Prediction of low-keV monochromatic images from polyenergetic CT scans
for improved automatic detection of pulmonary embolism [21.47219330040151]
We are training convolutional neural networks that can emulate the generation of monoE images from conventional single energy CT acquisitions.
We expand on these methods through the use of a multi-task optimization approach, under which the networks achieve improved classification as well as generation results.
arXiv Detail & Related papers (2021-02-02T11:42:31Z) - Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies
on Medical Image Classification [63.44396343014749]
We propose a new margin-based surrogate loss function for the AUC score.
It is more robust than the commonly used.
square loss while enjoying the same advantage in terms of large-scale optimization.
To the best of our knowledge, this is the first work that makes DAM succeed on large-scale medical image datasets.
arXiv Detail & Related papers (2020-12-06T03:41:51Z) - An interpretable classifier for high-resolution breast cancer screening
images utilizing weakly supervised localization [45.00998416720726]
We propose a framework to address the unique properties of medical images.
This model first uses a low-capacity, yet memory-efficient, network on the whole image to identify the most informative regions.
It then applies another higher-capacity network to collect details from chosen regions.
Finally, it employs a fusion module that aggregates global and local information to make a final prediction.
arXiv Detail & Related papers (2020-02-13T15:28:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.