A Survey on Training Challenges in Generative Adversarial Networks for
Biomedical Image Analysis
- URL: http://arxiv.org/abs/2201.07646v4
- Date: Fri, 11 Aug 2023 00:13:54 GMT
- Title: A Survey on Training Challenges in Generative Adversarial Networks for
Biomedical Image Analysis
- Authors: Muhammad Muneeb Saad, Ruairi O'Reilly, and Mubashir Husain Rehmani
- Abstract summary: Generative Adversarial Networks (GANs) have been widely utilized to address data limitations through the generation of synthetic biomedical images.
GANs can experience several technical challenges that impede the generation of suitable synthetic imagery.
This work presents a review and taxonomy based on solutions to the training problems of GANs in the biomedical imaging domain.
- Score: 0.6308539010172307
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In biomedical image analysis, the applicability of deep learning methods is
directly impacted by the quantity of image data available. This is due to deep
learning models requiring large image datasets to provide high-level
performance. Generative Adversarial Networks (GANs) have been widely utilized
to address data limitations through the generation of synthetic biomedical
images. GANs consist of two models. The generator, a model that learns how to
produce synthetic images based on the feedback it receives. The discriminator,
a model that classifies an image as synthetic or real and provides feedback to
the generator. Throughout the training process, a GAN can experience several
technical challenges that impede the generation of suitable synthetic imagery.
First, the mode collapse problem whereby the generator either produces an
identical image or produces a uniform image from distinct input features.
Second, the non-convergence problem whereby the gradient descent optimizer
fails to reach a Nash equilibrium. Thirdly, the vanishing gradient problem
whereby unstable training behavior occurs due to the discriminator achieving
optimal classification performance resulting in no meaningful feedback being
provided to the generator. These problems result in the production of synthetic
imagery that is blurry, unrealistic, and less diverse. To date, there has been
no survey article outlining the impact of these technical challenges in the
context of the biomedical imagery domain. This work presents a review and
taxonomy based on solutions to the training problems of GANs in the biomedical
imaging domain. This survey highlights important challenges and outlines future
research directions about the training of GANs in the domain of biomedical
imagery.
Related papers
- Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification [0.12499537119440242]
Histopathology image classification is crucial for the accurate identification and diagnosis of various diseases.
We show that synthetic images can effectively augment existing datasets, ultimately improving the performance of the downstream histopathology image classification task.
arXiv Detail & Related papers (2024-09-24T12:02:55Z) - BiomedJourney: Counterfactual Biomedical Image Generation by
Instruction-Learning from Multimodal Patient Journeys [99.7082441544384]
We present BiomedJourney, a novel method for counterfactual biomedical image generation by instruction-learning.
We use GPT-4 to process the corresponding imaging reports and generate a natural language description of disease progression.
The resulting triples are then used to train a latent diffusion model for counterfactual biomedical image generation.
arXiv Detail & Related papers (2023-10-16T18:59:31Z) - Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images [0.08192907805418582]
This work contributes an empirical demonstration of the benefits of integrating the adaptive input-image normalization with the Deep Conversaal GAN and Auxiliary GAN to alleviate the mode collapse problems.
Results demonstrate that the DCGAN and the ACGAN with adaptive input-image normalization outperform the DCGAN and ACGAN with un-normalized X-ray images.
arXiv Detail & Related papers (2023-09-21T16:43:29Z) - ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology
Image Analysis [4.724009208755395]
We present ViT-DAE, which integrates vision transformers (ViT) and diffusion autoencoders for high-quality histopathology image synthesis.
Our approach outperforms recent GAN-based and vanilla DAE methods in generating realistic images.
arXiv Detail & Related papers (2023-04-03T15:00:06Z) - Unsupervised Domain Transfer with Conditional Invertible Neural Networks [83.90291882730925]
We propose a domain transfer approach based on conditional invertible neural networks (cINNs)
Our method inherently guarantees cycle consistency through its invertible architecture, and network training can efficiently be conducted with maximum likelihood.
Our method enables the generation of realistic spectral data and outperforms the state of the art on two downstream classification tasks.
arXiv Detail & Related papers (2023-03-17T18:00:27Z) - Person Image Synthesis via Denoising Diffusion Model [116.34633988927429]
We show how denoising diffusion models can be applied for high-fidelity person image synthesis.
Our results on two large-scale benchmarks and a user study demonstrate the photorealism of our proposed approach under challenging scenarios.
arXiv Detail & Related papers (2022-11-22T18:59:50Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - Addressing the Intra-class Mode Collapse Problem using Adaptive Input
Image Normalization in GAN-based X-ray Images [0.7090165638014329]
This work contributes an empirical demonstration of the benefits of integrating the adaptive input-image normalization for the Deep Convolutional GAN.
Results demonstrate that the DCGAN with adaptive input-image normalization outperforms DCGAN with un-normalized X-ray images.
arXiv Detail & Related papers (2022-01-25T13:54:25Z) - Ensembling with Deep Generative Views [72.70801582346344]
generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
arXiv Detail & Related papers (2021-04-29T17:58:35Z) - Generative Adversarial U-Net for Domain-free Medical Image Augmentation [49.72048151146307]
The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing.
In this paper, we develop a novel generative method named generative adversarial U-Net.
Our newly designed model is domain-free and generalizable to various medical images.
arXiv Detail & Related papers (2021-01-12T23:02:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.