This Intestine Does Not Exist: Multiscale Residual Variational
Autoencoder for Realistic Wireless Capsule Endoscopy Image Generation
- URL: http://arxiv.org/abs/2302.02150v2
- Date: Tue, 7 Feb 2023 03:50:25 GMT
- Title: This Intestine Does Not Exist: Multiscale Residual Variational
Autoencoder for Realistic Wireless Capsule Endoscopy Image Generation
- Authors: Dimitrios E. Diamantis, Panagiota Gatoula, Anastasios Koulaouzidis,
and Dimitris K. Iakovidis
- Abstract summary: A novel Variational Autoencoder architecture is proposed, namely "This Intestine Does not Exist" (TIDE)
The proposed architecture comprises multiscale feature extraction convolutional blocks and residual connections, which enable the generation of high-quality and diverse datasets.
Contrary to the current approaches, which are oriented towards the augmentation of the available datasets, this study demonstrates that using TIDE, real WCE datasets can be fully substituted.
- Score: 7.430724826764835
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Medical image synthesis has emerged as a promising solution to address the
limited availability of annotated medical data needed for training machine
learning algorithms in the context of image-based Clinical Decision Support
(CDS) systems. To this end, Generative Adversarial Networks (GANs) have been
mainly applied to support the algorithm training process by generating
synthetic images for data augmentation. However, in the field of Wireless
Capsule Endoscopy (WCE), the limited content diversity and size of existing
publicly available annotated datasets, adversely affect both the training
stability and synthesis performance of GANs. Aiming to a viable solution for
WCE image synthesis, a novel Variational Autoencoder architecture is proposed,
namely "This Intestine Does not Exist" (TIDE). The proposed architecture
comprises multiscale feature extraction convolutional blocks and residual
connections, which enable the generation of high-quality and diverse datasets
even with a limited number of training images. Contrary to the current
approaches, which are oriented towards the augmentation of the available
datasets, this study demonstrates that using TIDE, real WCE datasets can be
fully substituted by artificially generated ones, without compromising
classification performance. Furthermore, qualitative and user evaluation
studies by experienced WCE specialists, validate from a medical viewpoint that
both the normal and abnormal WCE images synthesized by TIDE are sufficiently
realistic.
Related papers
- Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy [63.39037092484374]
This study focuses on the clinical evaluation of medical Synthetic Data Generation using Artificial Intelligence (AI) models.
The paper contributes by a) presenting a protocol for the systematic evaluation of synthetic images by medical experts and b) applying it to assess TIDE-II, a novel variational autoencoder-based model for high-resolution WCE image synthesis.
The results show that TIDE-II generates clinically relevant WCE images, helping to address data scarcity and enhance diagnostic tools.
arXiv Detail & Related papers (2024-10-31T19:48:50Z) - Medical Imaging Complexity and its Effects on GAN Performance [1.776717121506676]
Medical image synthesis via generative adversarial networks (GANs) emerged as a powerful method for synthetically generating photo-realistic images.
We experimentally establish benchmarks that measure the relationship between a sample dataset size and the fidelity of the generated images.
We conduct experiments with two state-of-the-art GANs, StyleGAN 3 and SPADE-GAN, trained on multiple medical imaging datasets with variable sample sizes.
arXiv Detail & Related papers (2024-10-23T15:28:25Z) - Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes [2.8498944632323755]
We propose an end-to-end hybrid architecture for medical image segmentation.
We use Hamiltonian Variational Autoencoders (HVAE) and a discriminative regularization to improve the quality of generated images.
Our architecture operates on a slice-by-slice basis to segment 3D volumes, capitilizing on the richly augmented dataset.
arXiv Detail & Related papers (2024-06-17T15:42:08Z) - Radiology Report Generation Using Transformers Conditioned with
Non-imaging Data [55.17268696112258]
This paper proposes a novel multi-modal transformer network that integrates chest x-ray (CXR) images and associated patient demographic information.
The proposed network uses a convolutional neural network to extract visual features from CXRs and a transformer-based encoder-decoder network that combines the visual features with semantic text embeddings of patient demographic information.
arXiv Detail & Related papers (2023-11-18T14:52:26Z) - EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided
Diffusion Model [4.057796755073023]
We develop controllable diffusion models for medical image synthesis, called EMIT-Diff.
We leverage recent diffusion probabilistic models to generate realistic and diverse synthetic medical image data.
In our approach, we ensure that the synthesized samples adhere to medically relevant constraints.
arXiv Detail & Related papers (2023-10-19T16:18:02Z) - Generalizing Medical Image Representations via Quaternion Wavelet Networks [9.836302410524842]
We introduce a novel, generalizable, data- and task-agnostic framework able to extract salient features from medical images.
The proposed quaternion wavelet network (QUAVE) can be easily integrated with any pre-existing medical image analysis or synthesis task.
arXiv Detail & Related papers (2023-10-16T09:34:06Z) - SIAN: Style-Guided Instance-Adaptive Normalization for Multi-Organ
Histopathology Image Synthesis [63.845552349914186]
We propose a style-guided instance-adaptive normalization (SIAN) to synthesize realistic color distributions and textures for different organs.
The four phases work together and are integrated into a generative network to embed image semantics, style, and instance-level boundaries.
arXiv Detail & Related papers (2022-09-02T16:45:46Z) - Evaluating the Quality and Diversity of DCGAN-based Generatively
Synthesized Diabetic Retinopathy Imagery [0.07499722271664144]
Publicly available diabetic retinopathy (DR) datasets are imbalanced, containing limited numbers of images with DR.
The imbalance can be addressed using Geneversarative Adrial Networks (GANs) to augment the datasets with synthetic images.
To evaluate the quality and diversity of synthetic images, several evaluation metrics, such as Multi-Scale Structural Similarity Index (MS-SSIM), Cosine Distance (CD), and Fr't Inception Distance (FID) are used.
arXiv Detail & Related papers (2022-08-10T23:50:01Z) - OADAT: Experimental and Synthetic Clinical Optoacoustic Data for
Standardized Image Processing [62.993663757843464]
Optoacoustic (OA) imaging is based on excitation of biological tissues with nanosecond-duration laser pulses followed by detection of ultrasound waves generated via light-absorption-mediated thermoelastic expansion.
OA imaging features a powerful combination between rich optical contrast and high resolution in deep tissues.
No standardized datasets generated with different types of experimental set-up and associated processing methods are available to facilitate advances in broader applications of OA in clinical settings.
arXiv Detail & Related papers (2022-06-17T08:11:26Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.