The Beauty or the Beast: Which Aspect of Synthetic Medical Images
  Deserves Our Focus?
        - URL: http://arxiv.org/abs/2305.09789v2
- Date: Wed, 14 Jun 2023 14:39:17 GMT
- Title: The Beauty or the Beast: Which Aspect of Synthetic Medical Images
  Deserves Our Focus?
- Authors: Xiaodan Xing, Yang Nan, Federico Felder, Simon Walsh and Guang Yang
- Abstract summary: Training medical AI algorithms requires large volumes of accurately labeled datasets.
Synthetic images generated from deep generative models can help alleviate the data scarcity problem, but their effectiveness relies on their fidelity to real-world images.
- Score: 1.6305276867803995
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Training medical AI algorithms requires large volumes of accurately labeled
datasets, which are difficult to obtain in the real world. Synthetic images
generated from deep generative models can help alleviate the data scarcity
problem, but their effectiveness relies on their fidelity to real-world images.
Typically, researchers select synthesis models based on image quality
measurements, prioritizing synthetic images that appear realistic. However, our
empirical analysis shows that high-fidelity and visually appealing synthetic
images are not necessarily superior. In fact, we present a case where
low-fidelity synthetic images outperformed their high-fidelity counterparts in
downstream tasks. Our findings highlight the importance of comprehensive
analysis before incorporating synthetic data into real-world applications. We
hope our results will raise awareness among the research community of the value
of low-fidelity synthetic images in medical AI algorithm training.
 
      
        Related papers
        - CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images   by AI [58.35348718345307]
 Current efforts to distinguish between real and AI-generated images may lack generalization.
We propose a novel framework, Co-Spy, that first enhances existing semantic features.
We also create Co-Spy-Bench, a comprehensive dataset comprising 5 real image datasets and 22 state-of-the-art generative models.
 arXiv  Detail & Related papers  (2025-03-24T01:59:29Z)
- FairDiff: Fair Segmentation with Point-Image Diffusion [15.490776421216689]
 Our research adopts a data-driven strategy-enhancing data balance by integrating synthetic images.
We formulate the problem in a joint optimization manner, in which three networks are optimized towards the goal of empirical risk and fairness.
Our model achieves superior fairness segmentation performance compared to the state-of-the-art fairness learning models.
 arXiv  Detail & Related papers  (2024-07-08T17:59:58Z)
- MediSyn: A Generalist Text-Guided Latent Diffusion Model For Diverse   Medical Image Synthesis [4.541407789437896]
 MediSyn is a text-guided latent diffusion model capable of generating synthetic images from 6 medical specialties and 10 image types.
A direct comparison of the synthetic images against the real images confirms that our model synthesizes novel images and, crucially, may preserve patient privacy.
Our findings highlight the immense potential for generalist image generative models to accelerate algorithmic research and development in medicine.
 arXiv  Detail & Related papers  (2024-05-16T04:28:44Z)
- Is Synthetic Image Useful for Transfer Learning? An Investigation into   Data Generation, Volume, and Utilization [62.157627519792946]
 We introduce a novel framework called bridged transfer, which initially employs synthetic images for fine-tuning a pre-trained model to improve its transferability.
We propose dataset style inversion strategy to improve the stylistic alignment between synthetic and real images.
Our proposed methods are evaluated across 10 different datasets and 5 distinct models, demonstrating consistent improvements.
 arXiv  Detail & Related papers  (2024-03-28T22:25:05Z)
- Training Robust Deep Physiological Measurement Models with Synthetic
  Video-based Data [11.31971398273479]
 We propose measures to add real-world noise to synthetic physiological signals and corresponding facial videos.
Our results show that we were able to reduce the average MAE from 6.9 to 2.0.
 arXiv  Detail & Related papers  (2023-11-09T13:55:45Z)
- UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception [62.71374902455154]
 We leverage recent advancements in neural rendering to improve static and dynamic novelview UAV-based image rendering.
We demonstrate a considerable performance boost when a state-of-the-art detection model is optimized primarily on hybrid sets of real and synthetic data.
 arXiv  Detail & Related papers  (2023-10-25T00:20:37Z)
- Augmenting medical image classifiers with synthetic data from latent
  diffusion models [12.077733447347592]
 We show that latent diffusion models can scalably generate images of skin disease.
We generate and analyze a new dataset of 458,920 synthetic images produced using several generation strategies.
 arXiv  Detail & Related papers  (2023-08-23T22:34:49Z)
- You Don't Have to Be Perfect to Be Amazing: Unveil the Utility of
  Synthetic Images [2.0790547421662064]
 We have established a comprehensive set of evaluators for synthetic images, including fidelity, variety, privacy, and utility.
By analyzing more than 100k chest X-ray images and their synthetic copies, we have demonstrated that there is an inevitable trade-off between synthetic image fidelity, variety, and privacy.
 arXiv  Detail & Related papers  (2023-05-25T13:47:04Z)
- ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-real
  Novel View Synthesis via Contrastive Learning [102.46382882098847]
 We first investigate the effects of synthetic data in synthetic-to-real novel view synthesis.
We propose to introduce geometry-aware contrastive learning to learn multi-view consistent features with geometric constraints.
Our method can render images with higher quality and better fine-grained details, outperforming existing generalizable novel view synthesis methods in terms of PSNR, SSIM, and LPIPS.
 arXiv  Detail & Related papers  (2023-03-20T12:06:14Z)
- Synthetic Data for Object Classification in Industrial Applications [53.180678723280145]
 In object classification, capturing a large number of images per object and in different conditions is not always possible.
This work explores the creation of artificial images using a game engine to cope with limited data in the training dataset.
 arXiv  Detail & Related papers  (2022-12-09T11:43:04Z)
- Is synthetic data from generative models ready for image recognition? [69.42645602062024]
 We study whether and how synthetic images generated from state-of-the-art text-to-image generation models can be used for image recognition tasks.
We showcase the powerfulness and shortcomings of synthetic data from existing generative models, and propose strategies for better applying synthetic data for recognition tasks.
 arXiv  Detail & Related papers  (2022-10-14T06:54:24Z)
- A Shared Representation for Photorealistic Driving Simulators [83.5985178314263]
 We propose to improve the quality of generated images by rethinking the discriminator architecture.
The focus is on the class of problems where images are generated given semantic inputs, such as scene segmentation maps or human body poses.
We aim to learn a shared latent representation that encodes enough information to jointly do semantic segmentation, content reconstruction, along with a coarse-to-fine grained adversarial reasoning.
 arXiv  Detail & Related papers  (2021-12-09T18:59:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.