Synthetic DOmain-Targeted Augmentation (S-DOTA) Improves Model
Generalization in Digital Pathology
- URL: http://arxiv.org/abs/2305.02401v1
- Date: Wed, 3 May 2023 19:53:30 GMT
- Title: Synthetic DOmain-Targeted Augmentation (S-DOTA) Improves Model
Generalization in Digital Pathology
- Authors: Sai Chowdary Gullapally, Yibo Zhang, Nitin Kumar Mittal, Deeksha
Kartik, Sandhya Srinivasan, Kevin Rose, Daniel Shenker, Dinkar Juyal,
Harshith Padigela, Raymond Biju, Victor Minden, Chirag Maheshwari, Marc
Thibault, Zvi Goldstein, Luke Novak, Nidhi Chandra, Justin Lee, Aaditya
Prakash, Chintan Shah, John Abel, Darren Fahy, Amaro Taylor-Weiner, Anand
Sampat
- Abstract summary: Machine learning algorithms have the potential to improve patient outcomes in digital pathology.
generalization is limited by sensitivity to variations in tissue preparation, staining procedures and scanning equipment.
We studied the effectiveness of two Synthetic DOmain-Targeted Augmentation (S-DOTA) methods, namely CycleGAN-enabled Scanner Transform (ST) and targeted Stain Vector Augmentation (SVA)
We evaluated the ability of these techniques to improve model generalization to various tasks and settings.
- Score: 1.488519799639108
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Machine learning algorithms have the potential to improve patient outcomes in
digital pathology. However, generalization of these tools is currently limited
by sensitivity to variations in tissue preparation, staining procedures and
scanning equipment that lead to domain shift in digitized slides. To overcome
this limitation and improve model generalization, we studied the effectiveness
of two Synthetic DOmain-Targeted Augmentation (S-DOTA) methods, namely
CycleGAN-enabled Scanner Transform (ST) and targeted Stain Vector Augmentation
(SVA), and compared them against the International Color Consortium (ICC)
profile-based color calibration (ICC Cal) method and a baseline method using
traditional brightness, color and noise augmentations. We evaluated the ability
of these techniques to improve model generalization to various tasks and
settings: four models, two model types (tissue segmentation and cell
classification), two loss functions, six labs, six scanners, and three
indications (hepatocellular carcinoma (HCC), nonalcoholic steatohepatitis
(NASH), prostate adenocarcinoma). We compared these methods based on the
macro-averaged F1 scores on in-distribution (ID) and out-of-distribution (OOD)
test sets across multiple domains, and found that S-DOTA methods (i.e., ST and
SVA) led to significant improvements over ICC Cal and baseline on OOD data
while maintaining comparable performance on ID data. Thus, we demonstrate that
S-DOTA may help address generalization due to domain shift in real world
applications.
Related papers
- Understanding Stain Separation Improves Cross-Scanner Adenocarcinoma Segmentation with Joint Multi-Task Learning [2.3470559413988936]
Digital pathology has made significant advances in tumor diagnosis and segmentation, but image variability limits the effectiveness of current algorithms.
The COSAS (Cross-Organ and Cross-Scanner Adenocarcinoma) challenge addresses this issue by improving the resilience of segmentation algorithms to domain shift.
Our approach employs unsupervised learning through stain separation within a multi-task learning framework using a multi-decoder autoencoder.
arXiv Detail & Related papers (2024-09-20T06:12:52Z) - PD-L1 Classification of Weakly-Labeled Whole Slide Images of Breast Cancer [0.0]
This study aims to develop and compare models able to classify PD-L1 positivity of breast cancer samples based on WSI analysis.
The task consists of two phases: identifying regions of interest (ROI) and classifying tumors as PD-L1 positive or negative.
arXiv Detail & Related papers (2024-04-15T23:06:58Z) - Generative Adversarial Networks for Stain Normalisation in
Histopathology [2.2166690647926037]
One of the significant roadblocks to current research is the high level of visual variability across digital pathology images.
Sten normalisation aims to standardise the visual profile of digital pathology images without changing the structural content of the images.
This is an ongoing field of study as researchers aim to identify a method which efficiently normalises pathology images to make AI models more robust and generalisable.
arXiv Detail & Related papers (2023-08-05T11:38:05Z) - A Laplacian Pyramid Based Generative H&E Stain Augmentation Network [5.841841666625825]
Generative Stain Augmentation Network (G-SAN) is a GAN-based framework that augments a collection of cell images with simulated stain variations.
Using G-SAN-augmented training data provides on average 15.7% improvement in F1 score and 7.3% improvement in panoptic quality.
arXiv Detail & Related papers (2023-05-23T17:43:18Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - PrepNet: A Convolutional Auto-Encoder to Homogenize CT Scans for
Cross-Dataset Medical Image Analysis [0.22485007639406518]
COVID-19 diagnosis can now be done efficiently using PCR tests, but this use case exemplifies the need for a methodology to overcome data variability issues.
We propose a novel generative approach that aims at erasing the differences induced by e.g. the imaging technology while simultaneously introducing minimal changes to the CT scans.
arXiv Detail & Related papers (2022-08-19T15:49:47Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - Multiple Time Series Fusion Based on LSTM An Application to CAP A Phase
Classification Using EEG [56.155331323304]
Deep learning based electroencephalogram channels' feature level fusion is carried out in this work.
Channel selection, fusion, and classification procedures were optimized by two optimization algorithms.
arXiv Detail & Related papers (2021-12-18T14:17:49Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Performance of Dual-Augmented Lagrangian Method and Common Spatial
Patterns applied in classification of Motor-Imagery BCI [68.8204255655161]
Motor-imagery based brain-computer interfaces (MI-BCI) have the potential to become ground-breaking technologies for neurorehabilitation.
Due to the noisy nature of the used EEG signal, reliable BCI systems require specialized procedures for features optimization and extraction.
arXiv Detail & Related papers (2020-10-13T20:50:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.