Domain adaptation using optimal transport for invariant learning using
histopathology datasets
- URL: http://arxiv.org/abs/2303.02241v1
- Date: Fri, 3 Mar 2023 22:19:19 GMT
- Title: Domain adaptation using optimal transport for invariant learning using
histopathology datasets
- Authors: Kianoush Falahkheirkhah, Alex Lu, David Alvarez-Melis, Grace Huynh
- Abstract summary: Histopathology is critical for the diagnosis of many diseases, including cancer.
computational techniques are limited by batch effects, where technical factors like differences in preparation protocol or scanners can alter the appearance of slides.
Here, we propose a domain adaptation method that improves the generalization of histological models to data from unseen institutions.
- Score: 13.133231212085988
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Histopathology is critical for the diagnosis of many diseases, including
cancer. These protocols typically require pathologists to manually evaluate
slides under a microscope, which is time-consuming and subjective, leading to
interest in machine learning to automate analysis. However, computational
techniques are limited by batch effects, where technical factors like
differences in preparation protocol or scanners can alter the appearance of
slides, causing models trained on one institution to fail when generalizing to
others. Here, we propose a domain adaptation method that improves the
generalization of histopathological models to data from unseen institutions,
without the need for labels or retraining in these new settings. Our approach
introduces an optimal transport (OT) loss, that extends adversarial methods
that penalize models if images from different institutions can be distinguished
in their representation space. Unlike previous methods, which operate on single
samples, our loss accounts for distributional differences between batches of
images. We show that on the Camelyon17 dataset, while both methods can adapt to
global differences in color distribution, only our OT loss can reliably
classify a cancer phenotype unseen during training. Together, our results
suggest that OT improves generalization on rare but critical phenotypes that
may only make up a small fraction of the total tiles and variation in a slide.
Related papers
- Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection.
Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels.
Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z) - Realistic Data Enrichment for Robust Image Segmentation in
Histopathology [2.248423960136122]
We propose a new approach, based on diffusion models, which can enrich an imbalanced dataset with plausible examples from underrepresented groups.
Our method can simply expand limited clinical datasets making them suitable to train machine learning pipelines.
arXiv Detail & Related papers (2023-04-19T09:52:50Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - CellMix: A General Instance Relationship based Method for Data
Augmentation Towards Pathology Image Classification [6.9596321268519326]
In pathology image analysis, obtaining and maintaining high-quality annotated samples is an extremely labor-intensive task.
We propose the CellMix framework, which employs a novel distribution-oriented in-place shuffle approach.
Our experiments in pathology image classification tasks demonstrate state-of-the-art (SOTA) performance on 7 distinct datasets.
arXiv Detail & Related papers (2023-01-27T03:17:35Z) - Intra-class Adaptive Augmentation with Neighbor Correction for Deep
Metric Learning [99.14132861655223]
We propose a novel intra-class adaptive augmentation (IAA) framework for deep metric learning.
We reasonably estimate intra-class variations for every class and generate adaptive synthetic samples to support hard samples mining.
Our method significantly improves and outperforms the state-of-the-art methods on retrieval performances by 3%-6%.
arXiv Detail & Related papers (2022-11-29T14:52:38Z) - Stain-invariant self supervised learning for histopathology image
analysis [74.98663573628743]
We present a self-supervised algorithm for several classification tasks within hematoxylin and eosin stained images of breast cancer.
Our method achieves the state-of-the-art performance on several publicly available breast cancer datasets.
arXiv Detail & Related papers (2022-11-14T18:16:36Z) - PatchNR: Learning from Small Data by Patch Normalizing Flow
Regularization [57.37911115888587]
We introduce a regularizer for the variational modeling of inverse problems in imaging based on normalizing flows.
Our regularizer, called patchNR, involves a normalizing flow learned on patches of very few images.
arXiv Detail & Related papers (2022-05-24T12:14:26Z) - About Explicit Variance Minimization: Training Neural Networks for
Medical Imaging With Limited Data Annotations [2.3204178451683264]
Variance Aware Training (VAT) method exploits this property by introducing the variance error into the model loss function.
We validate VAT on three medical imaging datasets from diverse domains and various learning objectives.
arXiv Detail & Related papers (2021-05-28T21:34:04Z) - A Study of Domain Generalization on Ultrasound-based Multi-Class
Segmentation of Arteries, Veins, Ligaments, and Nerves Using Transfer
Learning [12.869637050331306]
We study the US-based segmentation of multiple classes through transfer learning by fine-tuning different contiguous blocks within the model.
We propose a simple method for predicting generalization on unseen datasets and observe statistically significant differences between the fine-tuning methods.
arXiv Detail & Related papers (2020-11-13T16:59:20Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.