Out of Distribution Generalization via Interventional Style Transfer in
Single-Cell Microscopy
- URL: http://arxiv.org/abs/2306.11890v1
- Date: Thu, 15 Jun 2023 20:08:16 GMT
- Title: Out of Distribution Generalization via Interventional Style Transfer in
Single-Cell Microscopy
- Authors: Wolfgang M. Pernice, Michael Doron, Alex Quach, Aditya Pratapa, Sultan
Kenjeyev, Nicholas De Veaux, Michio Hirano, Juan C. Caicedo
- Abstract summary: Real-world deployment of computer vision systems requires causal representations that are invariant to contextual nuisances.
We propose tests to assess the extent to which models learn causal representations across increasingly challenging levels of OOD-generalization.
We show that despite seemingly strong performance, as assessed by other established metrics, both naive and contemporary baselines designed to ward against confounding, collapse on these tests.
- Score: 1.7778546320705952
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Real-world deployment of computer vision systems, including in the discovery
processes of biomedical research, requires causal representations that are
invariant to contextual nuisances and generalize to new data. Leveraging the
internal replicate structure of two novel single-cell fluorescent microscopy
datasets, we propose generally applicable tests to assess the extent to which
models learn causal representations across increasingly challenging levels of
OOD-generalization. We show that despite seemingly strong performance, as
assessed by other established metrics, both naive and contemporary baselines
designed to ward against confounding, collapse on these tests. We introduce a
new method, Interventional Style Transfer (IST), that substantially improves
OOD generalization by generating interventional training distributions in which
spurious correlations between biological causes and nuisances are mitigated. We
publish our code and datasets.
Related papers
- Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation [0.3362278589492841]
Existing model evaluation approaches often rely on real-world datasets, which are limited in availability, embed confounding biases, and lack flexibility needed for systematic experimentation.
We propose a novel structured synthetic data framework designed for the controlled robustness of benchmarking model, fairness, and generalisability.
arXiv Detail & Related papers (2025-04-29T11:04:28Z) - Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning [3.053782081947358]
We propose a generative replay-based Continual Learning (CL) strategy designed to prevent forgetting in foundation models for WBC classification.
Our method employs lightweight generators to mimic past data with a synthetic latent representation to enable privacy-preserving replay.
This work presents a practical solution for maintaining reliable WBC classification in real-world clinical settings.
arXiv Detail & Related papers (2025-03-25T16:30:58Z) - Generalize Drug Response Prediction by Latent Independent Projection for Asymmetric Constrained Domain Generalization [11.649397977546435]
We propose a novel domain generalization framework, termed panCancerDR, to address this challenge.
We conceptualize each cancer type as a distinct source domain, with its cell lines serving as domain-specific samples.
Our empirical experiments demonstrate that panCancerDR effectively learns task-relevant features from diverse source domains.
arXiv Detail & Related papers (2025-02-06T12:53:45Z) - Back to Bayesics: Uncovering Human Mobility Distributions and Anomalies with an Integrated Statistical and Neural Framework [14.899157568336731]
DeepBayesic is a novel framework that integrates Bayesian principles with deep neural networks to model the underlying distributions.
We evaluate our approach on several mobility datasets, demonstrating significant improvements over state-of-the-art anomaly detection methods.
arXiv Detail & Related papers (2024-10-01T19:02:06Z) - Targeted Cause Discovery with Data-Driven Learning [66.86881771339145]
We propose a novel machine learning approach for inferring causal variables of a target variable from observations.
We employ a neural network trained to identify causality through supervised learning on simulated data.
Empirical results demonstrate the effectiveness of our method in identifying causal relationships within large-scale gene regulatory networks.
arXiv Detail & Related papers (2024-08-29T02:21:11Z) - Learning Divergence Fields for Shift-Robust Graph Representations [73.11818515795761]
In this work, we propose a geometric diffusion model with learnable divergence fields for the challenging problem with interdependent data.
We derive a new learning objective through causal inference, which can guide the model to learn generalizable patterns of interdependence that are insensitive across domains.
arXiv Detail & Related papers (2024-06-07T14:29:21Z) - Utilizing Weak-to-Strong Consistency for Semi-Supervised Glomeruli Segmentation [0.803784679671919]
We present a semi-supervised learning approach for glomeruli segmentation based on the weak-to-strong consistency framework validated on multiple real-world datasets.
Our experimental results on 3 independent datasets indicate superior performance of our approach as compared with existing supervised baseline models.
arXiv Detail & Related papers (2024-05-30T10:19:21Z) - Rethinking Model Prototyping through the MedMNIST+ Dataset Collection [0.11999555634662634]
This work presents a benchmark for the MedMNIST+ database to diversify the evaluation landscape.
We conduct a thorough analysis of common convolutional neural networks (CNNs) and Transformer-based architectures, for medical image classification.
Our findings suggest that computationally efficient training schemes and modern foundation models hold promise in bridging the gap between expensive end-to-end training and more resource-refined approaches.
arXiv Detail & Related papers (2024-04-24T10:19:25Z) - Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - A Comprehensive Augmentation Framework for Anomaly Detection [1.6114012813668932]
This paper analyzes crucial traits of simulated anomalies that contribute to the training of reconstructive networks.
We integrate this framework with a reconstruction-based approach and concurrently propose a split training strategy.
arXiv Detail & Related papers (2023-08-29T07:00:35Z) - Causal Balancing for Domain Generalization [95.97046583437145]
We propose a balanced mini-batch sampling strategy to reduce the domain-specific spurious correlations in observed training distributions.
We provide an identifiability guarantee of the source of spuriousness and show that our proposed approach provably samples from a balanced, spurious-free distribution.
arXiv Detail & Related papers (2022-06-10T17:59:11Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Cohort Bias Adaptation in Aggregated Datasets for Lesion Segmentation [0.8466401378239363]
We propose a generalized affine conditioning framework to learn and account for cohort biases across multi-source datasets.
We show that our cohort bias adaptation method improves performance of the network on pooled datasets.
arXiv Detail & Related papers (2021-08-02T08:32:57Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.