Modelling Cellular Perturbations with the Sparse Additive Mechanism
Shift Variational Autoencoder
- URL: http://arxiv.org/abs/2311.02794v2
- Date: Tue, 16 Jan 2024 01:18:50 GMT
- Title: Modelling Cellular Perturbations with the Sparse Additive Mechanism
Shift Variational Autoencoder
- Authors: Michael Bereket, Theofanis Karaletsos
- Abstract summary: We propose the Sparse Additive Mechanism Shift Variational Autoencoder, SAMS-VAE, to combine compositionality, disentanglement, and interpretability for perturbation models.
SAMS-VAE models the latent state of a perturbed sample as the sum of a local latent variable capturing sample-specific variation and sparse global variables of latent intervention effects.
We evaluate SAMS-VAE both and qualitatively on a range of tasks using two popular single cell sequencing datasets.
- Score: 6.352775857356592
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Generative models of observations under interventions have been a vibrant
topic of interest across machine learning and the sciences in recent years. For
example, in drug discovery, there is a need to model the effects of diverse
interventions on cells in order to characterize unknown biological mechanisms
of action. We propose the Sparse Additive Mechanism Shift Variational
Autoencoder, SAMS-VAE, to combine compositionality, disentanglement, and
interpretability for perturbation models. SAMS-VAE models the latent state of a
perturbed sample as the sum of a local latent variable capturing
sample-specific variation and sparse global variables of latent intervention
effects. Crucially, SAMS-VAE sparsifies these global latent variables for
individual perturbations to identify disentangled, perturbation-specific latent
subspaces that are flexibly composable. We evaluate SAMS-VAE both
quantitatively and qualitatively on a range of tasks using two popular single
cell sequencing datasets. In order to measure perturbation-specific
model-properties, we also introduce a framework for evaluation of perturbation
models based on average treatment effects with links to posterior predictive
checks. SAMS-VAE outperforms comparable models in terms of generalization
across in-distribution and out-of-distribution tasks, including a combinatorial
reasoning task under resource paucity, and yields interpretable latent
structures which correlate strongly to known biological mechanisms. Our results
suggest SAMS-VAE is an interesting addition to the modeling toolkit for machine
learning-driven scientific discovery.
Related papers
- Generative Intervention Models for Causal Perturbation Modeling [80.72074987374141]
In many applications, it is a priori unknown which mechanisms of a system are modified by an external perturbation.
We propose a generative intervention model (GIM) that learns to map these perturbation features to distributions over atomic interventions.
arXiv Detail & Related papers (2024-11-21T10:37:57Z) - Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation [0.0]
This work presents an omics-driven modeling pipeline that integrates machine-learning tools.
Random forests and permutation feature importance are proposed to mine omics datasets.
Continuous and differentiable machine-learning functions can be trained to link the reduced omics feature set to key components of the dynamic model.
arXiv Detail & Related papers (2024-10-24T15:50:35Z) - Adjoint Sensitivity Analysis on Multi-Scale Bioprocess Stochastic Reaction Network [2.6130735302655554]
We introduce an adjoint sensitivity approach to expedite the learning of mechanistic model parameters.
In this paper, we consider enzymatic analysis (SA) representing a multi-scale bioprocess mechanistic model.
arXiv Detail & Related papers (2024-05-07T05:06:45Z) - Self-Labeling in Multivariate Causality and Quantification for Adaptive Machine Learning [0.0]
An interactive causality based self-labeling method was proposed to autonomously associate causally related data streams for domain adaptation.
This paper further develops the self-labeling framework and its theoretical foundations to address these research questions.
arXiv Detail & Related papers (2024-04-08T18:16:22Z) - Variational Inference of Parameters in Opinion Dynamics Models [9.51311391391997]
This work uses variational inference to estimate the parameters of an opinion dynamics ABM.
We transform the inference process into an optimization problem suitable for automatic differentiation.
Our approach estimates both macroscopic (bounded confidence intervals and backfire thresholds) and microscopic ($200$ categorical, agent-level roles) more accurately than simulation-based and MCMC methods.
arXiv Detail & Related papers (2024-03-08T14:45:18Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Learning Causal Representations of Single Cells via Sparse Mechanism
Shift Modeling [3.2435888122704037]
We propose a deep generative model of single-cell gene expression data for which each perturbation is treated as an intervention targeting an unknown, but sparse, subset of latent variables.
We benchmark these methods on simulated single-cell data to evaluate their performance at latent units recovery, causal target identification and out-of-domain generalization.
arXiv Detail & Related papers (2022-11-07T15:47:40Z) - Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z) - Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease
Progression [71.7560927415706]
latent hybridisation model (LHM) integrates a system of expert-designed ODEs with machine-learned Neural ODEs to fully describe the dynamics of the system.
We evaluate LHM on synthetic data as well as real-world intensive care data of COVID-19 patients.
arXiv Detail & Related papers (2021-06-05T11:42:45Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.