Correlational Lagrangian Schr\"odinger Bridge: Learning Dynamics with
Population-Level Regularization
- URL: http://arxiv.org/abs/2402.10227v1
- Date: Sun, 4 Feb 2024 19:33:44 GMT
- Title: Correlational Lagrangian Schr\"odinger Bridge: Learning Dynamics with
Population-Level Regularization
- Authors: Yuning You, Ruida Zhou, Yang Shen
- Abstract summary: We introduce a novel framework dubbed correlational Lagrangian Schr"odinger bridge ( CLSB)
CLSB seeks for the evolution "bridging" among cross-text observations, while regularized for the minimal population "cost"
Our contributions include (1) a new class of population regularizers capturing the temporal variations in multivariate relations, with the tractable formulation derived, and (3) three domain-informed instantiations based on genetic co-expression stability.
- Score: 27.855576268065857
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate modeling of system dynamics holds intriguing potential in broad
scientific fields including cytodynamics and fluid mechanics. This task often
presents significant challenges when (i) observations are limited to
cross-sectional samples (where individual trajectories are inaccessible for
learning), and moreover, (ii) the behaviors of individual particles are
heterogeneous (especially in biological systems due to biodiversity). To
address them, we introduce a novel framework dubbed correlational Lagrangian
Schr\"odinger bridge (CLSB), aiming to seek for the evolution "bridging" among
cross-sectional observations, while regularized for the minimal population
"cost". In contrast to prior methods relying on \textit{individual}-level
regularizers for all particles \textit{homogeneously} (e.g. restraining
individual motions), CLSB operates at the population level admitting the
heterogeneity nature, resulting in a more generalizable modeling in practice.
To this end, our contributions include (1) a new class of population
regularizers capturing the temporal variations in multivariate relations, with
the tractable formulation derived, (2) three domain-informed instantiations
based on genetic co-expression stability, and (3) an integration of population
regularizers into data-driven generative models as constrained optimization,
and a numerical solution, with further extension to conditional generative
models. Empirically, we demonstrate the superiority of CLSB in single-cell
sequencing data analyses such as simulating cell development over time and
predicting cellular responses to drugs of varied doses.
Related papers
- Inferring Parameter Distributions in Heterogeneous Motile Particle Ensembles: A Likelihood Approach for Second Order Langevin Models [0.8274836883472768]
Inference methods are required to understand and predict the motion patterns from time discrete trajectory data provided by experiments.
We propose a new method to approximate the likelihood for non-linear second order Langevin models.
We thereby pave the way for the systematic, data-driven inference of dynamical models for actively driven entities.
arXiv Detail & Related papers (2024-11-13T15:27:02Z) - Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold [83.18058549195855]
We argue that multiple processes in natural sciences have to be represented as vector fields on the Wasserstein manifold of probability densities.
In particular, this is crucial for personalized medicine where the development of diseases and their respective treatment response depends on the microenvironment of cells specific to each patient.
We propose Meta Flow Matching (MFM), a practical approach to integrating along these vector fields on the Wasserstein manifold by amortizing the flow model over the initial populations.
arXiv Detail & Related papers (2024-08-26T20:05:31Z) - Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen [76.02070962797794]
We present Cell Flow for Generation, a flow-based conditional generative model for multi-modal single-cell counts.
Our results suggest improved recovery of crucial biological data characteristics while accounting for novel generative tasks.
arXiv Detail & Related papers (2024-07-16T14:05:03Z) - Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective Motion [0.0]
We present a novel deep learning framework for estimating the underlying equations of motion from observed trajectories.
Our framework integrates graph neural networks with neural differential equations, enabling effective prediction of two-body interactions.
arXiv Detail & Related papers (2024-05-26T09:47:17Z) - Mixed Models with Multiple Instance Learning [51.440557223100164]
We introduce MixMIL, a framework integrating Generalized Linear Mixed Models (GLMM) and Multiple Instance Learning (MIL)
Our empirical results reveal that MixMIL outperforms existing MIL models in single-cell datasets.
arXiv Detail & Related papers (2023-11-04T16:42:42Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Unbalanced Diffusion Schr\"odinger Bridge [71.31485908125435]
We introduce unbalanced DSBs which model the temporal evolution of marginals with arbitrary finite mass.
This is achieved by deriving the time reversal of differential equations with killing and birth terms.
We present two novel algorithmic schemes that comprise a scalable objective function for training unbalanced DSBs.
arXiv Detail & Related papers (2023-06-15T12:51:56Z) - Learning Causal Representations of Single Cells via Sparse Mechanism
Shift Modeling [3.2435888122704037]
We propose a deep generative model of single-cell gene expression data for which each perturbation is treated as an intervention targeting an unknown, but sparse, subset of latent variables.
We benchmark these methods on simulated single-cell data to evaluate their performance at latent units recovery, causal target identification and out-of-domain generalization.
arXiv Detail & Related papers (2022-11-07T15:47:40Z) - Learning Anisotropic Interaction Rules from Individual Trajectories in a
Heterogeneous Cellular Population [0.0]
We develop WSINDy for second order IPSs to model the movement of communities of cells.
Our approach learns the interaction rules that govern the dynamics of a heterogeneous population of migrating cells.
We demonstrate the efficiency and proficiency of the method on several test scenarios, motivated by common cell migration experiments.
arXiv Detail & Related papers (2022-04-29T15:00:21Z) - Multi-modal Self-supervised Pre-training for Regulatory Genome Across
Cell Types [75.65676405302105]
We propose a simple yet effective approach for pre-training genome data in a multi-modal and self-supervised manner, which we call GeneBERT.
We pre-train our model on the ATAC-seq dataset with 17 million genome sequences.
arXiv Detail & Related papers (2021-10-11T12:48:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.