Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation
- URL: http://arxiv.org/abs/2410.18864v1
- Date: Thu, 24 Oct 2024 15:50:35 GMT
- Title: Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation
- Authors: Sebastián Espinel-Ríos, José Montaño López, José L. Avalos
- Abstract summary: This work presents an omics-driven modeling pipeline that integrates machine-learning tools.
Random forests and permutation feature importance are proposed to mine omics datasets.
Continuous and differentiable machine-learning functions can be trained to link the reduced omics feature set to key components of the dynamic model.
- Abstract: This work presents an omics-driven modeling pipeline that integrates machine-learning tools to facilitate the dynamic modeling of multiscale biological systems. Random forests and permutation feature importance are proposed to mine omics datasets, guiding feature selection and dimensionality reduction for dynamic modeling. Continuous and differentiable machine-learning functions can be trained to link the reduced omics feature set to key components of the dynamic model, resulting in a hybrid model. As proof of concept, we apply this framework to a high-dimensional proteomics dataset of $\textit{Saccharomyces cerevisiae}$. After identifying key intracellular proteins that correlate with cell growth, targeted dynamic experiments are designed, and key model parameters are captured as functions of the selected proteins using Gaussian processes. This approach captures the dynamic behavior of yeast strains under varying proteome profiles while estimating the uncertainty in the hybrid model's predictions. The outlined modeling framework is adaptable to other scenarios, such as integrating additional layers of omics data for more advanced multiscale biological systems, or employing alternative machine-learning methods to handle larger datasets. Overall, this study outlines a strategy for leveraging omics data to inform multiscale dynamic modeling in systems biology and bioprocess engineering.
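The two steps named in the abstract (feature selection with random forests and permutation feature importance, then a Gaussian process that maps the selected protein to a model parameter) can be illustrated with a minimal sketch. This is not the authors' implementation. The data are synthetic, the choice of scikit-learn is an assumption, and the dynamic model is reduced to exponential growth dX/dt = mu * X so that it has a closed-form solution. All names (`simulate_biomass`, `mu_true`, the protein index 3) are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)

# Synthetic "proteomics" matrix: 80 strains x 20 proteins (all values invented).
n_strains, n_proteins = 80, 20
X = rng.uniform(0.0, 1.0, size=(n_strains, n_proteins))

# Assumed ground truth: growth rate depends only on protein 3 (saturating form).
mu_true = 0.1 + 0.4 * X[:, 3] / (0.3 + X[:, 3])
y = mu_true + rng.normal(0.0, 0.01, size=n_strains)

# Step 1: random forest + permutation feature importance to rank proteins.
forest = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
pfi = permutation_importance(forest, X, y, n_repeats=10, random_state=0)
top = int(np.argmax(pfi.importances_mean))  # index of the most important protein

# Step 2: Gaussian process linking the selected protein to the growth rate mu.
kernel = 1.0 * RBF(length_scale=0.3) + WhiteKernel(noise_level=1e-3)
gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True, random_state=0)
gp.fit(X[:, [top]], y)


def simulate_biomass(protein_level, x0=0.1, t_end=10.0, n_samples=200):
    """Propagate GP uncertainty in mu through dX/dt = mu * X.

    Draws growth rates from the GP predictive distribution and evaluates the
    closed-form solution X(t) = x0 * exp(mu * t) for each draw.
    """
    mu_mean, mu_std = gp.predict(np.array([[protein_level]]), return_std=True)
    mu_draws = rng.normal(mu_mean[0], mu_std[0], size=n_samples)
    final = x0 * np.exp(mu_draws * t_end)
    return final.mean(), final.std()


mean_biomass, std_biomass = simulate_biomass(0.5)
print(f"selected protein index: {top}")
print(f"final biomass: {mean_biomass:.2f} +/- {std_biomass:.2f}")
```

The spread of the sampled trajectories gives a simple uncertainty estimate for the hybrid prediction. A more realistic model would integrate a full set of ODEs numerically for each sampled parameter value.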
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z)
- Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen [76.02070962797794]
We present Cell Flow for Generation, a flow-based conditional generative model for multi-modal single-cell counts.
Our results suggest improved recovery of crucial biological data characteristics while accounting for novel generative tasks.
arXiv Detail & Related papers (2024-07-16T14:05:03Z)
- Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective Motion [0.0]
We present a novel deep learning framework for estimating the underlying equations of motion from observed trajectories.
Our framework integrates graph neural networks with neural differential equations, enabling effective prediction of two-body interactions.
arXiv Detail & Related papers (2024-05-26T09:47:17Z)
- Modelling Cellular Perturbations with the Sparse Additive Mechanism Shift Variational Autoencoder [6.352775857356592]
We propose the Sparse Additive Mechanism Shift Variational Autoencoder, SAMS-VAE, to combine compositionality, disentanglement, and interpretability for perturbation models.
SAMS-VAE models the latent state of a perturbed sample as the sum of a local latent variable capturing sample-specific variation and sparse global variables of latent intervention effects.
We evaluate SAMS-VAE both quantitatively and qualitatively on a range of tasks using two popular single-cell sequencing datasets.
arXiv Detail & Related papers (2023-11-05T23:37:31Z)
- Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z)
- Data-driven and Physics Informed Modelling of Chinese Hamster Ovary Cell Bioreactors [0.0]
We propose a data-driven hybrid model to learn models of the dynamical evolution of Chinese Hamster Ovary cell bioreactors from process data.
We encode the convex optimization step of the overdetermined metabolic biophysical system as a differentiable, feed-forward layer into our architectures.
arXiv Detail & Related papers (2023-05-05T03:09:33Z)
- Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can simulate million-size populations in a few seconds on commodity hardware, integrate with deep neural networks, and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z)
- Emerging Patterns in the Continuum Representation of Protein-Lipid Fingerprints [12.219106300827798]
We evaluate the capabilities of a continuum model developed using 1-dimensional statistics from a molecular dynamics model.
We develop a highly predictive classification model that identifies complex and emergent behavior from the continuum model.
Our approach confirms the existence of protein-specific "lipid fingerprints", i.e. spatial rearrangements of lipids in response to proteins of interest.
arXiv Detail & Related papers (2022-07-09T20:07:49Z)
- Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations [68.62843292346813]
We propose a structured latent ODE model that captures system input variations within its latent representation.
Building on a static variable specification, our model learns factors of variation for each input to the system, thus separating the effects of the system inputs in the latent space.
arXiv Detail & Related papers (2022-02-25T20:00:56Z)
- Inference of cell dynamics on perturbation data using adjoint sensitivity [4.606583317143614]
Data-driven dynamic models of cell biology can be used to predict cell response to unseen perturbations.
Recent work has demonstrated the derivation of interpretable models with explicit interaction terms.
This work aims to extend the range of applicability of this model inference approach to a diversity of biological systems.
arXiv Detail & Related papers (2021-04-13T19:15:56Z)
- Towards an Automatic Analysis of CHO-K1 Suspension Growth in Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel machine-learning architecture that allows us to infuse a deep neural network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)