SBMLtoODEjax: Efficient Simulation and Optimization of Biological
Network Models in JAX
- URL: http://arxiv.org/abs/2307.08452v2
- Date: Sun, 29 Oct 2023 06:29:33 GMT
- Title: SBMLtoODEjax: Efficient Simulation and Optimization of Biological
Network Models in JAX
- Authors: Mayalen Etcheverry, Michael Levin, Cl\'ement Moulin-Frier, Pierre-Yves
Oudeyer
- Abstract summary: This paper introduces SBMLtoODEjax, a lightweight library designed to seamlessly integrate SBML models with ML-supported pipelines, powered by JAX.
It harnesses JAX's capabilities for efficient parallel simulations and optimization, with the aim to accelerate research in biological network analysis.
- Score: 19.55237447763145
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Advances in bioengineering and biomedicine demand a deep understanding of the
dynamic behavior of biological systems, ranging from protein pathways to
complex cellular processes. Biological networks like gene regulatory networks
and protein pathways are key drivers of embryogenesis and physiological
processes. Comprehending their diverse behaviors is essential for tackling
diseases, including cancer, as well as for engineering novel biological
constructs. Despite the availability of extensive mathematical models
represented in Systems Biology Markup Language (SBML), researchers face
significant challenges in exploring the full spectrum of behaviors and
optimizing interventions to efficiently shape those behaviors. Existing tools
designed for simulation of biological network models are not tailored to
facilitate interventions on network dynamics nor to facilitate automated
discovery. Leveraging recent developments in machine learning (ML), this paper
introduces SBMLtoODEjax, a lightweight library designed to seamlessly integrate
SBML models with ML-supported pipelines, powered by JAX. SBMLtoODEjax
facilitates the reuse and customization of SBML-based models, harnessing JAX's
capabilities for efficient parallel simulations and optimization, with the aim
to accelerate research in biological network analysis.
Related papers
- BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments [8.317138109309967]
Large language models (LLMs) and vision-language models (VLMs) have the potential to transform biological research by enabling autonomous experimentation.<n>Here we introduce BioMARS, an intelligent platform that integrates LLMs, VLMs, and modular robotics to autonomously design, plan, and execute biological experiments.<n>A web interface enables real-time human-AI collaboration, while a modular backend allows scalable integration with laboratory hardware.
arXiv Detail & Related papers (2025-07-02T08:47:02Z) - Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework [0.0]
We explore the use of deep reinforcement learning (DRL) to control network models of complex biological systems.<n>We formulate a novel control problem for Boolean network models under the asynchronous update mode in the context of cellular reprogramming.<n>To leverage the structure of biological systems, we incorporate graph neural networks with graph convolutions into the artificial neural network approximator for the action-value function learned by the DRL agent.
arXiv Detail & Related papers (2025-05-05T15:07:20Z) - BioMaze: Benchmarking and Enhancing Large Language Models for Biological Pathway Reasoning [49.487327661584686]
We introduce BioMaze, a dataset with 5.1K complex pathway problems from real research.
Our evaluation of methods such as CoT and graph-augmented reasoning, shows that LLMs struggle with pathway reasoning.
To address this, we propose PathSeeker, an LLM agent that enhances reasoning through interactive subgraph-based navigation.
arXiv Detail & Related papers (2025-02-23T17:38:10Z) - GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters.
Trained on an expansive dataset comprising 386B bp of DNA, the GENERator demonstrates state-of-the-art performance across both established and newly proposed benchmarks.
It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of enhancer sequences with specific activity profiles.
arXiv Detail & Related papers (2025-02-11T05:39:49Z) - Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models [51.316001071698224]
We introduce Biology-Instructions, the first large-scale multi-omics biological sequences-related instruction-tuning dataset.
This dataset can bridge the gap between large language models (LLMs) and complex biological sequences-related tasks.
We also develop a strong baseline called ChatMultiOmics with a novel three-stage training pipeline.
arXiv Detail & Related papers (2024-12-26T12:12:23Z) - Omics-driven hybrid dynamic modeling of bioprocesses with uncertainty estimation [0.0]
This work presents an omics-driven modeling pipeline that integrates machine-learning tools.
Random forests and permutation feature importance are proposed to mine omics datasets.
Continuous and differentiable machine-learning functions can be trained to link the reduced omics feature set to key components of the dynamic model.
arXiv Detail & Related papers (2024-10-24T15:50:35Z) - Boolean matrix logic programming for active learning of gene functions in genome-scale metabolic network models [4.762323642506732]
We seek to apply logic-based machine learning techniques to facilitate cellular engineering and drive biological discovery.
We introduce a new system, $BMLP_active$, which efficiently explores the genomic hypothesis space by guiding informative experimentation.
$BMLP_active$ can successfully learn the interaction between a gene pair with fewer training examples than random experimentation.
arXiv Detail & Related papers (2024-05-10T09:51:06Z) - An Evaluation of Large Language Models in Bioinformatics Research [52.100233156012756]
We study the performance of large language models (LLMs) on a wide spectrum of crucial bioinformatics tasks.
These tasks include the identification of potential coding regions, extraction of named entities for genes and proteins, detection of antimicrobial and anti-cancer peptides, molecular optimization, and resolution of educational bioinformatics problems.
Our findings indicate that, given appropriate prompts, LLMs like GPT variants can successfully handle most of these tasks.
arXiv Detail & Related papers (2024-02-21T11:27:31Z) - Improving Biomedical Entity Linking with Retrieval-enhanced Learning [53.24726622142558]
$k$NN-BioEL provides a BioEL model with the ability to reference similar instances from the entire training corpus as clues for prediction.
We show that $k$NN-BioEL outperforms state-of-the-art baselines on several datasets.
arXiv Detail & Related papers (2023-12-15T14:04:23Z) - Human Comprehensible Active Learning of Genome-Scale Metabolic Networks [7.838090421892651]
A comprehensible machine learning approach that efficiently explores the hypothesis space and guides experimental design is urgently needed.
We introduce a novel machine learning framework ILP-iML1515 based on Inductive Logic Programming (ILP)
ILP-iML1515 is built on comprehensible logical representations of a genome-scale metabolic model and can update the model by learning new logical structures from auxotrophic mutant trials.
arXiv Detail & Related papers (2023-08-24T12:42:00Z) - R-Mixup: Riemannian Mixup for Biological Networks [15.48899766304136]
We propose R-mixUP, a Mixup-based data augmentation technique that suits the symmetric positive definite (SPD) property of adjacency matrices from biological networks.
We demonstrate the effectiveness of R-mixUP with five real-world biological network datasets on both regression and classification tasks.
arXiv Detail & Related papers (2023-06-05T01:41:23Z) - Contrastive-Signal-Dependent Plasticity: Self-Supervised Learning in Spiking Neural Circuits [61.94533459151743]
This work addresses the challenge of designing neurobiologically-motivated schemes for adjusting the synapses of spiking networks.
Our experimental simulations demonstrate a consistent advantage over other biologically-plausible approaches when training recurrent spiking networks.
arXiv Detail & Related papers (2023-03-30T02:40:28Z) - Machine Learning for Uncovering Biological Insights in Spatial
Transcriptomics Data [0.0]
Development and homeostasis in multicellular systems require exquisite control over spatial molecular pattern formation and maintenance.
Advances in spatial transcriptomics (ST) have led to rapid development of innovative machine learning (ML) tools.
We summarize major ST analysis goals that ML can help address and current analysis trends.
arXiv Detail & Related papers (2023-03-29T14:22:08Z) - Differentiable Agent-based Epidemiology [71.81552021144589]
We introduce GradABM: a scalable, differentiable design for agent-based modeling that is amenable to gradient-based learning with automatic differentiation.
GradABM can quickly simulate million-size populations in few seconds on commodity hardware, integrate with deep neural networks and ingest heterogeneous data sources.
arXiv Detail & Related papers (2022-07-20T07:32:02Z) - Deep Bayesian Active Learning for Accelerating Stochastic Simulation [74.58219903138301]
Interactive Neural Process (INP) is a deep active learning framework for simulations and with active learning approaches.
For active learning, we propose a novel acquisition function, Latent Information Gain (LIG), calculated in the latent space of NP based models.
The results demonstrate STNP outperforms the baselines in the learning setting and LIG achieves the state-of-the-art for active learning.
arXiv Detail & Related papers (2021-06-05T01:31:51Z) - Machine Learning in Nano-Scale Biomedical Engineering [77.75587007080894]
We review the existing research regarding the use of machine learning in nano-scale biomedical engineering.
The main challenges that can be formulated as ML problems are classified into the three main categories.
For each of the presented methodologies, special emphasis is given to its principles, applications, and limitations.
arXiv Detail & Related papers (2020-08-05T15:45:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.