Granger causal inference on DAGs identifies genomic loci regulating
transcription
- URL: http://arxiv.org/abs/2210.10168v1
- Date: Tue, 18 Oct 2022 21:15:10 GMT
- Title: Granger causal inference on DAGs identifies genomic loci regulating
transcription
- Authors: Rohit Singh, Alexander P. Wu, Bonnie Berger
- Abstract summary: GrID-Net is a framework based on graph neural networks with lagged message passing for Granger causal inference on DAG-structured systems.
Our application is the analysis of single-cell multimodal data to identify genomic loci that mediate the regulation of specific genes.
- Score: 77.58911272503771
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: When a dynamical system can be modeled as a sequence of observations, Granger
causality is a powerful approach for detecting predictive interactions between
its variables. However, traditional Granger causal inference has limited
utility in domains where the dynamics need to be represented as directed
acyclic graphs (DAGs) rather than as a linear sequence, such as with cell
differentiation trajectories. Here, we present GrID-Net, a framework based on
graph neural networks with lagged message passing for Granger causal inference
on DAG-structured systems. Our motivating application is the analysis of
single-cell multimodal data to identify genomic loci that mediate the
regulation of specific genes. To our knowledge, GrID-Net is the first
single-cell analysis tool that accounts for the temporal lag between a genomic
locus becoming accessible and its downstream effect on a target gene's
expression. We applied GrID-Net on multimodal single-cell assays that profile
chromatin accessibility (ATAC-seq) and gene expression (RNA-seq) in the same
cell and show that it dramatically outperforms existing methods for inferring
regulatory locus-gene links, achieving up to 71% greater agreement with
independent population genetics-based estimates. By extending Granger causality
to DAG-structured dynamical systems, our work unlocks new domains for causal
analyses and, more specifically, opens a path towards elucidating gene
regulatory interactions relevant to cellular differentiation and complex human
diseases at unprecedented scale and resolution.
Related papers
- GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters.
The model adheres to the central dogma of molecular biology, accurately generating protein-coding sequences.
It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of promoter sequences.
arXiv Detail & Related papers (2025-02-11T05:39:49Z) - Gene Regulatory Network Inference in the Presence of Selection Bias and Latent Confounders [14.626706466908386]
Gene Regulatory Network Inference (GRNI) aims to identify causal relationships among genes using gene expression data.
Gene expression is influenced by latent confounders, such as non-coding RNAs, which add complexity to GRNI.
We propose GISL (Gene Regulatory Network Inference in the presence of Selection bias and Latent confounders) to infer true regulatory relationships in the presence of selection and confounding issues.
arXiv Detail & Related papers (2025-01-17T11:27:58Z) - Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations [4.34315395377214]
We propose PerturbODE, a novel framework that incorporates biologically informative neural ordinary differential equations (neural ODEs) to model cell state trajectories under perturbations.
We demonstrate PerturbODE's efficacy in trajectory prediction and GRN inference across simulated and real over-expression datasets.
arXiv Detail & Related papers (2025-01-05T01:04:23Z) - Granger Causality Detection with Kolmogorov-Arnold Networks [20.96356350801151]
This study contributes to the definition of neural Granger causality models.
We develop a framework called Granger Causality KAN (GC-KAN) along with a tailored training approach designed specifically for Granger causality detection.
Our findings show the potential of KANs to outperform Kolmogorov-Arnoldgressive networks (KANs) in discerning interpretable Granger causal relationships.
arXiv Detail & Related papers (2024-12-19T20:10:34Z) - A Misclassification Network-Based Method for Comparative Genomic Analysis [3.7671415694914927]
Classifying genome sequences based on metadata has been an active area of research in comparative genomics for decades.
In this study, we integrate AI and network science approaches to develop a comparative genomic analysis framework.
arXiv Detail & Related papers (2024-12-09T23:22:15Z) - Targeted Cause Discovery with Data-Driven Learning [66.86881771339145]
We propose a novel machine learning approach for inferring causal variables of a target variable from observations.
We employ a neural network trained to identify causality through supervised learning on simulated data.
Empirical results demonstrate the effectiveness of our method in identifying causal relationships within large-scale gene regulatory networks.
arXiv Detail & Related papers (2024-08-29T02:21:11Z) - Inference of dynamical gene regulatory networks from single-cell data
with physics informed neural networks [0.0]
We show how physics-informed neural networks (PINNs) can be used to infer the parameters of predictive, dynamical GRNs.
Specifically we study GRNs that exhibit bifurcation behavior and can therefore model cell differentiation.
arXiv Detail & Related papers (2024-01-14T21:43:10Z) - Causal Inference in Gene Regulatory Networks with GFlowNet: Towards
Scalability in Large Systems [87.45270862120866]
We introduce Swift-DynGFN as a novel framework that enhances causal structure learning in GRNs.
Specifically, Swift-DynGFN exploits gene-wise independence to boost parallelization and to lower computational cost.
arXiv Detail & Related papers (2023-10-05T14:59:19Z) - DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with
GFlowNets [81.75973217676986]
Gene regulatory networks (GRN) describe interactions between genes and their products that control gene expression and cellular function.
Existing methods either focus on challenge (1), identifying cyclic structure from dynamics, or on challenge (2) learning complex Bayesian posteriors over DAGs, but not both.
In this paper we leverage the fact that it is possible to estimate the "velocity" of gene expression with RNA velocity techniques to develop an approach that addresses both challenges.
arXiv Detail & Related papers (2023-02-08T16:36:40Z) - Multi-modal Self-supervised Pre-training for Regulatory Genome Across
Cell Types [75.65676405302105]
We propose a simple yet effective approach for pre-training genome data in a multi-modal and self-supervised manner, which we call GeneBERT.
We pre-train our model on the ATAC-seq dataset with 17 million genome sequences.
arXiv Detail & Related papers (2021-10-11T12:48:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.