Related papers: Granger causal inference on DAGs identifies genomic loci regulating transcription

Granger causal inference on DAGs identifies genomic loci regulating transcription

URL: http://arxiv.org/abs/2210.10168v1
Date: Tue, 18 Oct 2022 21:15:10 GMT
Title: Granger causal inference on DAGs identifies genomic loci regulating transcription
Authors: Rohit Singh, Alexander P. Wu, Bonnie Berger
Abstract summary: GrID-Net is a framework based on graph neural networks with lagged message passing for Granger causal inference on DAG-structured systems. Our application is the analysis of single-cell multimodal data to identify genomic loci that mediate the regulation of specific genes.
Score: 77.58911272503771
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: When a dynamical system can be modeled as a sequence of observations, Granger causality is a powerful approach for detecting predictive interactions between its variables. However, traditional Granger causal inference has limited utility in domains where the dynamics need to be represented as directed acyclic graphs (DAGs) rather than as a linear sequence, such as with cell differentiation trajectories. Here, we present GrID-Net, a framework based on graph neural networks with lagged message passing for Granger causal inference on DAG-structured systems. Our motivating application is the analysis of single-cell multimodal data to identify genomic loci that mediate the regulation of specific genes. To our knowledge, GrID-Net is the first single-cell analysis tool that accounts for the temporal lag between a genomic locus becoming accessible and its downstream effect on a target gene's expression. We applied GrID-Net on multimodal single-cell assays that profile chromatin accessibility (ATAC-seq) and gene expression (RNA-seq) in the same cell and show that it dramatically outperforms existing methods for inferring regulatory locus-gene links, achieving up to 71% greater agreement with independent population genetics-based estimates. By extending Granger causality to DAG-structured dynamical systems, our work unlocks new domains for causal analyses and, more specifically, opens a path towards elucidating gene regulatory interactions relevant to cellular differentiation and complex human diseases at unprecedented scale and resolution.

Related papers

Multi-omic Causal Discovery using Genotypes and Gene Expression [0.0]
We introduce GENESIS, a constraint-based causal algorithm to infer ancestral relationships in transcriptomic data.<n>By integrating genotypes as fixed causal anchors, GENESIS provides a principled head start'' to classical causal discovery algorithms.<n>This framework offers a powerful avenue for uncovering causal pathways in complex traits, with promising applications to functional genomics, drug discovery, and precision medicine.
arXiv Detail & Related papers (2025-05-21T11:52:23Z)
GRAPE: Heterogeneous Graph Representation Learning for Genetic Perturbation with Coding and Non-Coding Biotype [51.58774936662233]
Building gene regulatory networks (GRN) is essential to understand and predict the effects of genetic perturbations.<n>In this work, we leverage pre-trained large language model and DNA sequence model to extract features from gene descriptions and DNA sequence data.<n>We introduce gene biotype information for the first time in genetic perturbation, simulating the distinct roles of genes with different biotypes in regulating cellular processes.
arXiv Detail & Related papers (2025-05-06T03:35:24Z)
A scalable gene network model of regulatory dynamics in single cells [88.48246132084441]
We introduce a Functional Learnable model of Cell dynamicS, FLeCS, that incorporates gene network structure into coupled differential equations to model gene regulatory functions. Given (pseudo)time-series single-cell data, FLeCS accurately infers cell dynamics at scale.
arXiv Detail & Related papers (2025-03-25T19:19:21Z)
GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters. Trained on an expansive dataset comprising 386B bp of DNA, the GENERator demonstrates state-of-the-art performance across both established and newly proposed benchmarks. It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of enhancer sequences with specific activity profiles.
arXiv Detail & Related papers (2025-02-11T05:39:49Z)
Gene Regulatory Network Inference in the Presence of Selection Bias and Latent Confounders [14.626706466908386]
Gene Regulatory Network Inference (GRNI) aims to identify causal relationships among genes using gene expression data. Gene expression is influenced by latent confounders, such as non-coding RNAs, which add complexity to GRNI. We propose GISL (Gene Regulatory Network Inference in the presence of Selection bias and Latent confounders) to infer true regulatory relationships in the presence of selection and confounding issues.
arXiv Detail & Related papers (2025-01-17T11:27:58Z)
Interpretable Neural ODEs for Gene Regulatory Network Discovery under Perturbations [4.34315395377214]
We propose PerturbODE, a novel framework that incorporates biologically informative neural ordinary differential equations (neural ODEs) to model cell state trajectories under perturbations. We demonstrate PerturbODE's efficacy in trajectory prediction and GRN inference across simulated and real over-expression datasets.
arXiv Detail & Related papers (2025-01-05T01:04:23Z)
Granger Causality Detection with Kolmogorov-Arnold Networks [20.96356350801151]
This study contributes to the definition of neural Granger causality models. We develop a framework called Granger Causality KAN (GC-KAN) along with a tailored training approach designed specifically for Granger causality detection. Our findings show the potential of KANs to outperform Kolmogorov-Arnoldgressive networks (KANs) in discerning interpretable Granger causal relationships.
arXiv Detail & Related papers (2024-12-19T20:10:34Z)
A Misclassification Network-Based Method for Comparative Genomic Analysis [3.7671415694914927]
Classifying genome sequences based on metadata has been an active area of research in comparative genomics for decades. In this study, we integrate AI and network science approaches to develop a comparative genomic analysis framework.
arXiv Detail & Related papers (2024-12-09T23:22:15Z)
Targeted Cause Discovery with Data-Driven Learning [66.86881771339145]
We propose a novel machine learning approach for inferring causal variables of a target variable from observations. We employ a neural network trained to identify causality through supervised learning on simulated data. Empirical results demonstrate the effectiveness of our method in identifying causal relationships within large-scale gene regulatory networks.
arXiv Detail & Related papers (2024-08-29T02:21:11Z)
Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning [10.44434676119443]
Inferring gene regulatory networks (GRNs) from single-cell RNA sequencing (scRNA-seq) data is a complex challenge. In this study, we tackle this challenge by leveraging the single-cell BERT-based pre-trained transformer model (scBERT) We introduce a novel joint graph learning approach that combines the rich contextual representations learned by single-cell language models with the structured knowledge encoded in GRNs.
arXiv Detail & Related papers (2024-07-25T16:42:08Z)
Inference of dynamical gene regulatory networks from single-cell data with physics informed neural networks [0.0]
We show how physics-informed neural networks (PINNs) can be used to infer the parameters of predictive, dynamical GRNs. Specifically we study GRNs that exhibit bifurcation behavior and can therefore model cell differentiation.
arXiv Detail & Related papers (2024-01-14T21:43:10Z)
Causal Inference in Gene Regulatory Networks with GFlowNet: Towards Scalability in Large Systems [87.45270862120866]
We introduce Swift-DynGFN as a novel framework that enhances causal structure learning in GRNs. Specifically, Swift-DynGFN exploits gene-wise independence to boost parallelization and to lower computational cost.
arXiv Detail & Related papers (2023-10-05T14:59:19Z)
Stability Analysis of Non-Linear Classifiers using Gene Regulatory Neural Network for Biological AI [2.0755366440393743]
We develop a mathematical model of gene-perceptron using a dual-layered transcription-translation chemical reaction model. We perform stability analysis for each gene-perceptron within the fully-connected GRNN sub network to determine temporal as well as stable concentration outputs.
arXiv Detail & Related papers (2023-09-14T21:37:38Z)
DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets [81.75973217676986]
Gene regulatory networks (GRN) describe interactions between genes and their products that control gene expression and cellular function. Existing methods either focus on challenge (1), identifying cyclic structure from dynamics, or on challenge (2) learning complex Bayesian posteriors over DAGs, but not both. In this paper we leverage the fact that it is possible to estimate the "velocity" of gene expression with RNA velocity techniques to develop an approach that addresses both challenges.
arXiv Detail & Related papers (2023-02-08T16:36:40Z)
Topological Data Analysis in Time Series: Temporal Filtration and Application to Single-Cell Genomics [13.173307471333619]
We propose the single-cell topological simplicial analysis (scTSA) Applying this approach to the single-cell gene expression profiles from local networks of cells reveals a previously unseen topology of cellular ecology. Benchmarked on the single-cell RNA-seq data of zebrafish embryogenesis spanning 38,731 cells, 25 cell types and 12 time steps, our approach highlights the gastrulation as the most critical stage.
arXiv Detail & Related papers (2022-04-29T12:46:14Z)
Effect Identification in Cluster Causal Diagrams [51.42809552422494]
We introduce a new type of graphical model called cluster causal diagrams (for short, C-DAGs) C-DAGs allow for the partial specification of relationships among variables based on limited prior knowledge. We develop the foundations and machinery for valid causal inferences over C-DAGs.
arXiv Detail & Related papers (2022-02-22T21:27:31Z)
Multi-modal Self-supervised Pre-training for Regulatory Genome Across Cell Types [75.65676405302105]
We propose a simple yet effective approach for pre-training genome data in a multi-modal and self-supervised manner, which we call GeneBERT. We pre-train our model on the ATAC-seq dataset with 17 million genome sequences.
arXiv Detail & Related papers (2021-10-11T12:48:44Z)
Interpretable Models for Granger Causality Using Self-explaining Neural Networks [4.56877715768796]
We propose a novel framework for inferring Granger causality under nonlinear dynamics based on an extension of self-explaining neural networks. This framework is more interpretable than other neural-network-based techniques for inferring Granger causality.
arXiv Detail & Related papers (2021-01-19T12:59:00Z)

This list is automatically generated from the titles and abstracts of the papers in this site.