Clustering with Communication: A Variational Framework for Single Cell Representation Learning
- URL: http://arxiv.org/abs/2505.04891v1
- Date: Thu, 08 May 2025 01:53:36 GMT
- Title: Clustering with Communication: A Variational Framework for Single Cell Representation Learning
- Authors: Cong Qi, Yeqing Chen, Jie Zhang, Wei Zhi,
- Abstract summary: We propose CCCVAE, a variational autoencoder framework that incorporates CCC signals into single-cell representation learning.<n>We show that CCCVAE improves clustering performance, achieving higher evaluation scores than standard VAE baselines.
- Score: 2.275097126764287
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Single-cell RNA sequencing (scRNA-seq) has revealed complex cellular heterogeneity, but recent studies emphasize that understanding biological function also requires modeling cell-cell communication (CCC), the signaling interactions mediated by ligand-receptor pairs that coordinate cellular behavior. Tools like CellChat have demonstrated that CCC plays a critical role in processes such as cell differentiation, tissue regeneration, and immune response, and that transcriptomic data inherently encodes rich information about intercellular signaling. We propose CCCVAE, a novel variational autoencoder framework that incorporates CCC signals into single-cell representation learning. By leveraging a communication-aware kernel derived from ligand-receptor interactions and a sparse Gaussian process, CCCVAE encodes biologically informed priors into the latent space. Unlike conventional VAEs that treat each cell independently, CCCVAE encourages latent embeddings to reflect both transcriptional similarity and intercellular signaling context. Empirical results across four scRNA-seq datasets show that CCCVAE improves clustering performance, achieving higher evaluation scores than standard VAE baselines. This work demonstrates the value of embedding biological priors into deep generative models for unsupervised single-cell analysis.
Related papers
- STAGED: A Multi-Agent Neural Network for Learning Cellular Interaction Dynamics [8.659754814655303]
Single-cell technology has improved our understanding of cellular states and subpopulations in various tissues under normal and diseased conditions.<n>With spatial transcriptomics, we can represent cellular organization, along with dynamic cell-cell interactions that lead to changes in cell state.<n>We introduce Spatio Temporal Agent-Based Graph Evolution Dynamics(STAGED) to model intercellular communication, and its effect on the intracellular gene regulatory network.
arXiv Detail & Related papers (2025-07-15T18:46:07Z) - Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges [68.98973318553983]
We propose a framework based on Dual Diffusion Implicit Bridges (DDIB) to learn the mapping between different data distributions.<n>We integrate gene regulatory network (GRN) information to propagate perturbation signals in a biologically meaningful way.<n>We also incorporate a masking mechanism to predict silent genes, improving the quality of generated profiles.
arXiv Detail & Related papers (2025-06-26T09:05:38Z) - scSiameseClu: A Siamese Clustering Framework for Interpreting single-cell RNA Sequencing Data [27.903476609960062]
Single-cell RNA sequencing (scRNA-seq) reveals cell heterogeneity.<n>Cell clustering plays a key role in identifying cell types and marker genes.<n>Recent advances, especially graph neural networks (GNNs)-based methods, have significantly improved clustering performance.<n>We propose scSiameseClu, a novel Siamese Clustering framework for interpreting single-cell RNA-seq data.
arXiv Detail & Related papers (2025-05-19T02:17:09Z) - A scalable gene network model of regulatory dynamics in single cells [88.48246132084441]
We introduce a Functional Learnable model of Cell dynamicS, FLeCS, that incorporates gene network structure into coupled differential equations to model gene regulatory functions.<n>Given (pseudo)time-series single-cell data, FLeCS accurately infers cell dynamics at scale.
arXiv Detail & Related papers (2025-03-25T19:19:21Z) - DynaCLR: Contrastive Learning of Cellular Dynamics with Temporal Regularization [0.5385157903509136]
DynaCLR is a self-supervised method for embedding cell and organelle Dynamics via Contrastive Learning of Representations of time-lapse images.<n>We integrate single-cell tracking and time-aware contrastive sampling to learn robust, temporally regularized representations of cell dynamics.
arXiv Detail & Related papers (2024-10-15T05:01:25Z) - Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning [10.44434676119443]
Inferring gene regulatory networks (GRNs) from single-cell RNA sequencing (scRNA-seq) data is a complex challenge.
In this study, we tackle this challenge by leveraging the single-cell BERT-based pre-trained transformer model (scBERT)
We introduce a novel joint graph learning approach that combines the rich contextual representations learned by single-cell language models with the structured knowledge encoded in GRNs.
arXiv Detail & Related papers (2024-07-25T16:42:08Z) - Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen [76.02070962797794]
This work introduces CellFlow for Generation (CFGen), a flow-based conditional generative model that preserves the inherent discreteness of single-cell data.<n>CFGen generates whole-genome multi-modal single-cell data reliably, improving the recovery of crucial biological data characteristics.
arXiv Detail & Related papers (2024-07-16T14:05:03Z) - A Kolmogorov metric embedding for live cell microscopy signaling patterns [0.1547863211792184]
We present a metric embedding that captures patterns of cell signaling dynamics in 5-D $(x,yz,channel,time)$ live cell microscopy movies.<n>The embedding uses a metric distance called the normalized information distance (NID) based on Kolmogorov complexity theory an absolute measure of information content between objects.
arXiv Detail & Related papers (2024-01-04T19:25:00Z) - scBiGNN: Bilevel Graph Representation Learning for Cell Type
Classification from Single-cell RNA Sequencing Data [62.87454293046843]
Graph neural networks (GNNs) have been widely used for automatic cell type classification.
scBiGNN comprises two GNN modules to identify cell types.
scBiGNN outperforms a variety of existing methods for cell type classification from scRNA-seq data.
arXiv Detail & Related papers (2023-12-16T03:54:26Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Granger causal inference on DAGs identifies genomic loci regulating
transcription [77.58911272503771]
GrID-Net is a framework based on graph neural networks with lagged message passing for Granger causal inference on DAG-structured systems.
Our application is the analysis of single-cell multimodal data to identify genomic loci that mediate the regulation of specific genes.
arXiv Detail & Related papers (2022-10-18T21:15:10Z) - Topological Data Analysis in Time Series: Temporal Filtration and
Application to Single-Cell Genomics [13.173307471333619]
We propose the single-cell topological simplicial analysis (scTSA)
Applying this approach to the single-cell gene expression profiles from local networks of cells reveals a previously unseen topology of cellular ecology.
Benchmarked on the single-cell RNA-seq data of zebrafish embryogenesis spanning 38,731 cells, 25 cell types and 12 time steps, our approach highlights the gastrulation as the most critical stage.
arXiv Detail & Related papers (2022-04-29T12:46:14Z) - SimCD: Simultaneous Clustering and Differential expression analysis for
single-cell transcriptomic data [22.702909270039314]
Single-Cell RNA sequencing (scRNA-seq) has facilitated genome-scale transcriptomic profiling of individual cells.
Several scRNA-seq analysis methods have been proposed to first identify cell sub-populations by clustering and then separately perform differential expression analysis to understand gene expression changes.
We develop a new method -- SimCD -- that explicitly models cell heterogeneity and dynamic differential changes in one unified hierarchical gamma-negative binomial model.
arXiv Detail & Related papers (2021-04-04T01:06:18Z) - Review of Single-cell RNA-seq Data Clustering for Cell Type
Identification and Characterization [12.655970720359297]
Unsupervised learning has become the central component to identify and characterize novel cell types and gene expression patterns.
We review the existing single-cell RNA-seq data clustering methods with critical insights into the related advantages and limitations.
We conduct performance comparison experiments to evaluate several popular single-cell RNA-seq clustering approaches on two single-cell transcriptomic datasets.
arXiv Detail & Related papers (2020-01-03T22:48:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.