CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq
- URL: http://arxiv.org/abs/2110.07069v1
- Date: Wed, 13 Oct 2021 22:41:30 GMT
- Title: CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq
- Authors: Bryan He, Matthew Thomson, Meena Subramaniam, Richard Perez, Chun
Jimmie Ye, James Zou
- Abstract summary: Single-cell RNA sequencing (scRNA-seq) has the potential to provide powerful, high-resolution signatures to inform disease prognosis and precision medicine.
This paper develops an interpretable machine learning algorithm, CloudPred, to predict individuals' disease phenotypes from their scRNA-seq data.
- Score: 6.669618903574761
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Single-cell RNA sequencing (scRNA-seq) has the potential to provide powerful,
high-resolution signatures to inform disease prognosis and precision medicine.
This paper takes an important first step towards this goal by developing an
interpretable machine learning algorithm, CloudPred, to predict individuals'
disease phenotypes from their scRNA-seq data. Predicting phenotype from
scRNA-seq is challenging for standard machine learning methods -- the number of
cells measured can vary by orders of magnitude across individuals and the cell
populations are also highly heterogeneous. Typical analysis creates pseudo-bulk
samples which are biased toward prior annotations and also lose the single cell
resolution. CloudPred addresses these challenges via a novel end-to-end
differentiable learning algorithm which is coupled with a biologically informed
mixture of cell types model. CloudPred automatically infers the cell
subpopulation that are salient for the phenotype without prior annotations. We
developed a systematic simulation platform to evaluate the performance of
CloudPred and several alternative methods we propose, and find that CloudPred
outperforms the alternative methods across several settings. We further
validated CloudPred on a real scRNA-seq dataset of 142 lupus patients and
controls. CloudPred achieves AUROC of 0.98 while identifying a specific
subpopulation of CD4 T cells whose presence is highly indicative of lupus.
CloudPred is a powerful new framework to predict clinical phenotypes from
scRNA-seq data and to identify relevant cells.
Related papers
- Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen [76.02070962797794]
We present Cell Flow for Generation, a flow-based conditional generative model for multi-modal single-cell counts.
Our results suggest improved recovery of crucial biological data characteristics while accounting for novel generative tasks.
arXiv Detail & Related papers (2024-07-16T14:05:03Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Interpretable pap smear cell representation for cervical cancer
screening [3.8656297418166305]
We introduce a method to learn explainable deep cervical cell representations for pap smear images based on one class classification using variational autoencoders.
Our model can discriminate abnormality without the need of additional training of deep models.
arXiv Detail & Related papers (2023-11-17T01:29:16Z) - scHyena: Foundation Model for Full-Length Single-Cell RNA-Seq Analysis
in Brain [46.39828178736219]
We introduce scHyena, a foundation model designed to address these challenges and enhance the accuracy of scRNA-seq analysis in the brain.
scHyena is equipped with a linear adaptor layer, the positional encoding via gene-embedding, and a bidirectional Hyena operator.
This enables us to process full-length scRNA-seq data without losing any information from the raw data.
arXiv Detail & Related papers (2023-10-04T10:30:08Z) - Continuous time recurrent neural networks: overview and application to
forecasting blood glucose in the intensive care unit [56.801856519460465]
Continuous time autoregressive recurrent neural networks (CTRNNs) are a deep learning model that account for irregular observations.
We demonstrate the application of these models to probabilistic forecasting of blood glucose in a critical care setting.
arXiv Detail & Related papers (2023-04-14T09:39:06Z) - N-ACT: An Interpretable Deep Learning Model for Automatic Cell Type and
Salient Gene Identification [0.0]
A major limitation in most scRNAseq analysis pipelines is the reliance on manual annotations to determine cell identities.
N-ACT is the first-of-its-kind interpretable deep neural network for ACTI utilizing neural-attention to detect salient genes for use in cell-type identification.
arXiv Detail & Related papers (2022-05-08T18:13:28Z) - A systematic evaluation of methods for cell phenotype classification
using single-cell RNA sequencing data [7.62849213621469]
This study evaluates 13 popular supervised machine learning algorithms to classify cell phenotypes.
The study outcomes showed that ElasticNet with interactions performed best in small and medium data sets.
arXiv Detail & Related papers (2021-10-01T23:24:15Z) - Approximate kNN Classification for Biomedical Data [1.1852406625172218]
Single-cell RNA-seq (scRNA-seq) is an emerging DNA sequencing technology with promising capabilities but significant computational challenges.
We propose the utilization of approximate nearest neighbor search algorithms for the task of kNN classification in scRNA-seq data.
arXiv Detail & Related papers (2020-12-03T18:30:43Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Cell Type Identification from Single-Cell Transcriptomic Data via
Semi-supervised Learning [2.4271601178529063]
Cell type identification from single-cell transcriptomic data is a common goal of single-cell RNA sequencing (scRNAseq) data analysis.
We propose a semi-supervised learning model to use unlabeled scRNAseq cells and limited amount of labeled scRNAseq cells to implement cell identification.
It is observed that the proposed model is able to achieve encouraging performance by learning on very limited amount of labeled scRNAseq cells.
arXiv Detail & Related papers (2020-05-06T19:15:43Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.