Uncertainty Quantification for Atlas-Level Cell Type Transfer
- URL: http://arxiv.org/abs/2211.03793v1
- Date: Mon, 7 Nov 2022 16:08:28 GMT
- Title: Uncertainty Quantification for Atlas-Level Cell Type Transfer
- Authors: Jan Engelmann, Leon Hetzel, Giovanni Palla, Lisa Sikkema, Malte
Luecken, Fabian Theis
- Abstract summary: We introduce uncertainty quantification methods for cell type classification on single-cell reference atlases.
We benchmark four model classes and show that currently used models lack calibration, robustness, and actionable uncertainty scores.
We demonstrate how models that quantify uncertainty are better suited to detect unseen cell types in the setting of atlas-level cell type transfer.
- Score: 0.4893345190925178
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Single-cell reference atlases are large-scale, cell-level maps that capture
cellular heterogeneity within an organ using single cell genomics. Given their
size and cellular diversity, these atlases serve as high-quality training data
for the transfer of cell type labels to new datasets. Such label transfer,
however, must be robust to domain shifts in gene expression due to measurement
technique, lab specifics and more general batch effects. This requires methods
that provide uncertainty estimates on the cell type predictions to ensure
correct interpretation. Here, for the first time, we introduce uncertainty
quantification methods for cell type classification on single-cell reference
atlases. We benchmark four model classes and show that currently used models
lack calibration, robustness, and actionable uncertainty scores. Furthermore,
we demonstrate how models that quantify uncertainty are better suited to detect
unseen cell types in the setting of atlas-level cell type transfer.
Related papers
- Lower-dimensional projections of cellular expression improves cell type classification from single-cell RNA sequencing [12.66369956714212]
Single-cell RNA sequencing (scRNA-seq) enables the study of cellular diversity at single cell level.
Various statistical, machine and deep learning-based methods have been proposed for cell-type classification.
In this work, we proposed a reference-based method for cell type classification, called EnProCell.
arXiv Detail & Related papers (2024-10-13T19:01:38Z) - MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Learning Instance (MMIL), an expectation method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z) - UniCell: Universal Cell Nucleus Classification via Prompt Learning [76.11864242047074]
We propose a universal cell nucleus classification framework (UniCell)
It employs a novel prompt learning mechanism to uniformly predict the corresponding categories of pathological images from different dataset domains.
In particular, our framework adopts an end-to-end architecture for nuclei detection and classification, and utilizes flexible prediction heads for adapting various datasets.
arXiv Detail & Related papers (2024-02-20T11:50:27Z) - scBiGNN: Bilevel Graph Representation Learning for Cell Type
Classification from Single-cell RNA Sequencing Data [62.87454293046843]
Graph neural networks (GNNs) have been widely used for automatic cell type classification.
scBiGNN comprises two GNN modules to identify cell types.
scBiGNN outperforms a variety of existing methods for cell type classification from scRNA-seq data.
arXiv Detail & Related papers (2023-12-16T03:54:26Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Kernel-Based Testing for Single-Cell Differential Analysis [23.769396655341204]
We propose a kernel-testing framework for non-linear cell-wise distribution comparison.
Our method allows feature-wise and global transcriptome/epigenome comparisons, revealing cell population heterogeneities.
arXiv Detail & Related papers (2023-07-17T14:10:01Z) - A biology-driven deep generative model for cell-type annotation in
cytometry [0.0]
We introduce Scyan, a Single-cell Cytometry Network that automatically annotates cell types using only prior expert knowledge.
Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable.
In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery.
arXiv Detail & Related papers (2022-08-11T10:50:44Z) - Seamless Iterative Semi-Supervised Correction of Imperfect Labels in
Microscopy Images [57.42492501915773]
In-vitro tests are an alternative to animal testing for the toxicity of medical devices.
Human fatigue plays a role in error making, making the use of deep learning appealing.
We propose Seamless Iterative Semi-Supervised correction of Imperfect labels (SISSI)
Our method successfully provides an adaptive early learning correction technique for object detection.
arXiv Detail & Related papers (2022-08-05T18:52:20Z) - Interpretable Single-Cell Set Classification with Kernel Mean Embeddings [14.686560033030101]
Kernel Mean Embedding encodes the cellular landscape of each profiled biological sample.
We train a simple linear classifier and achieve state-of-the-art classification accuracy on 3 flow and mass datasets.
arXiv Detail & Related papers (2022-01-18T21:40:36Z) - Multi-modal Self-supervised Pre-training for Regulatory Genome Across
Cell Types [75.65676405302105]
We propose a simple yet effective approach for pre-training genome data in a multi-modal and self-supervised manner, which we call GeneBERT.
We pre-train our model on the ATAC-seq dataset with 17 million genome sequences.
arXiv Detail & Related papers (2021-10-11T12:48:44Z) - Distinguishing Cell Phenotype Using Cell Epigenotype [0.0]
Relationship between microscopic observations and macroscopic behavior is a fundamental open question in biophysical systems.
We develop a unified approach that---in contrast with existing methods---predicts cell type from macromolecular data even when accounting for the scale of human tissue diversity and limitations in the available data.
arXiv Detail & Related papers (2020-03-20T18:00:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.