A biology-driven deep generative model for cell-type annotation in
cytometry
- URL: http://arxiv.org/abs/2208.05745v2
- Date: Fri, 21 Apr 2023 15:52:06 GMT
- Title: A biology-driven deep generative model for cell-type annotation in
cytometry
- Authors: Quentin Blampey, Nad\`ege Bercovici, Charles-Antoine Dutertre,
Isabelle Pic, Fabrice Andr\'e, Joana Mourato Ribeiro, and Paul-Henry
Courn\`ede
- Abstract summary: We introduce Scyan, a Single-cell Cytometry Network that automatically annotates cell types using only prior expert knowledge.
Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable.
In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cytometry enables precise single-cell phenotyping within heterogeneous
populations. These cell types are traditionally annotated via manual gating,
but this method suffers from a lack of reproducibility and sensitivity to
batch-effect. Also, the most recent cytometers - spectral flow or mass
cytometers - create rich and high-dimensional data whose analysis via manual
gating becomes challenging and time-consuming. To tackle these limitations, we
introduce Scyan (https://github.com/MICS-Lab/scyan), a Single-cell Cytometry
Annotation Network that automatically annotates cell types using only prior
expert knowledge about the cytometry panel. We demonstrate that Scyan
significantly outperforms the related state-of-the-art models on multiple
public datasets while being faster and interpretable. In addition, Scyan
overcomes several complementary tasks such as batch-effect removal,
debarcoding, and population discovery. Overall, this model accelerates and
eases cell population characterisation, quantification, and discovery in
cytometry.
Related papers
- Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen [76.02070962797794]
We present Cell Flow for Generation, a flow-based conditional generative model for multi-modal single-cell counts.
Our results suggest improved recovery of crucial biological data characteristics while accounting for novel generative tasks.
arXiv Detail & Related papers (2024-07-16T14:05:03Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Mixed Models with Multiple Instance Learning [51.440557223100164]
We introduce MixMIL, a framework integrating Generalized Linear Mixed Models (GLMM) and Multiple Instance Learning (MIL)
Our empirical results reveal that MixMIL outperforms existing MIL models in single-cell datasets.
arXiv Detail & Related papers (2023-11-04T16:42:42Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Tertiary Lymphoid Structures Generation through Graph-based Diffusion [54.37503714313661]
In this work, we leverage state-of-the-art graph-based diffusion models to generate biologically meaningful cell-graphs.
We show that the adopted graph diffusion model is able to accurately learn the distribution of cells in terms of their tertiary lymphoid structures (TLS) content.
arXiv Detail & Related papers (2023-10-10T14:37:17Z) - Machine Learning for Flow Cytometry Data Analysis [0.0]
Flow cytometers can rapidly analyse tens of thousands of cells at the same time while also measuring multiple parameters from a single cell.
Researchers need to be able to distinguish interesting-looking cell populations manually in multi-dimensional data collected from millions of cells.
Three representative automated clustering algorithms are selected to be applied, compared and evaluated by completely and partially automated gating.
arXiv Detail & Related papers (2023-03-16T00:43:46Z) - Learning Causal Representations of Single Cells via Sparse Mechanism
Shift Modeling [3.2435888122704037]
We propose a deep generative model of single-cell gene expression data for which each perturbation is treated as an intervention targeting an unknown, but sparse, subset of latent variables.
We benchmark these methods on simulated single-cell data to evaluate their performance at latent units recovery, causal target identification and out-of-domain generalization.
arXiv Detail & Related papers (2022-11-07T15:47:40Z) - Distribution-based Sketching of Single-Cell Samples [6.904244323294012]
We propose a novel sketching approach based on Kernel Herding that selects a limited subsample of all cells while preserving the underlying frequencies of immune cell-types.
We tested our approach on three flow and mass datasets and on one single-cell RNA sequencing dataset.
arXiv Detail & Related papers (2022-06-30T19:43:06Z) - Machine learning based lens-free imaging technique for field-portable
cytometry [0.0]
The performance of our proposed method shows an increase in accuracy >98% along with the signal enhancement of >5 dB for most of the cell types.
The model is adaptive to learn new type of samples within a few learning iterations and able to successfully classify the newly introduced sample.
arXiv Detail & Related papers (2022-03-02T07:09:29Z) - Interpretable Single-Cell Set Classification with Kernel Mean Embeddings [14.686560033030101]
Kernel Mean Embedding encodes the cellular landscape of each profiled biological sample.
We train a simple linear classifier and achieve state-of-the-art classification accuracy on 3 flow and mass datasets.
arXiv Detail & Related papers (2022-01-18T21:40:36Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.