Revolutionizing Single Cell Analysis: The Power of Large Language Models
for Cell Type Annotation
- URL: http://arxiv.org/abs/2304.02697v1
- Date: Wed, 5 Apr 2023 18:45:54 GMT
- Title: Revolutionizing Single Cell Analysis: The Power of Large Language Models
for Cell Type Annotation
- Authors: Zehua Zeng and Hongwu Du
- Abstract summary: Large language models such as ChatGPT and New Bing provide accurate annotations of cell types.
By using ChatGPT to annotate single cell data, we can relate rare cell type to their function.
This can have important applications in understanding cancer progression, mammalian development, and stem cell differentiation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In recent years, single cell RNA sequencing has become a widely used
technique to study cellular diversity and function. However, accurately
annotating cell types from single cell data has been a challenging task, as it
requires extensive knowledge of cell biology and gene function. The emergence
of large language models such as ChatGPT and New Bing in 2023 has
revolutionized this process by integrating the scientific literature and
providing accurate annotations of cell types. This breakthrough enables
researchers to conduct literature reviews more efficiently and accurately, and
can potentially uncover new insights into cell type annotation. By using
ChatGPT to annotate single cell data, we can relate rare cell type to their
function and reveal specific differentiation trajectories of cell subtypes that
were previously overlooked. This can have important applications in
understanding cancer progression, mammalian development, and stem cell
differentiation, and can potentially lead to the discovery of key cells that
interrupt the differentiation pathway and solve key problems in the life
sciences. Overall, the future of cell type annotation in single cell data looks
promising and the Large Language model will be an important milestone in the
history of single cell analysis.
Related papers
- How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities [46.671834972945874]
We propose a vision of leveraging advances in AI to construct virtual cells.
We discuss desired capabilities of such AI Virtual Cells, including generating universal representations of biological entities.
We envision a future where AI Virtual Cells help identify new drug targets, predict cellular responses to perturbations, as well as scale hypothesis exploration.
arXiv Detail & Related papers (2024-09-18T02:41:50Z) - Cell-ontology guided transcriptome foundation model [18.51941953027685]
We present textbfsingle textbfcell, textbfCell-textbfontology guided TFM scCello.
Our TFM demonstrates competitive and transferability performance over the existing TFMs on biologically important tasks.
arXiv Detail & Related papers (2024-08-22T13:15:49Z) - LangCell: Language-Cell Pre-training for Cell Identity Understanding [3.6518971609937068]
We introduce LangCell, a unified representation of single-cell data and natural language during the pre-training phase.
Results show that LangCell is the only single-cell PLM that can work effectively in zero-shot cell identity understanding scenarios.
arXiv Detail & Related papers (2024-05-09T10:04:05Z) - UniCell: Universal Cell Nucleus Classification via Prompt Learning [76.11864242047074]
We propose a universal cell nucleus classification framework (UniCell)
It employs a novel prompt learning mechanism to uniformly predict the corresponding categories of pathological images from different dataset domains.
In particular, our framework adopts an end-to-end architecture for nuclei detection and classification, and utilizes flexible prediction heads for adapting various datasets.
arXiv Detail & Related papers (2024-02-20T11:50:27Z) - scInterpreter: Training Large Language Models to Interpret scRNA-seq
Data for Cell Type Annotation [15.718901418627366]
This research focuses on how to train and adapt the Large Language Model with the capability to interpret and distinguish cell types in single-cell RNA sequencing data.
arXiv Detail & Related papers (2024-02-18T05:39:00Z) - ChatCell: Facilitating Single-Cell Analysis with Natural Language [40.4429032376233]
ChatCell is a tool for facilitating single-cell analysis with natural language.
ChatCell has acquired profound expertise in single-cell biology.
Our project homepage is available at https://zjunlp.io/project/ChatCell.
arXiv Detail & Related papers (2024-02-13T09:06:14Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Mixed Models with Multiple Instance Learning [51.440557223100164]
We introduce MixMIL, a framework integrating Generalized Linear Mixed Models (GLMM) and Multiple Instance Learning (MIL)
Our empirical results reveal that MixMIL outperforms existing MIL models in single-cell datasets.
arXiv Detail & Related papers (2023-11-04T16:42:42Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - A biology-driven deep generative model for cell-type annotation in
cytometry [0.0]
We introduce Scyan, a Single-cell Cytometry Network that automatically annotates cell types using only prior expert knowledge.
Scyan significantly outperforms the related state-of-the-art models on multiple public datasets while being faster and interpretable.
In addition, Scyan overcomes several complementary tasks such as batch-effect removal, debarcoding, and population discovery.
arXiv Detail & Related papers (2022-08-11T10:50:44Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.