scICML: Information-theoretic Co-clustering-based Multi-view Learning
for the Integrative Analysis of Single-cell Multi-omics data
- URL: http://arxiv.org/abs/2205.09523v1
- Date: Thu, 19 May 2022 12:41:55 GMT
- Title: scICML: Information-theoretic Co-clustering-based Multi-view Learning
for the Integrative Analysis of Single-cell Multi-omics data
- Authors: Pengcheng Zeng, Zhixiang Lin
- Abstract summary: We develop a novel information-theoretic co-clustering-based multi-view learning (scICML) method for multi-omics single-cell data integration.
scICML utilizes co-clusterings to aggregate similar features for each view of data and uncover the common clustering pattern for cells.
Our experiments on four real-world datasets demonstrate that scICML improves the overall clustering performance and provides biological insights into the data analysis of peripheral blood mononuclear cells.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Modern high-throughput sequencing technologies have enabled us to profile
multiple molecular modalities from the same single cell, providing
unprecedented opportunities to assay celluar heterogeneity from multiple
biological layers. However, the datasets generated from these technologies tend
to have high level of noise and are highly sparse, bringing challenges to data
analysis. In this paper, we develop a novel information-theoretic
co-clustering-based multi-view learning (scICML) method for multi-omics
single-cell data integration. scICML utilizes co-clusterings to aggregate
similar features for each view of data and uncover the common clustering
pattern for cells. In addition, scICML automatically matches the clusters of
the linked features across different data types for considering the biological
dependency structure across different types of genomic features. Our
experiments on four real-world datasets demonstrate that scICML improves the
overall clustering performance and provides biological insights into the data
analysis of peripheral blood mononuclear cells.
Related papers
- Multi-modal Spatial Clustering for Spatial Transcriptomics Utilizing High-resolution Histology Images [1.3124513975412255]
spatial transcriptomics (ST) enables transcriptome-wide gene expression profiling while preserving spatial context.
Current spatial clustering methods fail to fully integrate high-resolution histology image features with gene expression data.
We propose a novel contrastive learning-based deep learning approach that integrates gene expression data with histology image features.
arXiv Detail & Related papers (2024-10-31T00:32:24Z) - HBIC: A Biclustering Algorithm for Heterogeneous Datasets [0.0]
Biclustering is an unsupervised machine-learning approach aiming to cluster rows and columns simultaneously in a data matrix.
We introduce a biclustering approach called HBIC, capable of discovering meaningful biclusters in complex heterogeneous data.
arXiv Detail & Related papers (2024-08-23T16:48:10Z) - MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Learning Instance (MMIL), an expectation method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z) - UniCell: Universal Cell Nucleus Classification via Prompt Learning [76.11864242047074]
We propose a universal cell nucleus classification framework (UniCell)
It employs a novel prompt learning mechanism to uniformly predict the corresponding categories of pathological images from different dataset domains.
In particular, our framework adopts an end-to-end architecture for nuclei detection and classification, and utilizes flexible prediction heads for adapting various datasets.
arXiv Detail & Related papers (2024-02-20T11:50:27Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Single-cell Multi-view Clustering via Community Detection with Unknown
Number of Clusters [64.31109141089598]
We introduce scUNC, an innovative multi-view clustering approach tailored for single-cell data.
scUNC seamlessly integrates information from different views without the need for a predefined number of clusters.
We conducted a comprehensive evaluation of scUNC using three distinct single-cell datasets.
arXiv Detail & Related papers (2023-11-28T08:34:58Z) - Regression-Based Analysis of Multimodal Single-Cell Data Integration
Strategies [0.0]
Multimodal single-cell technologies enable the simultaneous collection of diverse data types from individual cells.
This study highlights the exceptional performance of Echo State Networks, boasting a remarkable correlation score of 0.94.
These findings hold promise for advancing comprehension of cellular differentiation and function, leveraging the potential of Machine Learning.
arXiv Detail & Related papers (2023-11-21T16:31:27Z) - Mixed Models with Multiple Instance Learning [51.440557223100164]
We introduce MixMIL, a framework integrating Generalized Linear Mixed Models (GLMM) and Multiple Instance Learning (MIL)
Our empirical results reveal that MixMIL outperforms existing MIL models in single-cell datasets.
arXiv Detail & Related papers (2023-11-04T16:42:42Z) - CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self
Attention for multi-omics integration with incomplete multi-omics data [47.2764293508916]
Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding genetic data.
One obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost.
We propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention.
arXiv Detail & Related papers (2023-04-12T00:22:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.