An Approach for Clustering Subjects According to Similarities in Cell
Distributions within Biopsies
- URL: http://arxiv.org/abs/2007.00135v2
- Date: Mon, 6 Jul 2020 13:34:15 GMT
- Title: An Approach for Clustering Subjects According to Similarities in Cell
Distributions within Biopsies
- Authors: Yassine El Ouahidi, Matis Feller, Matthieu Talagas, Bastien Pasdeloup
- Abstract summary: We introduce a novel and interpretable methodology to cluster subjects suffering from cancer, based on features extracted from their biopsies.
We illustrate our approach on a database of hematoxylin and eosin (H&E)-stained tissues of subjects suffering from Stage I lung adenocarcinoma.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we introduce a novel and interpretable methodology to cluster
subjects suffering from cancer, based on features extracted from their
biopsies. Contrary to existing approaches, we propose here to capture complex
patterns in the repartitions of their cells using histograms, and compare
subjects on the basis of these repartitions. We describe here our complete
workflow, including creation of the database, cells segmentation and
phenotyping, computation of complex features, choice of a distance function
between features, clustering between subjects using that distance, and survival
analysis of obtained clusters. We illustrate our approach on a database of
hematoxylin and eosin (H&E)-stained tissues of subjects suffering from Stage I
lung adenocarcinoma, where our results match existing knowledge in prognosis
estimation with high confidence.
Related papers
- Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View [49.03501451546763]
We identify the importance of implicit correspondences across biological contexts for exploiting domain-invariant pathological composition.
We propose self-adaptive dynamic distillation to secure instance-aware trade-offs across different model constituents.
arXiv Detail & Related papers (2024-07-14T04:41:16Z) - A deep learning pipeline for cross-sectional and longitudinal multiview
data integration [7.424942475653412]
We have developed a pipeline to integrate cross-sectional and longitudinal data from multiple sources.
It includes variable selection/ranking using linear and nonlinear methods, feature extraction using functional principal component analysis and Euler characteristics, and joint integration and classification using dense feed-forward networks and recurrent neural networks.
We applied this pipeline to cross-sectional and longitudinal multi-omics data (metagenomics, transcriptomics, and metabolomics) from an inflammatory bowel disease (IBD) study and we identified microbial pathways, metabolites, and genes that discriminate by IBD status.
arXiv Detail & Related papers (2023-12-02T22:24:35Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Cell Spatial Analysis in Crohn's Disease: Unveiling Local Cell
Arrangement Pattern with Graph-based Signatures [19.24727395217543]
Crohn's disease (CD) is a chronic and relapsing inflammatory condition that affects segments of the gastrointestinal tract.
understanding the broader morphometry and local cell arrangement beyond cell counting and tissue morphology remains challenging.
We characterize six distinct cell types from H&E images and develop a novel approach for the local spatial signature of each cell.
arXiv Detail & Related papers (2023-08-20T05:26:25Z) - Bayesian outcome-guided multi-view mixture models with applications in
molecular precision medicine [0.0]
Clustering is commonly performed as an initial analysis step for uncovering structure in 'omics datasets.
We propose a multi-view Bayesian mixture model that identifies groups of variables (views"), each of which defines a distinct clustering structure.
We consider applications in stratified medicine, for which our principal goal is to identify clusters of patients that define distinct, clinically actionable disease subtypes.
arXiv Detail & Related papers (2023-03-01T08:32:23Z) - Discriminative Attribution from Counterfactuals [64.94009515033984]
We present a method for neural network interpretability by combining feature attribution with counterfactual explanations.
We show that this method can be used to quantitatively evaluate the performance of feature attribution methods in an objective manner.
arXiv Detail & Related papers (2021-09-28T00:53:34Z) - AI in Osteoporosis [0.0]
This chapter explores and evaluate methods for trabecular bone characterization and osteoporosis diagnosis with increased interest in sparse approximations.
We first describe texture representation and classification techniques, patch-based methods such as Bag of Keypoints, and more recent deep neural networks.
We report cross-validation results on osteoporosis datasets of bone radiographs and compare the results produced by the different categories of methods.
arXiv Detail & Related papers (2021-09-22T01:37:30Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.