hist2RNA: An efficient deep learning architecture to predict gene
expression from breast cancer histopathology images
- URL: http://arxiv.org/abs/2304.04507v4
- Date: Sun, 7 May 2023 06:26:25 GMT
- Title: hist2RNA: An efficient deep learning architecture to predict gene
expression from breast cancer histopathology images
- Authors: Raktim Kumar Mondol, Ewan K.A. Millar, Peter H Graham, Lois Browne,
Arcot Sowmya, Erik Meijering
- Abstract summary: Deep learning algorithms can effectively extract morphological patterns in digital histopathology images to predict molecular phenotypes quickly and cost-effectively.
We propose a new, computationally efficient approach called hist2RNA inspired by bulk RNA-sequencing techniques to predict the expression of 138 genes.
- Score: 11.822321981275232
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Gene expression can be used to subtype breast cancer with improved prediction
of risk of recurrence and treatment responsiveness over that obtained using
routine immunohistochemistry (IHC). However, in the clinic, molecular profiling
is primarily used for ER+ breast cancer, which is costly, tissue destructive,
requires specialized platforms and takes several weeks to obtain a result. Deep
learning algorithms can effectively extract morphological patterns in digital
histopathology images to predict molecular phenotypes quickly and
cost-effectively. We propose a new, computationally efficient approach called
hist2RNA inspired by bulk RNA-sequencing techniques to predict the expression
of 138 genes (incorporated from six commercially available molecular profiling
tests), including luminal PAM50 subtype, from hematoxylin and eosin (H&E)
stained whole slide images (WSIs). The training phase involves the aggregation
of extracted features for each patient from a pretrained model to predict gene
expression at the patient level using annotated H&E images from The Cancer
Genome Atlas (TCGA, n=335). We demonstrate successful gene prediction on a
held-out test set (n = 160, corr = 0.82 across patients, corr = 0.29 across
genes) and perform exploratory analysis on an external tissue microarray (TMA)
dataset (n = 498) with known IHC and survival information. Our model is able to
predict gene expression and luminal PAM50 subtype (Luminal A versus Luminal B)
on the TMA dataset with prognostic significance for overall survival in
univariate analysis (c-index = 0.56, hazard ratio = 2.16 (95% CI 1.12-3.06), p
< 5 x 10-3), and independent significance in multivariate analysis
incorporating standard clinicopathological variables (c-index = 0.65, hazard
ratio = 1.85 (95% CI 1.30-2.68), p < 5 x 10-3).
Related papers
- TopoTxR: A topology-guided deep convolutional network for breast parenchyma learning on DCE-MRIs [49.69047720285225]
We propose a novel topological approach that explicitly extracts multi-scale topological structures to better approximate breast parenchymal structures.
We empirically validate emphTopoTxR using the VICTRE phantom breast dataset.
Our qualitative and quantitative analyses suggest differential topological behavior of breast tissue in treatment-na"ive imaging.
arXiv Detail & Related papers (2024-11-05T19:35:10Z) - Survival Prediction Across Diverse Cancer Types Using Neural Networks [40.392772795903795]
Gastric cancer and Colon adenocarcinoma represent widespread and challenging malignancies.
Medical community has embraced the 5-year survival rate as a vital metric for estimating patient outcomes.
This study introduces a pioneering approach to enhance survival prediction models for gastric and Colon adenocarcinoma patients.
arXiv Detail & Related papers (2024-04-11T21:47:13Z) - CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H\&E stained images [42.771819949806655]
We introduce CIMIL-CRC', a framework that solves the MSI/MSS MIL problem by efficiently combining a pre-trained feature extraction model with principal component analysis (PCA) to aggregate information from all patches.
We assessed our CIMIL-CRC method using the average area under the curve (AUC) from a 5-fold cross-validation experimental setup for model development on the TCGA-CRC-DX cohort.
arXiv Detail & Related papers (2024-01-29T12:56:11Z) - Prediction of Breast Cancer Recurrence Risk Using a Multi-Model Approach
Integrating Whole Slide Imaging and Clinicopathologic Features [0.6679306163028237]
The aim of this study was to develop a multi-model approach integrating the analysis of whole slide images and clinicopathologic data to predict associated breast cancer recurrence risks.
The proposed novel methodology uses convolutional neural networks for feature extraction and vision transformers for contextual aggregation.
arXiv Detail & Related papers (2024-01-28T23:33:56Z) - Neural Network-Based Histologic Remission Prediction In Ulcerative
Colitis [38.150634108667774]
Histologic remission is a new therapeutic target in ulcerative colitis (UC)
Endocytoscopy (EC) is a novel ultra-high magnification endoscopic technique.
We propose a neural network model that can assess histological disease activity in EC images.
arXiv Detail & Related papers (2023-08-28T15:54:14Z) - Breast Cancer Histopathology Image based Gene Expression Prediction
using Spatial Transcriptomics data and Deep Learning [3.583756449759971]
We present BrST-Net, a deep learning framework for predicting gene expression from histopathology images.
We trained and evaluated 10 state-of-the-art deep learning models without utilizing pretrained weights for the prediction of 250 genes.
Our methodology outperforms previous studies, with 237 genes identified with positive correlation, including 24 genes with a median correlation coefficient greater than 0.50.
arXiv Detail & Related papers (2023-03-17T14:03:40Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Predicting Molecular Phenotypes with Single Cell RNA Sequencing Data: an
Assessment of Unsupervised Machine Learning Models [0.0]
This study is to evaluate unsupervised machine learning on classifying treatment-resistant phenotypes in heterogeneous tumors.
scRNAseq quantifies mRNA in cells and characterizes cell phenotypes.
clusters generated from this pipeline can be used to understand cancer cell behavior and malignant growth.
arXiv Detail & Related papers (2021-08-11T05:30:37Z) - Transcriptome-wide prediction of prostate cancer gene expression from
histopathology images using co-expression based convolutional neural networks [0.8874479658912061]
We propose a new, computationally efficient approach for disease specific modelling of relationships between morphology and gene expression.
We conducted the first transcriptome-wide analysis in prostate cancer, using CNNs to predict bulk RNA-sequencing estimates.
arXiv Detail & Related papers (2021-04-19T13:50:25Z) - Comparison of Machine Learning Classifiers to Predict Patient Survival
and Genetics of GBM: Towards a Standardized Model for Clinical Implementation [44.02622933605018]
Radiomic models have been shown to outperform clinical data for outcome prediction in glioblastoma (GBM)
We aimed to compare nine machine learning classifiers to predict overall survival (OS), isocitrate dehydrogenase (IDH) mutation, O-6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation, epidermal growth factor receptor (EGFR) VII amplification and Ki-67 expression in GBM patients.
xGB obtained maximum accuracy for OS (74.5%), AB for IDH mutation (88%), MGMT methylation (71,7%), Ki-67 expression (86,6%), and EGFR amplification (81,
arXiv Detail & Related papers (2021-02-10T15:10:37Z) - Predicting molecular phenotypes from histopathology images: a
transcriptome-wide expression-morphology analysis in breast cancer [1.3758771225117674]
We report the first transcriptome-wide Expression-MOrphology (EMO) analysis in breast cancer.
Gene-specific models were optimised and validated for prediction of mRNA expression.
Predictions for 9,334 genes were significantly associated with RNA-sequencing estimates.
arXiv Detail & Related papers (2020-09-18T16:27:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.