All You Need is Color: Image based Spatial Gene Expression Prediction
using Neural Stain Learning
- URL: http://arxiv.org/abs/2108.10446v2
- Date: Thu, 26 Aug 2021 10:45:21 GMT
- Title: All You Need is Color: Image based Spatial Gene Expression Prediction
using Neural Stain Learning
- Authors: Muhammad Dawood, Kim Branson, Nasir M. Rajpoot, Fayyaz ul Amir Afsar
Minhas
- Abstract summary: We propose a "stain-aware" machine learning approach for prediction of spatial transcriptomic gene expression profiles.
We have found that the gene expression predictions from the proposed approach show higher correlations with true expression values obtained through sequencing, for a larger set of genes, than other approaches.
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: "Is it possible to predict expression levels of different genes at a given
spatial location in the routine histology image of a tumor section by modeling
its stain absorption characteristics?" In this work, we propose a "stain-aware"
machine learning approach for prediction of spatial transcriptomic gene
expression profiles using a digital pathology image of a routine Hematoxylin &
Eosin (H&E) histology section. Unlike recent deep learning methods which are
used for gene expression prediction, our proposed approach termed Neural Stain
Learning (NSL) explicitly models the association of stain absorption
characteristics of the tissue with gene expression patterns in spatial
transcriptomics by learning a problem-specific stain deconvolution matrix in an
end-to-end manner. The proposed method with only 11 trainable weight parameters
outperforms both classical regression models with cellular composition and
morphological features as well as deep learning methods. We have found that the
gene expression predictions from the proposed approach show higher correlations
with true expression values obtained through sequencing for a larger set of
genes in comparison to other approaches.
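The idea described in the abstract can be illustrated with a minimal sketch of a forward pass: RGB pixels are converted to optical density (Beer-Lambert), multiplied by a 3x3 stain deconvolution matrix, pooled over the patch, and mapped to one gene's expression by a linear head. The function names, the mean pooling, and the linear head are assumptions made for illustration and are not taken from the paper; in particular, this parameterization has 13 weights and is not meant to reproduce the 11-parameter model, and no training loop is shown. The Pearson helper mirrors the correlation-based evaluation mentioned in the abstract.

```python
import numpy as np

def rgb_to_optical_density(rgb):
    """Beer-Lambert transform: 8-bit RGB intensities -> optical density."""
    rgb = np.asarray(rgb, dtype=np.float64)
    # +1 avoids log(0); pure white (255) maps to an optical density of 0.
    return -np.log((rgb + 1.0) / 256.0)

def predict_expression(patch_rgb, stain_matrix, head_weights, head_bias):
    """Hypothetical forward pass for one gene on one (H, W, 3) image patch.

    stain_matrix : (3, 3) deconvolution matrix (learnable in a real model)
    head_weights : (3,) linear weights over pooled stain channels (assumed)
    head_bias    : scalar bias (assumed)
    """
    od = rgb_to_optical_density(patch_rgb).reshape(-1, 3)  # (N pixels, 3)
    concentrations = od @ stain_matrix                      # per-pixel stain channels
    pooled = concentrations.mean(axis=0)                    # average over the patch
    return float(pooled @ head_weights + head_bias)

def pearson(x, y):
    """Pearson correlation between predicted and measured expression."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    xc, yc = x - x.mean(), y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-ins for image patches and sequencing values (not real data).
    patches = rng.integers(0, 256, size=(5, 8, 8, 3), dtype=np.uint8)
    measured = rng.normal(size=5)
    preds = [predict_expression(p, np.eye(3), np.ones(3), 0.0) for p in patches]
    print("correlation on random data:", pearson(preds, measured))
```

In an end-to-end setting, `stain_matrix`, `head_weights`, and `head_bias` would be fitted jointly by gradient descent against the measured expression values, which is what makes the deconvolution matrix problem-specific rather than fixed in advance.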
Related papers
- Long-range gene expression prediction with token alignment of large language model [37.10820914895689]
We introduce Genetic sequence Token Alignment (GTA), which aligns genetic sequence features with natural language tokens.
GTA learns the regulatory grammar and allows us to further incorporate gene-specific human annotations as prompts.
GTA represents a powerful and novel cross-modal approach to gene expression prediction by utilizing a pretrained language model.
arXiv Detail & Related papers (2024-10-02T02:42:29Z) - Distance-Preserving Generative Modeling of Spatial Transcriptomics [0.0]
We introduce a class of distance-preserving generative models for spatial transcriptomics.
We use the provided spatial information to regularize the learned representation space of gene expressions to have a similar pair-wise distance structure.
Our framework grants compatibility with any variational-inference-based generative models for gene expression modeling.
arXiv Detail & Related papers (2024-08-01T21:04:27Z) - What makes for good morphology representations for spatial omics? [1.4298574812790055]
We introduce a framework for categorizing spatial omics-morphology combination methods.
By translation we mean finding morphological features that spatially correlate with gene expression patterns.
By integration we mean finding morphological features that spatially complement gene expression patterns.
arXiv Detail & Related papers (2024-07-30T08:52:51Z) - Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification [119.13058298388101]
We develop a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances.
BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules.
BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules.
arXiv Detail & Related papers (2024-06-05T06:42:27Z) - VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling [60.91599380893732]
VQDNA is a general-purpose framework that renovates genome tokenization from the perspective of genome vocabulary learning.
By leveraging vector-quantized codebooks as learnable vocabulary, VQDNA can adaptively tokenize genomes into pattern-aware embeddings.
arXiv Detail & Related papers (2024-05-13T20:15:03Z) - Efficient and Scalable Fine-Tune of Language Models for Genome
Understanding [49.606093223945734]
We present Lingo: Language prefix fIne-tuning for GenOmes.
Unlike DNA foundation models, Lingo strategically leverages natural language foundation models' contextual cues.
Lingo further accommodates numerous downstream fine-tune tasks by an adaptive rank sampling method.
arXiv Detail & Related papers (2024-02-12T21:40:45Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Unsupervised ensemble-based phenotyping helps enhance the
discoverability of genes related to heart morphology [57.25098075813054]
We propose a new framework for gene discovery entitled Unsupervised Phenotype Ensembles.
It builds a redundant yet highly expressive representation by pooling a set of phenotypes learned in an unsupervised manner.
These phenotypes are then analyzed via genome-wide association studies (GWAS), retaining only highly confident and stable associations.
arXiv Detail & Related papers (2023-01-07T18:36:44Z) - Attention-based Interpretable Regression of Gene Expression in Histology [0.0]
Interpretability of deep learning is widely used to evaluate the reliability of medical imaging models.
We show that interpretability can reveal connections between the microscopic appearance of cancer tissue and its gene expression profiling.
arXiv Detail & Related papers (2022-08-29T07:30:33Z) - Contrastive learning-based computational histopathology predict
differential expression of cancer driver genes [13.167222116204226]
HistCode is a self-supervised contrastive learning framework to infer differential gene expressions from whole slide images.
Our experiments showed that our method outperformed other state-of-the-art models in tumor diagnosis tasks.
arXiv Detail & Related papers (2022-04-25T23:21:33Z) - rfPhen2Gen: A machine learning based association study of brain imaging
phenotypes to genotypes [71.1144397510333]
We trained machine learning models to predict SNPs using 56 brain imaging quantitative traits (QTs).
SNPs within the known Alzheimer disease (AD) risk gene APOE had the lowest RMSE for lasso and random forest.
Random forests identified additional SNPs that were not prioritized by the linear models but are known to be associated with brain-related disorders.
arXiv Detail & Related papers (2022-03-31T20:15:22Z)