Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker
Identification with PathFormer
- URL: http://arxiv.org/abs/2402.07268v1
- Date: Sun, 11 Feb 2024 18:23:54 GMT
- Title: Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker
Identification with PathFormer
- Authors: Zehao Dong, Qihang Zhao, Philip R.O. Payne, Michael A Province, Carlos
Cruchaga, Muhan Zhang, Tianyu Zhao, Yixin Chen, Fuhai Li
- Abstract summary: Graph neural networks (GNNs) have been the dominant deep learning model for analyzing graph-structured data.
The root of the challenges is the unique graph structure of biological signaling pathways.
We present a novel GNN model architecture, named PathFormer, which integrates signaling network, priori knowledge and omics data to rank biomarkers and predict disease diagnosis.
- Score: 32.26944736442376
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Biomarker identification is critical for precise disease diagnosis and
understanding disease pathogenesis in omics data analysis, like using fold
change and regression analysis. Graph neural networks (GNNs) have been the
dominant deep learning model for analyzing graph-structured data. However, we
found two major limitations of existing GNNs in omics data analysis, i.e.,
limited-prediction (diagnosis) accuracy and limited-reproducible biomarker
identification capacity across multiple datasets. The root of the challenges is
the unique graph structure of biological signaling pathways, which consists of
a large number of targets and intensive and complex signaling interactions
among these targets. To resolve these two challenges, in this study, we
presented a novel GNN model architecture, named PathFormer, which
systematically integrate signaling network, priori knowledge and omics data to
rank biomarkers and predict disease diagnosis. In the comparison results,
PathFormer outperformed existing GNN models significantly in terms of highly
accurate prediction capability ( 30% accuracy improvement in disease diagnosis
compared with existing GNN models) and high reproducibility of biomarker
ranking across different datasets. The improvement was confirmed using two
independent Alzheimer's Disease (AD) and cancer transcriptomic datasets. The
PathFormer model can be directly applied to other omics data analysis studies.
Related papers
- Comparative Analysis of Multi-Omics Integration Using Advanced Graph Neural Networks for Cancer Classification [40.45049709820343]
Multi-omics data integration poses significant challenges due to the high dimensionality, data complexity, and distinct characteristics of various omics types.
This study evaluates three graph neural network architectures for multi-omics (MO) integration based on graph-convolutional networks (GCN), graph-attention networks (GAT), and graph-transformer networks (GTN)
arXiv Detail & Related papers (2024-10-05T16:17:44Z) - The Heterophilic Graph Learning Handbook: Benchmarks, Models, Theoretical Analysis, Applications and Challenges [101.83124435649358]
Homophily principle, ie nodes with the same labels or similar attributes are more likely to be connected.
Recent work has identified a non-trivial set of datasets where GNN's performance compared to the NN's is not satisfactory.
arXiv Detail & Related papers (2024-07-12T18:04:32Z) - Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports [51.45762396192655]
Multimodal large language models (MLLMs) have recently transformed many domains, significantly affecting the medical field. Notably, Gemini-Vision-series (Gemini) and GPT-4-series (GPT-4) models have epitomized a paradigm shift in Artificial General Intelligence for computer vision.
This study evaluated the performance of the Gemini, GPT-4, and 4 popular large models for an exhaustive evaluation across 14 medical imaging datasets.
arXiv Detail & Related papers (2024-07-08T09:08:42Z) - Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review [4.799269666410891]
We provide an overview of how GNN and model explainability techniques have been applied on fMRI datasets for disorder prediction tasks.
We find that while most studies have performant models, salient features highlighted in these studies vary greatly across studies on the same disorder.
We suggest establishing new standards that are based on objective evaluation metrics to determine the robustness of these potential biomarkers.
arXiv Detail & Related papers (2024-05-01T15:29:55Z) - An end-to-end framework for gene expression classification by
integrating a background knowledge graph: application to cancer prognosis
prediction [1.5484595752241122]
We proposed an end-to-end framework to handle secondary data to construct a classification model for primary data.
We applied this framework to cancer prognosis prediction using gene expression data and a biological network.
arXiv Detail & Related papers (2023-06-29T11:20:47Z) - Quantifying the Reproducibility of Graph Neural Networks using
Multigraph Brain Data [0.0]
Graph neural networks (GNNs) have witnessed an unprecedented proliferation in tackling several problems in computer vision, computer-aided diagnosis, and related fields.
While prior studies have focused on boosting the model accuracy, quantifying the most discriminative features identified by GNNs is still an intact problem that yields concerns about their reliability in clinical applications in particular.
We propose for the first time, a framework for GNN assessment via the most discriminative features (i.e., biomarkers) shared between different models. To ascertain the soundness of our framework, the assessment embraces variations of different factors such as training strategies and
arXiv Detail & Related papers (2021-09-06T05:31:02Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - A Graph Gaussian Embedding Method for Predicting Alzheimer's Disease
Progression with MEG Brain Networks [59.15734147867412]
Characterizing the subtle changes of functional brain networks associated with Alzheimer's disease (AD) is important for early diagnosis and prediction of disease progression.
We developed a new deep learning method, termed multiple graph Gaussian embedding model (MG2G)
We used MG2G to detect the intrinsic latent dimensionality of MEG brain networks, predict the progression of patients with mild cognitive impairment (MCI) to AD, and identify brain regions with network alterations related to MCI.
arXiv Detail & Related papers (2020-05-08T02:29:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.