Binary disease prediction using tail quantiles of the distribution of
continuous biomarkers
- URL: http://arxiv.org/abs/2103.12409v1
- Date: Tue, 23 Mar 2021 09:20:10 GMT
- Title: Binary disease prediction using tail quantiles of the distribution of
continuous biomarkers
- Authors: Michiel H.J. Paus, Edwin R. van den Heuvel, Marc J.M. Meddens
- Abstract summary: In the analysis of binary disease classification, single biomarkers might not have significant discriminating power.
We propose quantile based prediction (QBP), a binary classification method that is based on the selection of multiple continuous biomarkers.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the analysis of binary disease classification, single biomarkers might not
have significant discriminating power and multiple biomarkers from a large set
of biomarkers should be selected. Numerous approaches exist, but they merely
work well for mean differences in biomarkers between cases and controls.
Biological processes are however much more heterogeneous, and differences could
also occur in other distributional characteristics (e.g. variances, skewness).
Many machine learning techniques are better capable of utilizing these higher
order distributional differences, sometimes at cost of explainability.
In this study we propose quantile based prediction (QBP), a binary
classification method that is based on the selection of multiple continuous
biomarkers. QBP generates a single score using the tails of the biomarker
distributions for cases and controls. This single score can then be evaluated
by ROC analysis to investigate its predictive power.
The performance of QBP is compared to supervised learning methods using
extensive simulation studies, and two case studies: major depression disorder
and trisomy. Simultaneously, the classification performance of the existing
techniques in relation to each other is assessed. The key strengths of QBP are
the opportunity to select relevant biomarkers and the outstanding
classification performance in the case biomarkers predominantly show variance
differences between cases and controls. When only shifts in means were present
in the biomarkers, QBP obtained an inferior performance. Lastly, QBP proved to
be unbiased in case of absence of disease relevant biomarkers and outperformed
the other methods on the MDD case study.
More research is needed to further optimize QBP, since it has several
opportunities to improve its performance. Here we wanted to introduce the
principle of QBP and show its potential.
Related papers
- Biomarker based Cancer Classification using an Ensemble with Pre-trained Models [2.2436844508175224]
We propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks.
We leverage a meta-trained Hyperfast model for classifying cancer, accomplishing the highest AUC of 0.9929.
We also propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks, achieving an incremental increase in accuracy (0.9464)
arXiv Detail & Related papers (2024-06-14T14:43:59Z) - Spatio-Temporal Similarity Measure based Multi-Task Learning for
Predicting Alzheimer's Disease Progression using MRI Data [18.669489433316127]
We propose a novel-temporal- similarity measure based multi-task learning approach for effectively predicting Alzheimer's disease progression.
Our method also enables performing longitudinal stability selection to identify the changing relationships between biomarkers.
We prove that the synergistic deteriorating biomarkers between cortical volumes or surface areas have a significant effect on the cognitive prediction.
arXiv Detail & Related papers (2023-11-06T21:59:19Z) - Deep Learning Predicts Biomarker Status and Discovers Related
Histomorphology Characteristics for Low-Grade Glioma [21.281553456323998]
Biomarker detection is an indispensable part in the diagnosis and treatment of low-grade glioma (LGG)
We propose an interpretable deep learning pipeline to predict the status of five biomarkers in LGG using only hematoxylin and eosin-stained whole slide images and slide-level biomarker status labels.
Our pipeline not only provides a novel approach for biomarker prediction, enhancing the applicability of molecular treatments for LGG patients but also facilitates the discovery of new mechanisms in molecular functionality and LGG progression.
arXiv Detail & Related papers (2023-10-11T13:05:33Z) - Regression-based Deep-Learning predicts molecular biomarkers from
pathology slides [40.24757332810004]
We developed and evaluated a new self-supervised attention-based weakly supervised regression method that predicts continuous biomarkers directly from images.
Using regression significantly enhances the accuracy of biomarker prediction, while also improving the interpretability of the results over classification.
Our open-source regression approach offers a promising alternative for continuous biomarker analysis in computational pathology.
arXiv Detail & Related papers (2023-04-11T11:43:51Z) - Evaluation of the potential of Near Infrared Hyperspectral Imaging for
monitoring the invasive brown marmorated stink bug [53.682955739083056]
The brown marmorated stink bug (BMSB), Halyomorpha halys, is an invasive insect pest of global importance that damages several crops.
The present study consists in a preliminary evaluation at the laboratory level of Near Infrared Hyperspectral Imaging (NIR-HSI) as a possible technology to detect BMSB specimens.
arXiv Detail & Related papers (2023-01-19T11:37:20Z) - rfPhen2Gen: A machine learning based association study of brain imaging
phenotypes to genotypes [71.1144397510333]
We learned machine learning models to predict SNPs using 56 brain imaging QTs.
SNPs within the known Alzheimer disease (AD) risk gene APOE had lowest RMSE for lasso and random forest.
Random forests identified additional SNPs that were not prioritized by the linear models but are known to be associated with brain-related disorders.
arXiv Detail & Related papers (2022-03-31T20:15:22Z) - Multi-class versus One-class classifier in spontaneous speech analysis
oriented to Alzheimer Disease diagnosis [58.720142291102135]
The aim of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from speech signal.
The use of information about outlier and Fractal Dimension features improves the system performance.
arXiv Detail & Related papers (2022-03-21T09:57:20Z) - Texture Characterization of Histopathologic Images Using Ecological
Diversity Measures and Discrete Wavelet Transform [82.53597363161228]
This paper proposes a method for characterizing texture across histopathologic images with a considerable success rate.
It is possible to quantify the intrinsic properties of such images with promising accuracy on two HI datasets.
arXiv Detail & Related papers (2022-02-27T02:19:09Z) - The Future will be Different than Today: Model Evaluation Considerations
when Developing Translational Clinical Biomarker [4.549866091318765]
We present one evaluation strategy by using leave-one-study-out (LOSO) in place of conventional cross-validation (cv) methods.
To demonstrate the performance of K-fold vs LOSO cv in estimating the effect size of biomarkers, we leveraged data from clinical trials and simulation studies.
arXiv Detail & Related papers (2021-07-13T19:36:25Z) - Cancer Gene Profiling through Unsupervised Discovery [49.28556294619424]
We introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers.
Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm.
Our signature reports promising results on distinguishing immune inflammatory and immune desert tumors.
arXiv Detail & Related papers (2021-02-11T09:04:45Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.