Optimization of Genomic Classifiers for Clinical Deployment: Evaluation
of Bayesian Optimization to Select Predictive Models of Acute Infection and
In-Hospital Mortality
- URL: http://arxiv.org/abs/2003.12310v3
- Date: Tue, 13 Oct 2020 09:45:42 GMT
- Title: Optimization of Genomic Classifiers for Clinical Deployment: Evaluation
of Bayesian Optimization to Select Predictive Models of Acute Infection and
In-Hospital Mortality
- Authors: Michael B. Mayhew, Elizabeth Tran, Kirindi Choi, Uros Midic, Roland
Luethy, Nandita Damaraju and Ljubomir Buturovic
- Abstract summary: characterization of a patient's immune response by quantifying expression levels of specific genes from blood represents a potentially more timely and precise means of accomplishing both tasks.
Machine learning methods provide a platform to leverage this 'host response' for development of deployment-ready classification models.
We compare HO approaches for the development of diagnostic classifiers of acute infection and in-hospital mortality from gene expression of 29 diagnostic markers.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Acute infection, if not rapidly and accurately detected, can lead to sepsis,
organ failure and even death. Current detection of acute infection as well as
assessment of a patient's severity of illness are imperfect. Characterization
of a patient's immune response by quantifying expression levels of specific
genes from blood represents a potentially more timely and precise means of
accomplishing both tasks. Machine learning methods provide a platform to
leverage this 'host response' for development of deployment-ready
classification models. Prioritization of promising classifiers is dependent, in
part, on hyperparameter optimization for which a number of approaches including
grid search, random sampling and Bayesian optimization have been shown to be
effective. We compare HO approaches for the development of diagnostic
classifiers of acute infection and in-hospital mortality from gene expression
of 29 diagnostic markers. We take a deployment-centered approach to our
comprehensive analysis, accounting for heterogeneity in our multi-study patient
cohort with our choices of dataset partitioning and hyperparameter optimization
objective as well as assessing selected classifiers in external (as well as
internal) validation. We find that classifiers selected by Bayesian
optimization for in-hospital mortality can outperform those selected by grid
search or random sampling. However, in contrast to previous research: 1)
Bayesian optimization is not more efficient in selecting classifiers in all
instances compared to grid search or random sampling-based methods and 2) we
note marginal gains in classifier performance in only specific circumstances
when using a common variant of Bayesian optimization (i.e. automatic relevance
determination). Our analysis highlights the need for further practical,
deployment-centered benchmarking of HO approaches in the healthcare context.
Related papers
- Dual-stage optimizer for systematic overestimation adjustment applied to
multi-objective genetic algorithms for biomarker selection [0.18648070031379424]
Biomarker identification with feature selection methods can be addressed as a multi-objective problem with trade-offs between predictive ability and parsimony in the number of features.
We propose DOSA-MO, a novel multi-objective optimization wrapper algorithm that learns how the original estimation, its variance, and the feature set size of the solutions predict the overestimation.
arXiv Detail & Related papers (2023-12-27T16:13:14Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Characterizing the Optimal 0-1 Loss for Multi-class Classification with
a Test-time Attacker [57.49330031751386]
We find achievable information-theoretic lower bounds on loss in the presence of a test-time attacker for multi-class classifiers on any discrete dataset.
We provide a general framework for finding the optimal 0-1 loss that revolves around the construction of a conflict hypergraph from the data and adversarial constraints.
arXiv Detail & Related papers (2023-02-21T15:17:13Z) - Analysis of lifelog data using optimal feature selection based
unsupervised logistic regression (OFS-ULR) for chronic disease classification [2.3909933791900326]
Chronic disease classification models are now harnessing the potential of lifelog data to explore better healthcare practices.
This paper is to construct an optimal feature selection-based unsupervised logistic regression model (OFS-ULR) to classify chronic diseases.
arXiv Detail & Related papers (2022-04-04T07:11:26Z) - Multi-class versus One-class classifier in spontaneous speech analysis
oriented to Alzheimer Disease diagnosis [58.720142291102135]
The aim of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from speech signal.
The use of information about outlier and Fractal Dimension features improves the system performance.
arXiv Detail & Related papers (2022-03-21T09:57:20Z) - SelectAugment: Hierarchical Deterministic Sample Selection for Data
Augmentation [72.58308581812149]
We propose an effective approach, dubbed SelectAugment, to select samples to be augmented in a deterministic and online manner.
Specifically, in each batch, we first determine the augmentation ratio, and then decide whether to augment each training sample under this ratio.
In this way, the negative effects of the randomness in selecting samples to augment can be effectively alleviated and the effectiveness of DA is improved.
arXiv Detail & Related papers (2021-12-06T08:38:38Z) - Cervical Cytology Classification Using PCA & GWO Enhanced Deep Features
Selection [1.990876596716716]
Cervical cancer is one of the most deadly and common diseases among women worldwide.
We propose a fully automated framework that utilizes Deep Learning and feature selection.
The framework is evaluated on three publicly available benchmark datasets.
arXiv Detail & Related papers (2021-06-09T08:57:22Z) - Empirical Analysis of Machine Learning Configurations for Prediction of
Multiple Organ Failure in Trauma Patients [7.122236250657051]
Multiple organ failure (MOF) is a life-threatening condition.
We perform quantitative analysis on early MOF prediction with comprehensive machine learning (ML) configurations.
arXiv Detail & Related papers (2021-03-19T17:49:22Z) - Sickle-cell disease diagnosis support selecting the most appropriate
machinelearning method: Towards a general and interpretable approach for
cellmorphology analysis from microscopy images [0.0]
We propose an approach to select the classification method and features, based on the state-of-the-art.
We used samples of patients with sickle-cell disease which can be generalized for other study cases.
arXiv Detail & Related papers (2020-10-09T11:46:38Z) - Predictive Modeling of ICU Healthcare-Associated Infections from
Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling
Approach [55.41644538483948]
This work is focused on both the identification of risk factors and the prediction of healthcare-associated infections in intensive-care units.
The aim is to support decision making addressed at reducing the incidence rate of infections.
arXiv Detail & Related papers (2020-05-07T16:13:12Z) - Noisy Adaptive Group Testing using Bayesian Sequential Experimental
Design [63.48989885374238]
When the infection prevalence of a disease is low, Dorfman showed 80 years ago that testing groups of people can prove more efficient than testing people individually.
Our goal in this paper is to propose new group testing algorithms that can operate in a noisy setting.
arXiv Detail & Related papers (2020-04-26T23:41:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.