Global and Local Interpretation of black-box Machine Learning models to
determine prognostic factors from early COVID-19 data
- URL: http://arxiv.org/abs/2109.05087v1
- Date: Fri, 10 Sep 2021 20:00:47 GMT
- Title: Global and Local Interpretation of black-box Machine Learning models to
determine prognostic factors from early COVID-19 data
- Authors: Ananya Jana, Carlos D. Minacapelli, Vinod Rustgi, Dimitris Metaxas
- Abstract summary: We analyze COVID-19 blood work data with some of the popular machine learning models.
We employ state-of-the-art post-hoc local interpretability techniques and symbolic metamodeling to draw interpretable conclusions.
We explore one of the most recent techniques called symbolic metamodeling to find the mathematical expression of the machine learning models for COVID-19.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The COVID-19 corona virus has claimed 4.1 million lives, as of July 24, 2021.
A variety of machine learning models have been applied to related data to
predict important factors such as the severity of the disease, infection rate
and discover important prognostic factors. Often the usefulness of the findings
from the use of these techniques is reduced due to lack of method
interpretability. Some recent progress made on the interpretability of machine
learning models has the potential to unravel more insights while using
conventional machine learning models. In this work, we analyze COVID-19 blood
work data with some of the popular machine learning models; then we employ
state-of-the-art post-hoc local interpretability techniques(e.g.- SHAP, LIME),
and global interpretability techniques(e.g. - symbolic metamodeling) to the
trained black-box models to draw interpretable conclusions. In the gamut of
machine learning algorithms, regressions remain one of the simplest and most
explainable models with clear mathematical formulation. We explore one of the
most recent techniques called symbolic metamodeling to find the mathematical
expression of the machine learning models for COVID-19. We identify Acute
Kidney Injury (AKI), initial Albumin level (ALBI), Aspartate aminotransferase
(ASTI), Total Bilirubin initial(TBILI) and D-Dimer initial (DIMER) as major
prognostic factors of the disease severity. Our contributions are- (i) uncover
the underlying mathematical expression for the black-box models on COVID-19
severity prediction task (ii) we are the first to apply symbolic metamodeling
to this task, and (iii) discover important features and feature interactions.
Related papers
- Robust and Interpretable Medical Image Classifiers via Concept
Bottleneck Models [49.95603725998561]
We propose a new paradigm to build robust and interpretable medical image classifiers with natural language concepts.
Specifically, we first query clinical concepts from GPT-4, then transform latent image features into explicit concepts with a vision-language model.
arXiv Detail & Related papers (2023-10-04T21:57:09Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - Unifying Epidemic Models with Mixtures [28.771032745045428]
The COVID-19 pandemic has emphasized the need for a robust understanding of epidemic models.
Here, we introduce a simple mixture-based model which bridges the two approaches.
Although the model is non-mechanistic, we show that it arises as the natural outcome of a process based on a networked SIR framework.
arXiv Detail & Related papers (2022-01-07T19:42:05Z) - Modelling COVID-19 Pandemic Dynamics Using Transparent, Interpretable,
Parsimonious and Simulatable (TIPS) Machine Learning Models: A Case Study
from Systems Thinking and System Identification Perspectives [1.4061680807550718]
The present study proposes using systems engineering and system identification approach to build transparent, interpretable, parsimonious and simulatable (TIPS) dynamic machine learning models.
TIPS models are developed based on the well-known NARMAX (Nonlinear AutoRegressive Moving Average with eXogenous inputs) model, which can help better understand the COVID-19 pandemic dynamics.
arXiv Detail & Related papers (2021-11-01T08:42:37Z) - A multi-stage machine learning model on diagnosis of esophageal
manometry [50.591267188664666]
The framework includes deep-learning models at the swallow-level stage and feature-based machine learning models at the study-level stage.
This is the first artificial-intelligence-style model to automatically predict CC diagnosis of HRM study from raw multi-swallow data.
arXiv Detail & Related papers (2021-06-25T20:09:23Z) - Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease
Progression [71.7560927415706]
latent hybridisation model (LHM) integrates a system of expert-designed ODEs with machine-learned Neural ODEs to fully describe the dynamics of the system.
We evaluate LHM on synthetic data as well as real-world intensive care data of COVID-19 patients.
arXiv Detail & Related papers (2021-06-05T11:42:45Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Combining Graph Neural Networks and Spatio-temporal Disease Models to
Predict COVID-19 Cases in Germany [0.0]
Several experts have called for the necessity to account for human mobility to explain the spread of COVID-19.
Most statistical or epidemiological models cannot directly incorporate unstructured data sources, including data that may encode human mobility.
We propose a trade-off between both research directions and present a novel learning approach that combines the advantages of statistical regression and machine learning models.
arXiv Detail & Related papers (2021-01-03T16:39:00Z) - Patient-independent Epileptic Seizure Prediction using Deep Learning
Models [39.19336481493405]
The purpose of a seizure prediction system is to successfully identify the pre-ictal brain stage, which occurs before a seizure event.
Patient-independent seizure prediction models are designed to offer accurate performance across multiple subjects within a dataset.
We propose two patient-independent deep learning architectures with different learning strategies that can learn a global function utilizing data from multiple subjects.
arXiv Detail & Related papers (2020-11-18T23:13:48Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.