Exploration of the Rashomon Set Assists Trustworthy Explanations for
Medical Data
- URL: http://arxiv.org/abs/2308.11446v2
- Date: Mon, 18 Sep 2023 09:50:53 GMT
- Title: Exploration of the Rashomon Set Assists Trustworthy Explanations for
Medical Data
- Authors: Katarzyna Kobyli\'nska, Mateusz Krzyzi\'nski, Rafa{\l} Machowicz,
Mariusz Adamek, Przemys{\l}aw Biecek
- Abstract summary: This paper introduces a novel process to explore models in the Rashomon set, extending the conventional modeling approach.
We propose the $textttRashomon_DETECT$ algorithm to detect models with different behavior.
To quantify differences in variable effects among models, we introduce the Profile Disparity Index (PDI) based on measures from functional data analysis.
- Score: 4.499833362998488
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The machine learning modeling process conventionally culminates in selecting
a single model that maximizes a selected performance metric. However, this
approach leads to abandoning a more profound analysis of slightly inferior
models. Particularly in medical and healthcare studies, where the objective
extends beyond predictions to valuable insight generation, relying solely on a
single model can result in misleading or incomplete conclusions. This problem
is particularly pertinent when dealing with a set of models known as
$\textit{Rashomon set}$, with performance close to maximum one. Such a set can
be numerous and may contain models describing the data in a different way,
which calls for comprehensive analysis. This paper introduces a novel process
to explore models in the Rashomon set, extending the conventional modeling
approach. We propose the $\texttt{Rashomon_DETECT}$ algorithm to detect models
with different behavior. It is based on recent developments in the eXplainable
Artificial Intelligence (XAI) field. To quantify differences in variable
effects among models, we introduce the Profile Disparity Index (PDI) based on
measures from functional data analysis. To illustrate the effectiveness of our
approach, we showcase its application in predicting survival among
hemophagocytic lymphohistiocytosis (HLH) patients - a foundational case study.
Additionally, we benchmark our approach on other medical data sets,
demonstrating its versatility and utility in various contexts. If differently
behaving models are detected in the Rashomon set, their combined analysis leads
to more trustworthy conclusions, which is of vital importance for high-stakes
applications such as medical applications.
Related papers
- Lessons Learned on Information Retrieval in Electronic Health Records: A Comparison of Embedding Models and Pooling Strategies [8.822087602255504]
Applying large language models to the clinical domain is challenging due to the context-heavy nature of processing medical records.
This paper explores how different embedding models and pooling methods affect information retrieval for the clinical domain.
arXiv Detail & Related papers (2024-09-23T16:16:08Z) - On Least Square Estimation in Softmax Gating Mixture of Experts [78.3687645289918]
We investigate the performance of the least squares estimators (LSE) under a deterministic MoE model.
We establish a condition called strong identifiability to characterize the convergence behavior of various types of expert functions.
Our findings have important practical implications for expert selection.
arXiv Detail & Related papers (2024-02-05T12:31:18Z) - The Importance of Downstream Networks in Digital Pathology Foundation Models [1.689369173057502]
We evaluate seven feature extractor models across three different datasets with 162 different aggregation model configurations.
We find that the performance of many current feature extractor models is notably similar.
arXiv Detail & Related papers (2023-11-29T16:54:25Z) - A prediction and behavioural analysis of machine learning methods for
modelling travel mode choice [0.26249027950824505]
We conduct a systematic comparison of different modelling approaches, across multiple modelling problems, in terms of the key factors likely to affect model choice.
Results indicate that the models with the highest disaggregate predictive performance provide poorer estimates of behavioural indicators and aggregate mode shares.
It is also observed that the MNL model performs robustly in a variety of situations, though ML techniques can improve the estimates of behavioural indices such as Willingness to Pay.
arXiv Detail & Related papers (2023-01-11T11:10:32Z) - Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation [24.65301562548798]
We study the problem of model selection in causal inference, specifically for conditional average treatment effect (CATE) estimation.
We conduct an empirical analysis to benchmark the surrogate model selection metrics introduced in the literature, as well as the novel ones introduced in this work.
arXiv Detail & Related papers (2022-11-03T16:26:06Z) - Interpretable Models Capable of Handling Systematic Missingness in
Imbalanced Classes and Heterogeneous Datasets [0.0]
Application of interpretable machine learning techniques on medical datasets facilitate early and fast diagnoses, along with getting deeper insight into the data.
Medical datasets face common issues such as heterogeneous measurements, imbalanced classes with limited sample size, and missing data.
We present a family of prototype-based (PB) interpretable models which are capable of handling these issues.
arXiv Detail & Related papers (2022-06-04T20:20:39Z) - Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data [50.23363975709122]
We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing panel data.
We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem.
We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms.
arXiv Detail & Related papers (2022-02-18T22:41:51Z) - A multi-stage machine learning model on diagnosis of esophageal
manometry [50.591267188664666]
The framework includes deep-learning models at the swallow-level stage and feature-based machine learning models at the study-level stage.
This is the first artificial-intelligence-style model to automatically predict CC diagnosis of HRM study from raw multi-swallow data.
arXiv Detail & Related papers (2021-06-25T20:09:23Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.