A robust algorithm for explaining unreliable machine learning survival
models using the Kolmogorov-Smirnov bounds
- URL: http://arxiv.org/abs/2005.02249v1
- Date: Tue, 5 May 2020 14:47:35 GMT
- Title: A robust algorithm for explaining unreliable machine learning survival
models using the Kolmogorov-Smirnov bounds
- Authors: Maxim S. Kovalev and Lev V. Utkin
- Abstract summary: SurvLIME-KS is proposed for explaining machine learning survival models.
It is developed to ensure robustness to cases of a small amount of training data or outliers of survival data.
Various numerical experiments with synthetic and real datasets demonstrate the SurvLIME-KS efficiency.
- Score: 5.482532589225552
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A new robust algorithm based of the explanation method SurvLIME called
SurvLIME-KS is proposed for explaining machine learning survival models. The
algorithm is developed to ensure robustness to cases of a small amount of
training data or outliers of survival data. The first idea behind SurvLIME-KS
is to apply the Cox proportional hazards model to approximate the black-box
survival model at the local area around a test example due to the linear
relationship of covariates in the model. The second idea is to incorporate the
well-known Kolmogorov-Smirnov bounds for constructing sets of predicted
cumulative hazard functions. As a result, the robust maximin strategy is used,
which aims to minimize the average distance between cumulative hazard functions
of the explained black-box model and of the approximating Cox model, and to
maximize the distance over all cumulative hazard functions in the interval
produced by the Kolmogorov-Smirnov bounds. The maximin optimization problem is
reduced to the quadratic program. Various numerical experiments with synthetic
and real datasets demonstrate the SurvLIME-KS efficiency.
Related papers
- Kolmogorov-Smirnov GAN [52.36633001046723]
We propose a novel deep generative model, the Kolmogorov-Smirnov Generative Adversarial Network (KSGAN)
Unlike existing approaches, KSGAN formulates the learning process as a minimization of the Kolmogorov-Smirnov (KS) distance.
arXiv Detail & Related papers (2024-06-28T14:30:14Z) - OPSurv: Orthogonal Polynomials Quadrature Algorithm for Survival
Analysis [19.65859820376036]
This paper introduces the Orthogonal Polynomials Quadrature Algorithm for Survival Analysis (OPSurv)
OPSurv provides time-continuous functional outputs for both single and competing risks scenarios in survival analysis.
arXiv Detail & Related papers (2024-02-02T23:26:09Z) - SurvBeX: An explanation method of the machine learning survival models
based on the Beran estimator [4.640835690336653]
An explanation method called SurvBeX is proposed to interpret predictions of the machine learning survival black-box models.
Many numerical experiments with synthetic and real survival data demonstrate the SurvBeX efficiency.
arXiv Detail & Related papers (2023-08-07T17:18:37Z) - Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood
Estimation for Latent Gaussian Models [69.22568644711113]
We introduce probabilistic unrolling, a method that combines Monte Carlo sampling with iterative linear solvers to circumvent matrix inversions.
Our theoretical analyses reveal that unrolling and backpropagation through the iterations of the solver can accelerate gradient estimation for maximum likelihood estimation.
In experiments on simulated and real data, we demonstrate that probabilistic unrolling learns latent Gaussian models up to an order of magnitude faster than gradient EM, with minimal losses in model performance.
arXiv Detail & Related papers (2023-06-05T21:08:34Z) - Adaptive LASSO estimation for functional hidden dynamic geostatistical
model [69.10717733870575]
We propose a novel model selection algorithm based on a penalized maximum likelihood estimator (PMLE) for functional hiddenstatistical models (f-HD)
The algorithm is based on iterative optimisation and uses an adaptive least absolute shrinkage and selector operator (GMSOLAS) penalty function, wherein the weights are obtained by the unpenalised f-HD maximum-likelihood estimators.
arXiv Detail & Related papers (2022-08-10T19:17:45Z) - Modeling the Second Player in Distributionally Robust Optimization [90.25995710696425]
We argue for the use of neural generative models to characterize the worst-case distribution.
This approach poses a number of implementation and optimization challenges.
We find that the proposed approach yields models that are more robust than comparable baselines.
arXiv Detail & Related papers (2021-03-18T14:26:26Z) - Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve
Optimism, Embrace Virtual Curvature [61.22680308681648]
We show that global convergence is statistically intractable even for one-layer neural net bandit with a deterministic reward.
For both nonlinear bandit and RL, the paper presents a model-based algorithm, Virtual Ascent with Online Model Learner (ViOL)
arXiv Detail & Related papers (2021-02-08T12:41:56Z) - SODEN: A Scalable Continuous-Time Survival Model through Ordinary
Differential Equation Networks [14.564168076456822]
We propose a flexible model for survival analysis using neural networks along with scalable optimization algorithms.
We demonstrate the effectiveness of the proposed method in comparison to existing state-of-the-art deep learning survival analysis models.
arXiv Detail & Related papers (2020-08-19T19:11:25Z) - Efficient Ensemble Model Generation for Uncertainty Estimation with
Bayesian Approximation in Segmentation [74.06904875527556]
We propose a generic and efficient segmentation framework to construct ensemble segmentation models.
In the proposed method, ensemble models can be efficiently generated by using the layer selection method.
We also devise a new pixel-wise uncertainty loss, which improves the predictive performance.
arXiv Detail & Related papers (2020-05-21T16:08:38Z) - SurvLIME-Inf: A simplified modification of SurvLIME for explanation of
machine learning survival models [4.640835690336653]
The basic idea behind SurvLIME as well as SurvLIME-Inf is to apply the Cox proportional hazards model to approximate the black-box survival model at the local area around a test example.
In contrast to SurvLIME, the proposed modification uses $L_infty $-norm for defining distances between approximating and approximated cumulative hazard functions.
arXiv Detail & Related papers (2020-05-05T14:34:46Z) - SurvLIME: A method for explaining machine learning survival models [4.640835690336653]
The main idea behind the proposed method is to apply the Cox proportional hazards model to approximate the survival model at the local area around a test example.
A lot of numerical experiments demonstrate the SurvLIME efficiency.
arXiv Detail & Related papers (2020-03-18T17:48:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.