Comparing two samples through stochastic dominance: a graphical approach
- URL: http://arxiv.org/abs/2203.07889v1
- Date: Tue, 15 Mar 2022 13:37:03 GMT
- Title: Comparing two samples through stochastic dominance: a graphical approach
- Authors: Etor Arza, Josu Ceberio, Ekhi\~ne Irurozki, Aritz P\'erez
- Abstract summary: Non-deterministic measurements are common in real-world scenarios.
We propose an alternative framework to visually compare two samples according to their estimated cumulative distribution functions.
- Score: 2.867517731896504
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Non-deterministic measurements are common in real-world scenarios: the
performance of a stochastic optimization algorithm or the total reward of a
reinforcement learning agent in a chaotic environment are just two examples in
which unpredictable outcomes are common. These measures can be modeled as
random variables and compared among each other via their expected values or
more sophisticated tools such as null hypothesis statistical tests. In this
paper, we propose an alternative framework to visually compare two samples
according to their estimated cumulative distribution functions. First, we
introduce a dominance measure for two random variables that quantifies the
proportion in which the cumulative distribution function of one of the random
variables scholastically dominates the other one. Then, we present a graphical
method that decomposes in quantiles i) the proposed dominance measure and ii)
the probability that one of the random variables takes lower values than the
other. With illustrative purposes, we re-evaluate the experimentation of an
already published work with the proposed methodology and we show that
additional conclusions (missed by the rest of the methods) can be inferred.
Additionally, the software package RVCompare was created as a convenient way of
applying and experimenting with the proposed framework.
Related papers
- Probabilistic Conformal Prediction with Approximate Conditional Validity [81.30551968980143]
We develop a new method for generating prediction sets that combines the flexibility of conformal methods with an estimate of the conditional distribution.
Our method consistently outperforms existing approaches in terms of conditional coverage.
arXiv Detail & Related papers (2024-07-01T20:44:48Z) - Multiply-Robust Causal Change Attribution [15.501106533308798]
We develop a new estimation strategy that combines regression and re-weighting methods to quantify the contribution of each causal mechanism.
Our method demonstrates excellent performance in Monte Carlo simulations, and we show its usefulness in an empirical application.
arXiv Detail & Related papers (2024-04-12T22:57:01Z) - MINDE: Mutual Information Neural Diffusion Estimation [7.399561232927219]
We present a new method for the estimation of Mutual Information (MI) between random variables.
We use score-based diffusion models to estimate the Kullback Leibler divergence between two densities as a difference between their score functions.
As a by-product, our method also enables the estimation of the entropy of random variables.
arXiv Detail & Related papers (2023-10-13T11:47:41Z) - A Unified Framework for Multi-distribution Density Ratio Estimation [101.67420298343512]
Binary density ratio estimation (DRE) provides the foundation for many state-of-the-art machine learning algorithms.
We develop a general framework from the perspective of Bregman minimization divergence.
We show that our framework leads to methods that strictly generalize their counterparts in binary DRE.
arXiv Detail & Related papers (2021-12-07T01:23:20Z) - Sampling from Arbitrary Functions via PSD Models [55.41644538483948]
We take a two-step approach by first modeling the probability distribution and then sampling from that model.
We show that these models can approximate a large class of densities concisely using few evaluations, and present a simple algorithm to effectively sample from these models.
arXiv Detail & Related papers (2021-10-20T12:25:22Z) - A Stochastic Newton Algorithm for Distributed Convex Optimization [62.20732134991661]
We analyze a Newton algorithm for homogeneous distributed convex optimization, where each machine can calculate gradients of the same population objective.
We show that our method can reduce the number, and frequency, of required communication rounds compared to existing methods without hurting performance.
arXiv Detail & Related papers (2021-10-07T17:51:10Z) - Multivariate Probabilistic Regression with Natural Gradient Boosting [63.58097881421937]
We propose a Natural Gradient Boosting (NGBoost) approach based on nonparametrically modeling the conditional parameters of the multivariate predictive distribution.
Our method is robust, works out-of-the-box without extensive tuning, is modular with respect to the assumed target distribution, and performs competitively in comparison to existing approaches.
arXiv Detail & Related papers (2021-06-07T17:44:49Z) - An Embedded Model Estimator for Non-Stationary Random Functions using
Multiple Secondary Variables [0.0]
This paper introduces the method and shows that it has consistency results that are similar in nature to those applying to geostatistical modelling and to Quantile Random Forests.
The algorithm works by estimating a conditional distribution for the target variable at each target location.
arXiv Detail & Related papers (2020-11-09T00:14:24Z) - A One-step Approach to Covariate Shift Adaptation [82.01909503235385]
A default assumption in many machine learning scenarios is that the training and test samples are drawn from the same probability distribution.
We propose a novel one-step approach that jointly learns the predictive model and the associated weights in one optimization.
arXiv Detail & Related papers (2020-07-08T11:35:47Z) - Posterior Ratio Estimation of Latent Variables [14.619879849533662]
In some applications, we want to compare distributions of random variables that are emphinferred from observations.
We study the problem of estimating the ratio between two posterior probability density functions of a latent variable.
arXiv Detail & Related papers (2020-02-15T16:46:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.