Probability Link Models with Symmetric Information Divergence
- URL: http://arxiv.org/abs/2008.04387v1
- Date: Mon, 10 Aug 2020 19:49:51 GMT
- Title: Probability Link Models with Symmetric Information Divergence
- Authors: Majid Asadi, Karthik Devarajan, Nader Ebrahimi, Ehsan Soofi, Lauren
Spirko-Burns
- Abstract summary: Two general classes of link models are proposed.
The first model links two survival functions and is applicable to models such as the proportional odds and change point.
The second model links two cumulative probability distribution functions.
- Score: 1.5749416770494706
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper introduces link functions for transforming one probability
distribution to another such that the Kullback-Leibler and R\'enyi divergences
between the two distributions are symmetric. Two general classes of link models
are proposed. The first model links two survival functions and is applicable to
models such as the proportional odds and change point, which are used in
survival analysis and reliability modeling. A prototype application involving
the proportional odds model demonstrates advantages of symmetric divergence
measures over asymmetric measures for assessing the efficacy of features and
for model averaging purposes. The advantages include providing unique ranks for
models and unique information weights for model averaging with one-half as much
computation requirement of asymmetric divergences. The second model links two
cumulative probability distribution functions. This model produces a
generalized location model which are continuous counterparts of the binary
probability models such as probit and logit models. Examples include the
generalized probit and logit models which have appeared in the survival
analysis literature, and a generalized Laplace model and a generalized
Student-$t$ model, which are survival time models corresponding to the
respective binary probability models. Lastly, extensions to symmetric
divergence between survival functions and conditions for copula dependence
information are presented.
Related papers
- Latent Semantic Consensus For Deterministic Geometric Model Fitting [109.44565542031384]
We propose an effective method called Latent Semantic Consensus (LSC)
LSC formulates the model fitting problem into two latent semantic spaces based on data points and model hypotheses.
LSC is able to provide consistent and reliable solutions within only a few milliseconds for general multi-structural model fitting.
arXiv Detail & Related papers (2024-03-11T05:35:38Z) - Sufficient Identification Conditions and Semiparametric Estimation under
Missing Not at Random Mechanisms [4.211128681972148]
Conducting valid statistical analyses is challenging in the presence of missing-not-at-random (MNAR) data.
We consider a MNAR model that generalizes several prior popular MNAR models in two ways.
We propose methods for testing the independence restrictions encoded in such models using odds ratio as our parameter of interest.
arXiv Detail & Related papers (2023-06-10T13:46:16Z) - Approximation-Generalization Trade-offs under (Approximate) Group
Equivariance [3.0458514384586395]
Group equivariant neural networks have demonstrated impressive performance across various domains and applications such as protein and drug design.
We show how models capturing task-specific symmetries lead to improved generalization.
We examine the more general question of model mis-specification when the model symmetries don't align with the data symmetries.
arXiv Detail & Related papers (2023-05-27T22:53:37Z) - On the Strong Correlation Between Model Invariance and Generalization [54.812786542023325]
Generalization captures a model's ability to classify unseen data.
Invariance measures consistency of model predictions on transformations of the data.
From a dataset-centric view, we find a certain model's accuracy and invariance linearly correlated on different test sets.
arXiv Detail & Related papers (2022-07-14T17:08:25Z) - Model Comparison in Approximate Bayesian Computation [0.456877715768796]
A common problem in natural sciences is the comparison of competing models in the light of observed data.
This framework relies on the calculation of likelihood functions which are intractable for most models used in practice.
I propose a new efficient method to perform Bayesian model comparison in ABC.
arXiv Detail & Related papers (2022-03-15T10:24:16Z) - Sampling from Arbitrary Functions via PSD Models [55.41644538483948]
We take a two-step approach by first modeling the probability distribution and then sampling from that model.
We show that these models can approximate a large class of densities concisely using few evaluations, and present a simple algorithm to effectively sample from these models.
arXiv Detail & Related papers (2021-10-20T12:25:22Z) - PSD Representations for Effective Probability Models [117.35298398434628]
We show that a recently proposed class of positive semi-definite (PSD) models for non-negative functions is particularly suited to this end.
We characterize both approximation and generalization capabilities of PSD models, showing that they enjoy strong theoretical guarantees.
Our results open the way to applications of PSD models to density estimation, decision theory and inference.
arXiv Detail & Related papers (2021-06-30T15:13:39Z) - Multipopulation mortality modelling and forecasting: The multivariate
functional principal component with time weightings approaches [3.450774887322348]
We introduce two new models for joint mortality modelling and forecasting multiple subpopulations.
The first proposed model extends the independent functional data model to a multi-population modelling setting.
The second proposed model outperforms the first model as well as the current models in terms of forecast accuracy.
arXiv Detail & Related papers (2021-02-18T21:01:58Z) - Why do classifier accuracies show linear trends under distribution
shift? [58.40438263312526]
accuracies of models on one data distribution are approximately linear functions of the accuracies on another distribution.
We assume the probability that two models agree in their predictions is higher than what we can infer from their accuracy levels alone.
We show that a linear trend must occur when evaluating models on two distributions unless the size of the distribution shift is large.
arXiv Detail & Related papers (2020-12-31T07:24:30Z) - Decision-Making with Auto-Encoding Variational Bayes [71.44735417472043]
We show that a posterior approximation distinct from the variational distribution should be used for making decisions.
Motivated by these theoretical results, we propose learning several approximate proposals for the best model.
In addition to toy examples, we present a full-fledged case study of single-cell RNA sequencing.
arXiv Detail & Related papers (2020-02-17T19:23:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.