The MELODIC family for simultaneous binary logistic regression in a
reduced space
- URL: http://arxiv.org/abs/2102.08232v1
- Date: Tue, 16 Feb 2021 15:47:20 GMT
- Title: The MELODIC family for simultaneous binary logistic regression in a
reduced space
- Authors: Mark de Rooij and Patrick J. F. Groenen
- Abstract summary: We propose the MELODIC family for simultaneous binary logistic regression modeling.
The model may be interpreted in terms of logistic regression coefficients or in terms of a biplot.
Two applications are shown in detail: one relating personality characteristics to drug consumption profiles and one relating personality characteristics to depressive and anxiety disorders.
- Score: 0.5330240017302619
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Logistic regression is a commonly used method for binary classification.
Researchers often have more than a single binary response variable and
simultaneous analysis is beneficial because it provides insight into the
dependencies among response variables as well as between the predictor
variables and the responses. Moreover, in such a simultaneous analysis the
equations can lend each other strength, which might increase predictive
accuracy. In this paper, we propose the MELODIC family for simultaneous binary
logistic regression modeling. In this family, the regression models are defined
in a Euclidean space of reduced dimension, based on a distance rule. The model
may be interpreted in terms of logistic regression coefficients or in terms of
a biplot. We discuss a fast iterative majorization (or MM) algorithm for
parameter estimation. Two applications are shown in detail: one relating
personality characteristics to drug consumption profiles and one relating
personality characteristics to depressive and anxiety disorders. We present a
thorough comparison of our MELODIC family with alternative approaches for
multivariate binary data.
Related papers
- High-dimensional logistic regression with missing data: Imputation, regularization, and universality [7.167672851569787]
We study high-dimensional, ridge-regularized logistic regression.
We provide exact characterizations of both the prediction error and the estimation error.
arXiv Detail & Related papers (2024-10-01T21:41:21Z) - Statistical Agnostic Regression: a machine learning method to validate regression models [0.0]
We introduce Statistical Agnostic Regression (SAR) for evaluating the statistical significance of machine learning (ML)-based linear regression models.
We define a threshold that ensures there is sufficient evidence, with a probability of at least $1-eta$, to conclude the existence of a linear relationship in the population between the explanatory (feature) and the response (label) variables.
arXiv Detail & Related papers (2024-02-23T09:19:26Z) - Deep Generative Symbolic Regression [83.04219479605801]
Symbolic regression aims to discover concise closed-form mathematical equations from data.
Existing methods, ranging from search to reinforcement learning, fail to scale with the number of input variables.
We propose an instantiation of our framework, Deep Generative Symbolic Regression.
arXiv Detail & Related papers (2023-12-30T17:05:31Z) - Structured Radial Basis Function Network: Modelling Diversity for
Multiple Hypotheses Prediction [51.82628081279621]
Multi-modal regression is important in forecasting nonstationary processes or with a complex mixture of distributions.
A Structured Radial Basis Function Network is presented as an ensemble of multiple hypotheses predictors for regression problems.
It is proved that this structured model can efficiently interpolate this tessellation and approximate the multiple hypotheses target distribution.
arXiv Detail & Related papers (2023-09-02T01:27:53Z) - Dual-sPLS: a family of Dual Sparse Partial Least Squares regressions for
feature selection and prediction with tunable sparsity; evaluation on
simulated and near-infrared (NIR) data [1.6099403809839032]
The variant presented in this paper, Dual-sPLS, generalizes the classical PLS1 algorithm.
It provides balance between accurate prediction and efficient interpretation.
Code is provided as an open-source package in R.
arXiv Detail & Related papers (2023-01-17T21:50:35Z) - Inverting brain grey matter models with likelihood-free inference: a
tool for trustable cytoarchitecture measurements [62.997667081978825]
characterisation of the brain grey matter cytoarchitecture with quantitative sensitivity to soma density and volume remains an unsolved challenge in dMRI.
We propose a new forward model, specifically a new system of equations, requiring a few relatively sparse b-shells.
We then apply modern tools from Bayesian analysis known as likelihood-free inference (LFI) to invert our proposed model.
arXiv Detail & Related papers (2021-11-15T09:08:27Z) - On the Use of Minimum Penalties in Statistical Learning [2.1320960069210475]
We propose a framework to simultaneously estimate regression coefficients associated with a multivariate regression model and relationships between outcome variables.
An iterative algorithm that generalizes current state art methods is proposed as a solution.
We extend the proposed MinPen framework to other exponential family loss functions, with a specific focus on multiple binomial responses.
arXiv Detail & Related papers (2021-06-09T16:15:46Z) - MINIMALIST: Mutual INformatIon Maximization for Amortized Likelihood
Inference from Sampled Trajectories [61.3299263929289]
Simulation-based inference enables learning the parameters of a model even when its likelihood cannot be computed in practice.
One class of methods uses data simulated with different parameters to infer an amortized estimator for the likelihood-to-evidence ratio.
We show that this approach can be formulated in terms of mutual information between model parameters and simulated data.
arXiv Detail & Related papers (2021-06-03T12:59:16Z) - A Hypergradient Approach to Robust Regression without Correspondence [85.49775273716503]
We consider a variant of regression problem, where the correspondence between input and output data is not available.
Most existing methods are only applicable when the sample size is small.
We propose a new computational framework -- ROBOT -- for the shuffled regression problem.
arXiv Detail & Related papers (2020-11-30T21:47:38Z) - Two-step penalised logistic regression for multi-omic data with an
application to cardiometabolic syndrome [62.997667081978825]
We implement a two-step approach to multi-omic logistic regression in which variable selection is performed on each layer separately.
Our approach should be preferred if the goal is to select as many relevant predictors as possible.
Our proposed approach allows us to identify features that characterise cardiometabolic syndrome at the molecular level.
arXiv Detail & Related papers (2020-08-01T10:36:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.