Ordinal-ResLogit: Interpretable Deep Residual Neural Networks for
Ordered Choices
- URL: http://arxiv.org/abs/2204.09187v1
- Date: Wed, 20 Apr 2022 02:14:28 GMT
- Title: Ordinal-ResLogit: Interpretable Deep Residual Neural Networks for
Ordered Choices
- Authors: Kimia Kamal and Bilal Farooq
- Abstract summary: We develop a fully interpretable deep learning-based ordinal regression model.
Formulations for market share, substitution patterns, and elasticities are derived.
Our results show that Ordinal-ResLogit outperforms the traditional ordinal regression model for both datasets.
- Score: 6.982614422666432
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This study presents an Ordinal version of Residual Logit (Ordinal-ResLogit)
model to investigate the ordinal responses. We integrate the standard ResLogit
model into COnsistent RAnk Logits (CORAL) framework, classified as a binary
classification algorithm, to develop a fully interpretable deep learning-based
ordinal regression model. As the formulation of the Ordinal-ResLogit model
enjoys the Residual Neural Networks concept, our proposed model addresses the
main constraint of machine learning algorithms, known as black-box. Moreover,
the Ordinal-ResLogit model, as a binary classification framework for ordinal
data, guarantees consistency among binary classifiers. We showed that the
resulting formulation is able to capture underlying unobserved heterogeneity
from the data as well as being an interpretable deep learning-based model.
Formulations for market share, substitution patterns, and elasticities are
derived. We compare the performance of the Ordinal-ResLogit model with an
Ordered Logit Model using a stated preference (SP) dataset on pedestrian wait
time and a revealed preference (RP) dataset on travel distance. Our results
show that Ordinal-ResLogit outperforms the traditional ordinal regression model
for both datasets. Furthermore, the results obtained from the Ordinal-ResLogit
RP model show that travel attributes such as driving and transit cost have
significant effects on choosing the location of non-mandatory trips. In terms
of the Ordinal-ResLogit SP model, our results highlight that the road-related
variables and traffic condition are contributing factors in the prediction of
pedestrian waiting time such that the mixed traffic condition significantly
increases the probability of choosing longer waiting times.
Related papers
- A Statistical-Modelling Approach to Feedforward Neural Network Model Selection [0.8287206589886881]
Feedforward neural networks (FNNs) can be viewed as non-linear regression models.
A novel model selection method is proposed using the Bayesian information criterion (BIC) for FNNs.
The choice of BIC over out-of-sample performance leads to an increased probability of recovering the true model.
arXiv Detail & Related papers (2022-07-09T11:07:04Z) - Amortized Inference for Causal Structure Learning [72.84105256353801]
Learning causal structure poses a search problem that typically involves evaluating structures using a score or independence test.
We train a variational inference model to predict the causal structure from observational/interventional data.
Our models exhibit robust generalization capabilities under substantial distribution shift.
arXiv Detail & Related papers (2022-05-25T17:37:08Z) - Continuously Generalized Ordinal Regression for Linear and Deep Models [41.03778663275373]
Ordinal regression is a classification task where classes have an order and prediction error increases the further the predicted class is from the true class.
We propose a new approach for modeling ordinal data that allows class-specific hyperplane slopes.
Our method significantly outperforms the standard ordinal logistic model over a thorough set of ordinal regression benchmark datasets.
arXiv Detail & Related papers (2022-02-14T19:49:05Z) - General Greedy De-bias Learning [163.65789778416172]
We propose a General Greedy De-bias learning framework (GGD), which greedily trains the biased models and the base model like gradient descent in functional space.
GGD can learn a more robust base model under the settings of both task-specific biased models with prior knowledge and self-ensemble biased model without prior knowledge.
arXiv Detail & Related papers (2021-12-20T14:47:32Z) - Anomaly Detection of Time Series with Smoothness-Inducing Sequential
Variational Auto-Encoder [59.69303945834122]
We present a Smoothness-Inducing Sequential Variational Auto-Encoder (SISVAE) model for robust estimation and anomaly detection of time series.
Our model parameterizes mean and variance for each time-stamp with flexible neural networks.
We show the effectiveness of our model on both synthetic datasets and public real-world benchmarks.
arXiv Detail & Related papers (2021-02-02T06:15:15Z) - Learning a binary search with a recurrent neural network. A novel
approach to ordinal regression analysis [0.0]
This article investigates the application of sequence-to-sequence learning methods provided by the deep learning framework in ordinal regression.
A method for visualizing the model's explanatory variables according to the ordinal target variable is proposed.
The method is compared to traditional ordinal regression methods on a number of benchmark dataset, and is shown to have comparable or significantly better predictive power.
arXiv Detail & Related papers (2021-01-07T16:16:43Z) - Deep and interpretable regression models for ordinal outcomes [0.0]
Outcomes with a natural order commonly occur in prediction tasks and often the available input data are a mixture of complex data.
We present ordinal neural network transformation models (ONTRAMs) which unite Deep Learning (DL) with classical ordinal regression approaches.
The performance of the most flexible ONTRAM is by definition equivalent to a standard multi-class DL model trained with cross-entropy while being faster in training when facing ordinal outcomes.
arXiv Detail & Related papers (2020-10-16T13:27:34Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z) - Two-step penalised logistic regression for multi-omic data with an
application to cardiometabolic syndrome [62.997667081978825]
We implement a two-step approach to multi-omic logistic regression in which variable selection is performed on each layer separately.
Our approach should be preferred if the goal is to select as many relevant predictors as possible.
Our proposed approach allows us to identify features that characterise cardiometabolic syndrome at the molecular level.
arXiv Detail & Related papers (2020-08-01T10:36:27Z) - Pattern Similarity-based Machine Learning Methods for Mid-term Load
Forecasting: A Comparative Study [0.0]
We use pattern similarity-based methods for forecasting monthly electricity demand expressing annual seasonality.
An integral part of the models is the time series representation using patterns of time series sequences.
We consider four such models: nearest neighbor model, fuzzy neighborhood model, kernel regression model and general regression neural network.
arXiv Detail & Related papers (2020-03-03T12:14:36Z) - On the Discrepancy between Density Estimation and Sequence Generation [92.70116082182076]
log-likelihood is highly correlated with BLEU when we consider models within the same family.
We observe no correlation between rankings of models across different families.
arXiv Detail & Related papers (2020-02-17T20:13:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.