Torch-Choice: A PyTorch Package for Large-Scale Choice Modelling with Python
- URL: http://arxiv.org/abs/2304.01906v3
- Date: Fri, 14 Jul 2023 21:42:04 GMT
- Title: Torch-Choice: A PyTorch Package for Large-Scale Choice Modelling with Python
- Authors: Tianyu Du, Ayush Kanodia and Susan Athey
- Abstract summary: $\texttt{torch-choice}$ is an open-source library for flexible, fast choice modeling with Python and PyTorch.
$\texttt{torch-choice}$ provides a $\texttt{ChoiceDataset}$ data structure to manage databases flexibly and memory-efficiently.
- Score: 11.566791864440262
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: $\texttt{torch-choice}$ is an open-source library for flexible, fast
choice modeling with Python and PyTorch. $\texttt{torch-choice}$ provides a
$\texttt{ChoiceDataset}$ data structure to manage databases flexibly and
memory-efficiently. The paper demonstrates how to construct a
$\texttt{ChoiceDataset}$ from databases of various formats and reviews the
functionalities of $\texttt{ChoiceDataset}$. The package implements two widely
used models, namely the multinomial logit and nested logit models, and supports
regularization during model estimation. The package can take advantage of GPUs
for estimation, allowing it to scale to massive datasets while remaining
computationally efficient. Models can be initialized using either R-style
formula strings or Python dictionaries. We conclude with a comparison of the
computational efficiency of $\texttt{torch-choice}$ and $\texttt{mlogit}$ in R
as (1) the number of observations increases, (2) the number of covariates
increases, and (3) the item set expands. Finally, we demonstrate the
scalability of $\texttt{torch-choice}$ on large-scale datasets.
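To make the estimation approach concrete, below is a minimal sketch of fitting a multinomial logit model by maximum likelihood with plain PyTorch, using a GPU when one is available. It illustrates the kind of computation that $\texttt{torch-choice}$ wraps; it does not use the $\texttt{torch-choice}$ API itself, and all tensor names, sizes, and the synthetic data are hypothetical.

```python
# Illustrative multinomial logit estimation in plain PyTorch
# (not the torch-choice API; names and sizes are made up for this sketch).
import torch

torch.manual_seed(0)
device = "cuda" if torch.cuda.is_available() else "cpu"

n_obs, n_items, n_feats = 10_000, 4, 3                      # hypothetical sizes
X = torch.randn(n_obs, n_items, n_feats, device=device)     # item-specific covariates
true_beta = torch.tensor([1.0, -0.5, 2.0], device=device)   # data-generating coefficients
utility = X @ true_beta                                      # (n_obs, n_items) utilities
choices = torch.distributions.Categorical(logits=utility).sample()  # observed choices

beta = torch.zeros(n_feats, device=device, requires_grad=True)
optimizer = torch.optim.LBFGS([beta], max_iter=100)

def closure():
    optimizer.zero_grad()
    logits = X @ beta                                        # deterministic utilities under beta
    # negative log-likelihood of the observed choices under the logit model
    nll = torch.nn.functional.cross_entropy(logits, choices, reduction="sum")
    nll.backward()
    return nll

optimizer.step(closure)
print("estimated coefficients:", beta.detach().cpu())
```

In $\texttt{torch-choice}$ itself, the data would instead live in a $\texttt{ChoiceDataset}$ and the model would be specified through an R-style formula string or a Python dictionary, with regularization and GPU placement handled by the package.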
Related papers
- PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning [54.912520425218496]
We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data.
We demonstrate the usefulness of PyTorch Frame by implementing diverse models in a modular way.
We integrate PyTorch Frame with PyTorch Geometric, a PyTorch library for Graph Neural Networks (GNNs), to perform end-to-end learning over relational databases.
arXiv Detail & Related papers (2024-03-31T19:15:09Z) - pyvene: A Library for Understanding and Improving PyTorch Models via Interventions [79.72930339711478]
$\textbf{pyvene}$ is an open-source library that supports customizable interventions on a range of different PyTorch modules.
We show how $\textbf{pyvene}$ provides a unified framework for performing interventions on neural models and sharing the intervened-upon models with others.
arXiv Detail & Related papers (2024-03-12T16:46:54Z) - Interpreting Deep Neural Networks with the Package innsight [0.951828574518325]
innsight is generally the first R package implementing feature attribution methods for neural networks.
It operates independently of the deep learning library, allowing the interpretation of models from any R package.
Internally, innsight benefits from the torch package's fast and efficient array calculations.
arXiv Detail & Related papers (2023-06-19T10:12:32Z) - DADApy: Distance-based Analysis of DAta-manifolds in Python [51.37841707191944]
DADApy is a python software package for analysing and characterising high-dimensional data.
It provides methods for estimating the intrinsic dimension and the probability density, for performing density-based clustering and for comparing different distance metrics.
arXiv Detail & Related papers (2022-05-04T08:41:59Z) - Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$ [118.04625413322827]
$\texttt{t5x}$ and $\texttt{seqio}$ are open source software libraries for building and training language models.
These libraries have been used to train models with hundreds of billions of parameters on datasets with multiple terabytes of training data.
arXiv Detail & Related papers (2022-03-31T17:12:13Z) - $\texttt{py-irt}$: A Scalable Item Response Theory Library for Python [3.9828133571463935]
$\texttt{py-irt}$ is a Python library for fitting Bayesian Item Response Theory (IRT) models.
It estimates latent traits of subjects and items, making it appropriate for use in IRT tasks as well as ideal-point models.
arXiv Detail & Related papers (2022-03-02T18:09:46Z) - PyHHMM: A Python Library for Heterogeneous Hidden Markov Models [63.01207205641885]
PyHHMM is an object-oriented Python implementation of Heterogeneous Hidden Markov Models (HHMMs).
PyHHMM emphasizes features not supported in similar available frameworks: a heterogeneous observation model, missing data inference, different model order selection criteria, and semi-supervised training.
PyHHMM relies on the numpy, scipy, scikit-learn, and seaborn Python packages, and is distributed under the Apache-2.0 License.
arXiv Detail & Related papers (2022-01-12T07:32:36Z) - Using Python for Model Inference in Deep Learning [0.6027358520885614]
We show how it is possible to meet performance and packaging constraints while performing inference in Python.
We present a way of using multiple Python interpreters within a single process to achieve scalable inference.
arXiv Detail & Related papers (2021-04-01T04:48:52Z) - pyBART: Evidence-based Syntactic Transformations for IE [52.93947844555369]
We present pyBART, an easy-to-use open-source Python library for converting English UD trees to Enhanced UD graphs or to our representation.
When evaluated in a pattern-based relation extraction scenario, our representation results in higher extraction scores than Enhanced UD, while requiring fewer patterns.
arXiv Detail & Related papers (2020-05-04T07:38:34Z) - Pruned Wasserstein Index Generation Model and wigpy Package [0.0]
I propose a Lasso-based shrinkage method to reduce the dimensionality of the vocabulary as a pre-processing step prior to fitting the WIG model.
I also provide a $\textit{wigpy}$ module in Python to carry out computation in both flavors.
arXiv Detail & Related papers (2020-03-30T18:26:24Z)