SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints
- URL: http://arxiv.org/abs/2203.10107v1
- Date: Fri, 18 Mar 2022 18:02:44 GMT
- Title: SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints
- Authors: Eric Daoud, Luca Ganassali, Antoine Baker, Marc Lelarge
- Abstract summary: We study the recommendation problem in the setting where affinities between users and items are based both on their embeddings in a latent space.
We propose an algorithm based on matrix factorization enhanced with optimal transport steps to model user-item affinities and learn item embeddings from observed data.
- Score: 6.241630263094724
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For a very broad range of problems, recommendation algorithms have been
increasingly used over the past decade. In most of these algorithms, the
predictions are built upon user-item affinity scores which are obtained from
high-dimensional embeddings of items and users. In more complex scenarios, with
geometrical or capacity constraints, prediction based on embeddings may not be
sufficient and some additional features should be considered in the design of
the algorithm. In this work, we study the recommendation problem in the setting
where affinities between users and items are based both on their embeddings in
a latent space and on their geographical distance in their underlying euclidean
space (e.g., $\mathbb{R}^2$), together with item capacity constraints. This
framework is motivated by some real-world applications, for instance in
healthcare: the task is to recommend hospitals to patients based on their
location, pathology, and hospital capacities. In these applications, there is
somewhat of an asymmetry between users and items: items are viewed as static
points, their embeddings, capacities and locations constraining the allocation.
Upon the observation of an optimal allocation, user embeddings, items
capacities, and their positions in their underlying euclidean space, our aim is
to recover item embeddings in the latent space; doing so, we are then able to
use this estimate e.g. in order to predict future allocations. We propose an
algorithm (SiMCa) based on matrix factorization enhanced with optimal transport
steps to model user-item affinities and learn item embeddings from observed
data. We then illustrate and discuss the results of such an approach for
hospital recommendation on synthetic data.
Related papers
- Quantifying User Coherence: A Unified Framework for Cross-Domain Recommendation Analysis [69.37718774071793]
This paper introduces novel information-theoretic measures for understanding recommender systems.
We evaluate 7 recommendation algorithms across 9 datasets, revealing the relationships between our measures and standard performance metrics.
arXiv Detail & Related papers (2024-10-03T13:02:07Z) - Spatial Process Approximations: Assessing Their Necessity [5.8666339171606445]
In spatial statistics and machine learning, the kernel matrix plays a pivotal role in prediction, classification, and maximum likelihood estimation.
A review of current methodologies for managing large spatial data indicates that some fail to address this ill-conditioning problem.
This paper introduces various optimality criteria and provides solutions for each.
arXiv Detail & Related papers (2023-11-06T15:46:03Z) - CARE: Large Precision Matrix Estimation for Compositional Data [9.440956168571617]
We introduce a precise specification of the compositional precision matrix and relate it to its basis counterpart.
By exploiting this connection, we propose a composition regularized estimation (CARE) method for estimating the sparse basis precision matrix.
Our theory reveals an intriguing trade-off between identification and estimation, thereby highlighting the blessing of dimensionality in compositional data analysis.
arXiv Detail & Related papers (2023-09-13T14:20:22Z) - Multicriteria Optimization Techniques for Understanding the Case Mix
Landscape of a Hospital [0.0]
This article considers the impact of treating different patient case mix (PCM) in a hospital.
To better understand the case mix landscape and to identify those which are optimal from a capacity utilisation perspective, an improved multicriteria optimization (MCO) approach is proposed.
arXiv Detail & Related papers (2023-07-31T22:55:48Z) - Tree ensemble kernels for Bayesian optimization with known constraints
over mixed-feature spaces [54.58348769621782]
Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search.
Two well-known challenges in using tree ensembles for black-box optimization are (i) effectively quantifying model uncertainty for exploration and (ii) optimizing over the piece-wise constant acquisition function.
Our framework performs as well as state-of-the-art methods for unconstrained black-box optimization over continuous/discrete features and outperforms competing methods for problems combining mixed-variable feature spaces and known input constraints.
arXiv Detail & Related papers (2022-07-02T16:59:37Z) - Quantifying Availability and Discovery in Recommender Systems via
Stochastic Reachability [27.21058243752746]
We propose an evaluation procedure based on reachability to quantify the maximum probability of recommending a target piece of content to a user.
reachability can be used to detect biases in the availability of content and diagnose limitations in the opportunities for discovery granted to users.
We demonstrate evaluations of recommendation algorithms trained on large datasets of explicit and implicit ratings.
arXiv Detail & Related papers (2021-06-30T16:18:12Z) - Estimating leverage scores via rank revealing methods and randomization [50.591267188664666]
We study algorithms for estimating the statistical leverage scores of rectangular dense or sparse matrices of arbitrary rank.
Our approach is based on combining rank revealing methods with compositions of dense and sparse randomized dimensionality reduction transforms.
arXiv Detail & Related papers (2021-05-23T19:21:55Z) - Resource Planning for Hospitals Under Special Consideration of the
COVID-19 Pandemic: Optimization and Sensitivity Analysis [87.31348761201716]
Crises like the COVID-19 pandemic pose a serious challenge to health-care institutions.
BaBSim.Hospital is a tool for capacity planning based on discrete event simulation.
We aim to investigate and optimize these parameters to improve BaBSim.Hospital.
arXiv Detail & Related papers (2021-05-16T12:38:35Z) - Offline Model-Based Optimization via Normalized Maximum Likelihood
Estimation [101.22379613810881]
We consider data-driven optimization problems where one must maximize a function given only queries at a fixed set of points.
This problem setting emerges in many domains where function evaluation is a complex and expensive process.
We propose a tractable approximation that allows us to scale our method to high-capacity neural network models.
arXiv Detail & Related papers (2021-02-16T06:04:27Z) - ReliefE: Feature Ranking in High-dimensional Spaces via Manifold
Embeddings [0.0]
Relief family of algorithms assign importances to features by iteratively accounting for nearest relevant and irrelevant instances.
Recent embedding-based methods learn compact, low-dimensional representations.
ReliefE algorithm is faster and can result in better feature rankings.
arXiv Detail & Related papers (2021-01-23T20:23:31Z) - Human Preference-Based Learning for High-dimensional Optimization of
Exoskeleton Walking Gaits [55.59198568303196]
This work presents LineCoSpar, a human-in-the-loop preference-based framework to learn user preferences in high dimensions.
In simulations and human trials, we empirically verify that LineCoSpar is a sample-efficient approach for high-dimensional preference optimization.
This result has implications for exoskeleton gait synthesis, an active field with applications to clinical use and patient rehabilitation.
arXiv Detail & Related papers (2020-03-13T22:02:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.