Related papers: SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints

SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints

URL: http://arxiv.org/abs/2203.10107v1
Date: Fri, 18 Mar 2022 18:02:44 GMT
Title: SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints
Authors: Eric Daoud, Luca Ganassali, Antoine Baker, Marc Lelarge
Abstract summary: We study the recommendation problem in the setting where affinities between users and items are based both on their embeddings in a latent space. We propose an algorithm based on matrix factorization enhanced with optimal transport steps to model user-item affinities and learn item embeddings from observed data.
Score: 6.241630263094724
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: For a very broad range of problems, recommendation algorithms have been increasingly used over the past decade. In most of these algorithms, the predictions are built upon user-item affinity scores which are obtained from high-dimensional embeddings of items and users. In more complex scenarios, with geometrical or capacity constraints, prediction based on embeddings may not be sufficient and some additional features should be considered in the design of the algorithm. In this work, we study the recommendation problem in the setting where affinities between users and items are based both on their embeddings in a latent space and on their geographical distance in their underlying euclidean space (e.g., $\mathbb{R}^2$), together with item capacity constraints. This framework is motivated by some real-world applications, for instance in healthcare: the task is to recommend hospitals to patients based on their location, pathology, and hospital capacities. In these applications, there is somewhat of an asymmetry between users and items: items are viewed as static points, their embeddings, capacities and locations constraining the allocation. Upon the observation of an optimal allocation, user embeddings, items capacities, and their positions in their underlying euclidean space, our aim is to recover item embeddings in the latent space; doing so, we are then able to use this estimate e.g. in order to predict future allocations. We propose an algorithm (SiMCa) based on matrix factorization enhanced with optimal transport steps to model user-item affinities and learn item embeddings from observed data. We then illustrate and discuss the results of such an approach for hospital recommendation on synthetic data.

Related papers

Scalable Private Partition Selection via Adaptive Weighting [66.09199304818928]
In a private set union, users hold subsets of items from an unbounded universe. The goal is to output as many items as possible from the union of the users' sets while maintaining user-level differential privacy. We propose an algorithm for this problem, MaximumDegree (MAD), which adaptively reroutes weight from items with weight far above the threshold needed for privacy to items with smaller weight.
arXiv Detail & Related papers (2025-02-13T01:27:11Z)
Towards Scalable General Utility Reinforcement Learning: Occupancy Approximation, Sample Complexity and Global Optimality [23.623705771223303]
We propose to approximate occupancy measures within a function approximation class using maximum likelihood estimation (MLE) We provide a statistical analysis showing that our occupancy measure estimation error only scales with the dimension of our function approximation class rather than the size of the state action space.
arXiv Detail & Related papers (2024-10-05T10:24:07Z)
Scalable Dynamic Embedding Size Search for Streaming Recommendation [54.28404337601801]
Real-world recommender systems often operate in streaming recommendation scenarios. Number of users and items continues to grow, leading to substantial storage resource consumption. We learn Lightweight Embeddings for streaming recommendation, called SCALL, which can adaptively adjust the embedding sizes of users/items.
arXiv Detail & Related papers (2024-07-22T06:37:24Z)
Spatial Process Approximations: Assessing Their Necessity [5.8666339171606445]
In spatial statistics and machine learning, the kernel matrix plays a pivotal role in prediction, classification, and maximum likelihood estimation. A review of current methodologies for managing large spatial data indicates that some fail to address this ill-conditioning problem. This paper introduces various optimality criteria and provides solutions for each.
arXiv Detail & Related papers (2023-11-06T15:46:03Z)
CARE: Large Precision Matrix Estimation for Compositional Data [9.440956168571617]
We introduce a precise specification of the compositional precision matrix and relate it to its basis counterpart. By exploiting this connection, we propose a composition regularized estimation (CARE) method for estimating the sparse basis precision matrix. Our theory reveals an intriguing trade-off between identification and estimation, thereby highlighting the blessing of dimensionality in compositional data analysis.
arXiv Detail & Related papers (2023-09-13T14:20:22Z)
Multicriteria Optimization Techniques for Understanding the Case Mix Landscape of a Hospital [0.0]
This article considers the impact of treating different patient case mix (PCM) in a hospital. To better understand the case mix landscape and to identify those which are optimal from a capacity utilisation perspective, an improved multicriteria optimization (MCO) approach is proposed.
arXiv Detail & Related papers (2023-07-31T22:55:48Z)
Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces [54.58348769621782]
Tree ensembles can be well-suited for black-box optimization tasks such as algorithm tuning and neural architecture search. Two well-known challenges in using tree ensembles for black-box optimization are (i) effectively quantifying model uncertainty for exploration and (ii) optimizing over the piece-wise constant acquisition function. Our framework performs as well as state-of-the-art methods for unconstrained black-box optimization over continuous/discrete features and outperforms competing methods for problems combining mixed-variable feature spaces and known input constraints.
arXiv Detail & Related papers (2022-07-02T16:59:37Z)
Quantifying Availability and Discovery in Recommender Systems via Stochastic Reachability [27.21058243752746]
We propose an evaluation procedure based on reachability to quantify the maximum probability of recommending a target piece of content to a user. reachability can be used to detect biases in the availability of content and diagnose limitations in the opportunities for discovery granted to users. We demonstrate evaluations of recommendation algorithms trained on large datasets of explicit and implicit ratings.
arXiv Detail & Related papers (2021-06-30T16:18:12Z)
Estimating leverage scores via rank revealing methods and randomization [50.591267188664666]
We study algorithms for estimating the statistical leverage scores of rectangular dense or sparse matrices of arbitrary rank. Our approach is based on combining rank revealing methods with compositions of dense and sparse randomized dimensionality reduction transforms.
arXiv Detail & Related papers (2021-05-23T19:21:55Z)
Resource Planning for Hospitals Under Special Consideration of the COVID-19 Pandemic: Optimization and Sensitivity Analysis [87.31348761201716]
Crises like the COVID-19 pandemic pose a serious challenge to health-care institutions. BaBSim.Hospital is a tool for capacity planning based on discrete event simulation. We aim to investigate and optimize these parameters to improve BaBSim.Hospital.
arXiv Detail & Related papers (2021-05-16T12:38:35Z)
Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation [101.22379613810881]
We consider data-driven optimization problems where one must maximize a function given only queries at a fixed set of points. This problem setting emerges in many domains where function evaluation is a complex and expensive process. We propose a tractable approximation that allows us to scale our method to high-capacity neural network models.
arXiv Detail & Related papers (2021-02-16T06:04:27Z)
ReliefE: Feature Ranking in High-dimensional Spaces via Manifold Embeddings [0.0]
Relief family of algorithms assign importances to features by iteratively accounting for nearest relevant and irrelevant instances. Recent embedding-based methods learn compact, low-dimensional representations. ReliefE algorithm is faster and can result in better feature rankings.
arXiv Detail & Related papers (2021-01-23T20:23:31Z)
Human Preference-Based Learning for High-dimensional Optimization of Exoskeleton Walking Gaits [55.59198568303196]
This work presents LineCoSpar, a human-in-the-loop preference-based framework to learn user preferences in high dimensions. In simulations and human trials, we empirically verify that LineCoSpar is a sample-efficient approach for high-dimensional preference optimization. This result has implications for exoskeleton gait synthesis, an active field with applications to clinical use and patient rehabilitation.
arXiv Detail & Related papers (2020-03-13T22:02:15Z)

This list is automatically generated from the titles and abstracts of the papers in this site.