Interactive Recommendations for Optimal Allocations in Markets with
Constraints
- URL: http://arxiv.org/abs/2207.04143v1
- Date: Fri, 8 Jul 2022 22:16:51 GMT
- Title: Interactive Recommendations for Optimal Allocations in Markets with
Constraints
- Authors: Yigit Efe Erginbas, Soham Phade, Kannan Ramchandran
- Abstract summary: We propose an interactive framework where the system provider can enhance the quality of recommendations to the users.
We employ an integrated approach using techniques from collaborative filtering, bandits, and optimal resource allocation.
Empirical studies on synthetic matrix and real-world data also demonstrate the effectiveness and performance of our approach.
- Score: 12.580391999838128
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recommendation systems when employed in markets play a dual role: they assist
users in selecting their most desired items from a large pool and they help in
allocating a limited number of items to the users who desire them the most.
Despite the prevalence of capacity constraints on allocations in many
real-world recommendation settings, a principled way of incorporating them in
the design of these systems has been lacking. Motivated by this, we propose an
interactive framework where the system provider can enhance the quality of
recommendations to the users by opportunistically exploring allocations that
maximize user rewards and respect the capacity constraints using appropriate
pricing mechanisms. We model the problem as an instance of a low-rank
combinatorial multi-armed bandit problem with selection constraints on the
arms. We employ an integrated approach using techniques from collaborative
filtering, combinatorial bandits, and optimal resource allocation to provide an
algorithm that provably achieves sub-linear regret, namely $\tilde{\mathcal{O}}
( \sqrt{N M (N+M) RT} )$ in $T$ rounds for a problem with $N$ users, $M$ items
and rank $R$ mean reward matrix. Empirical studies on synthetic and real-world
data also demonstrate the effectiveness and performance of our approach.
Related papers
- LANE: Logic Alignment of Non-tuning Large Language Models and Online Recommendation Systems for Explainable Reason Generation [15.972926854420619]
Leveraging large language models (LLMs) offers new opportunities for comprehensive recommendation logic generation.
Fine-tuning LLM models for recommendation tasks incurs high computational costs and alignment issues with existing systems.
In this work, our proposed effective strategy LANE aligns LLMs with online recommendation systems without additional LLMs tuning.
arXiv Detail & Related papers (2024-07-03T06:20:31Z) - Efficient Prompt Optimization Through the Lens of Best Arm Identification [50.56113809171805]
This work provides a principled framework, TRIPLE, to efficiently perform prompt selection under an explicit budget constraint.
It is built on a novel connection established between prompt optimization and fixed-budget best arm identification (BAI-FB) in multi-armed bandits (MAB)
arXiv Detail & Related papers (2024-02-15T05:31:13Z) - Approaching Collateral Optimization for NISQ and Quantum-Inspired
Computing [0.0]
Collateral optimization refers to the systematic allocation of financial assets to satisfy obligations or secure transactions.
One of the common objectives is to minimise the cost of collateral required to mitigate the risk associated with a particular transaction or a portfolio of transactions.
arXiv Detail & Related papers (2023-05-25T18:01:04Z) - A Bandit Approach to Online Pricing for Heterogeneous Edge Resource
Allocation [8.089950414444115]
Two novel online pricing mechanisms are proposed for heterogeneous edge resource allocation.
The mechanisms operate in real-time and do not require prior knowledge of demand distribution.
The proposed posted pricing schemes allow users to select and pay for their preferred resources, with the platform dynamically adjusting resource prices based on observed historical data.
arXiv Detail & Related papers (2023-02-14T10:21:14Z) - Interactive Learning with Pricing for Optimal and Stable Allocations in
Markets [12.580391999838128]
Large-scale online recommendation systems must facilitate the allocation of a limited number of items among competing users while learning their preferences from user feedback.
Our framework enhances the quality of recommendations by exploring allocations that optimistically maximize the rewards.
To minimize instability, a measure of users' incentives to deviate from recommended allocations, the algorithm prices the items based on a scheme derived from the Walrasian equilibria.
Our approach is the first to integrate techniques from bandits, optimal resource allocation, and collaborative filtering to obtain an algorithm that achieves sub-linear social welfare regret as well as sub-linear instability.
arXiv Detail & Related papers (2022-12-13T20:33:54Z) - Incentive-Aware Recommender Systems in Two-Sided Markets [49.692453629365204]
We propose a novel recommender system that aligns with agents' incentives while achieving myopically optimal performance.
Our framework models this incentive-aware system as a multi-agent bandit problem in two-sided markets.
Both algorithms satisfy an ex-post fairness criterion, which protects agents from over-exploitation.
arXiv Detail & Related papers (2022-11-23T22:20:12Z) - A Unifying Framework for Online Optimization with Long-Term Constraints [62.35194099438855]
We study online learning problems in which a decision maker has to take a sequence of decisions subject to $m$ long-term constraints.
The goal is to maximize their total reward, while at the same time achieving small cumulative violation across the $T$ rounds.
We present the first best-of-both-world type algorithm for this general class problems, with no-regret guarantees both in the case in which rewards and constraints are selected according to an unknown model, and in the case in which they are selected at each round by an adversary.
arXiv Detail & Related papers (2022-09-15T16:59:19Z) - Online Learning with Knapsacks: the Best of Both Worlds [54.28273783164608]
We casting online learning problems in which a decision maker wants to maximize their expected reward without violating a finite set of $m$m resource constraints.
Our framework allows the decision maker to handle its evidence flexibility and costoretic functions.
arXiv Detail & Related papers (2022-02-28T12:10:48Z) - Exploration in two-stage recommender systems [79.50534282841618]
Two-stage recommender systems are widely adopted in industry due to their scalability and maintainability.
A key challenge of this setup is that optimal performance of each stage in isolation does not imply optimal global performance.
We propose a method of synchronising the exploration strategies between the ranker and the nominators.
arXiv Detail & Related papers (2020-09-01T16:52:51Z) - Hierarchical Adaptive Contextual Bandits for Resource Constraint based
Recommendation [49.69139684065241]
Contextual multi-armed bandit (MAB) achieves cutting-edge performance on a variety of problems.
In this paper, we propose a hierarchical adaptive contextual bandit method (HATCH) to conduct the policy learning of contextual bandits with a budget constraint.
arXiv Detail & Related papers (2020-04-02T17:04:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.