Competing Bandits: The Perils of Exploration Under Competition
- URL: http://arxiv.org/abs/2007.10144v8
- Date: Sat, 12 Oct 2024 14:24:30 GMT
- Title: Competing Bandits: The Perils of Exploration Under Competition
- Authors: Guy Aridor, Yishay Mansour, Aleksandrs Slivkins, Zhiwei Steven Wu,
- Abstract summary: We study the interplay between exploration and competition on online platforms.
We find that stark competition induces firms to commit to a "greedy" bandit algorithm that leads to low welfare.
We investigate two channels for weakening the competition: relaxing the rationality of users and giving one firm a first-mover advantage.
- Score: 99.68537519404727
- License:
- Abstract: Most online platforms strive to learn from interactions with users, and many engage in exploration: making potentially suboptimal choices for the sake of acquiring new information. We study the interplay between exploration and competition: how such platforms balance the exploration for learning and the competition for users. Here users play three distinct roles: they are customers that generate revenue, they are sources of data for learning, and they are self-interested agents which choose among the competing platforms. We consider a stylized duopoly model in which two firms face the same multi-armed bandit problem. Users arrive one by one and choose between the two firms, so that each firm makes progress on its bandit problem only if it is chosen. Through a mix of theoretical results and numerical simulations, we study whether and to what extent competition incentivizes the adoption of better bandit algorithms, and whether it leads to welfare increases for users. We find that stark competition induces firms to commit to a "greedy" bandit algorithm that leads to low welfare. However, weakening competition by providing firms with some "free" users incentivizes better exploration strategies and increases welfare. We investigate two channels for weakening the competition: relaxing the rationality of users and giving one firm a first-mover advantage. Our findings are closely related to the "competition vs. innovation" relationship, and elucidate the first-mover advantage in the digital economy.
Related papers
- Defection-Free Collaboration between Competitors in a Learning System [61.22540496065961]
We study collaborative learning systems in which the participants are competitors who will defect from the system if they lose revenue by collaborating.
We propose a more equitable, *defection-free* scheme in which both firms share with each other while losing no revenue.
arXiv Detail & Related papers (2024-06-22T17:29:45Z) - User Welfare Optimization in Recommender Systems with Competing Content Creators [65.25721571688369]
In this study, we perform system-side user welfare optimization under a competitive game setting among content creators.
We propose an algorithmic solution for the platform, which dynamically computes a sequence of weights for each user based on their satisfaction of the recommended content.
These weights are then utilized to design mechanisms that adjust the recommendation policy or the post-recommendation rewards, thereby influencing creators' content production strategies.
arXiv Detail & Related papers (2024-04-28T21:09:52Z) - A Competition-based Pricing Strategy in Cloud Markets using Regret
Minimization Techniques [0.0]
This work proposes a pricing policy related to the regret minimization algorithm and applies it to the considered incomplete-information game.
Based on the competition based marketplace of the Cloud, providers update the distribution of their strategies using the experienced regret.
The experimental results show much more increase in profits of the providers in comparison with other pricing policies.
arXiv Detail & Related papers (2023-09-20T13:38:43Z) - Improving Fairness in Adaptive Social Exergames via Shapley Bandits [7.215807283769683]
We propose a new type of fairness-aware multi-armed bandit, Shapley Bandits.
It uses the Shapley Value for increasing overall player participation and intervention adherence rather than the total group output.
Our results indicate that our Shapley Bandits effectively mediates the Greedy Bandit Problem and achieves better user retention and motivation across the participants.
arXiv Detail & Related papers (2023-02-18T11:34:02Z) - Bandit Social Learning: Exploration under Myopic Behavior [58.75758600464338]
We study social learning dynamics motivated by reviews on online platforms.
Agents collectively follow a simple multi-armed bandit protocol, but each agent acts myopically, without regards to exploration.
We derive stark learning failures for any such behavior, and provide matching positive results.
arXiv Detail & Related papers (2023-02-15T01:57:57Z) - How Bad is Top-$K$ Recommendation under Competing Content Creators? [43.2268992294178]
We study the user welfare guarantee through the lens of Price of Anarchy.
We show that the fraction of user welfare loss due to creator competition is always upper bounded by a small constant depending on $K$ and randomness in user decisions.
arXiv Detail & Related papers (2023-02-03T19:37:35Z) - Incentive-Aware Recommender Systems in Two-Sided Markets [49.692453629365204]
We propose a novel recommender system that aligns with agents' incentives while achieving myopically optimal performance.
Our framework models this incentive-aware system as a multi-agent bandit problem in two-sided markets.
Both algorithms satisfy an ex-post fairness criterion, which protects agents from over-exploitation.
arXiv Detail & Related papers (2022-11-23T22:20:12Z) - Competition, Alignment, and Equilibria in Digital Marketplaces [97.03797129675951]
We study a duopoly market where platform actions are bandit algorithms and the two platforms compete for user participation.
Our main finding is that competition in this market does not perfectly align market outcomes with user utility.
arXiv Detail & Related papers (2022-08-30T17:43:58Z) - Invitation in Crowdsourcing Contests [9.860944032009847]
In this work, we take peoples' social ties as a key factor in the modeling and designing of agents' incentives for crowdsourcing contests.
We establish a new contest mechanism by which the requester can impel agents to invite their neighbours to contribute to the task.
According to our equilibrium analysis, in the Bayesian Nash equilibrium agents' behaviors show a vast diversity, capturing that besides the intrinsic ability, the social ties among agents also play a central role for decision-making.
arXiv Detail & Related papers (2021-12-06T09:18:18Z) - Maximizing Social Welfare in a Competitive Diffusion Model [19.18481967353127]
UIC (IM) has garnered a lot of attention owing to applications such as viral marketing and infection containment.
It aims to select a small number of seed users to adopt an item such that adoption propagates to a large number of users in the network.
Existing works on competitive IM have several limitations.
arXiv Detail & Related papers (2020-12-06T19:09:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.