Online Learning in a Creator Economy
- URL: http://arxiv.org/abs/2305.11381v1
- Date: Fri, 19 May 2023 01:58:13 GMT
- Title: Online Learning in a Creator Economy
- Authors: Banghua Zhu, Sai Praneeth Karimireddy, Jiantao Jiao, Michael I. Jordan
- Abstract summary: We study the creator economy as a three-party game between the users, platform, and content creators.
We analyze two families of contracts: return-based contracts and feature-based contracts.
We show that under smoothness assumptions, the joint optimization of return-based contracts and recommendation policy achieves a regret of $\Theta(T^{2/3})$.
- Score: 91.55437924091844
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The creator economy has revolutionized the way individuals can profit through
online platforms. In this paper, we initiate the study of online learning in
the creator economy by modeling the creator economy as a three-party game
between the users, platform, and content creators, with the platform
interacting with the content creator under a principal-agent model through
contracts to encourage better content. Additionally, the platform interacts
with the users to recommend new content, receive an evaluation, and ultimately
profit from the content, which can be modeled as a recommender system.
Our study aims to explore how the platform can jointly optimize the contract
and recommender system to maximize the utility in an online learning fashion.
We primarily analyze and compare two families of contracts: return-based
contracts and feature-based contracts. Return-based contracts pay the content
creator a fraction of the reward the platform gains. In contrast, feature-based
contracts pay the content creator based on the quality or features of the
content, regardless of the reward the platform receives. We show that under
smoothness assumptions, the joint optimization of return-based contracts and
recommendation policy achieves a regret of $\Theta(T^{2/3})$. For the
feature-based contract, we introduce a definition of intrinsic dimension $d$ to
characterize the hardness of learning the contract and provide an upper bound
of $\mathcal{O}(T^{(d+1)/(d+2)})$ on the regret. The upper bound is tight for the
linear family.
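The two contract families and the two regret rates in the abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the paper: the function names, the `share` parameter, and in particular the linear pricing rule for the feature-based contract, which is only one simple member of the family the abstract describes. The exponent helpers just evaluate the rates $T^{2/3}$ and $T^{(d+1)/(d+2)}$ as stated above.

```python
from typing import Sequence


def return_based_payment(platform_reward: float, share: float) -> float:
    """Pay the creator a fraction `share` of the reward the platform gains."""
    if not 0.0 <= share <= 1.0:
        raise ValueError("share must lie in [0, 1]")
    return share * platform_reward


def feature_based_payment(features: Sequence[float],
                          weights: Sequence[float]) -> float:
    """Pay the creator from content features alone.

    Assumption: a linear rule (weights . features) chosen for illustration;
    the payment ignores the reward the platform later receives.
    """
    if len(features) != len(weights):
        raise ValueError("features and weights must have the same length")
    return sum(w * x for w, x in zip(weights, features))


def return_based_regret_exponent() -> float:
    """Exponent of T in the Theta(T^{2/3}) rate for return-based contracts."""
    return 2.0 / 3.0


def feature_based_regret_exponent(d: int) -> float:
    """Exponent of T in the O(T^{(d+1)/(d+2)}) bound, d = intrinsic dimension."""
    if d < 0:
        raise ValueError("d must be non-negative")
    return (d + 1) / (d + 2)


if __name__ == "__main__":
    # Same piece of content, priced under each contract family.
    print(return_based_payment(platform_reward=8.0, share=0.25))
    print(feature_based_payment(features=[0.5, 2.0], weights=[2.0, 1.0]))
    # The feature-based exponent grows toward 1 as d increases.
    for d in (1, 2, 5, 10):
        print(d, feature_based_regret_exponent(d))
```

As a matter of arithmetic, the feature-based exponent equals 2/3 at $d = 1$ and approaches 1 as $d$ grows, so the stated upper bound becomes weaker for contract families with larger intrinsic dimension.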
Related papers
- Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms [68.51708490104687]
We show that a purely relevance-driven policy with low exploration strength boosts short-term user satisfaction but undermines the long-term richness of the content pool.
Our findings reveal a fundamental trade-off between immediate user satisfaction and overall content production on platforms.
(2024-10-31T07:19:22Z)
- User Welfare Optimization in Recommender Systems with Competing Content Creators [65.25721571688369]
In this study, we perform system-side user welfare optimization under a competitive game setting among content creators.
We propose an algorithmic solution for the platform, which dynamically computes a sequence of weights for each user based on their satisfaction with the recommended content.
These weights are then utilized to design mechanisms that adjust the recommendation policy or the post-recommendation rewards, thereby influencing creators' content production strategies.
(2024-04-28T21:09:52Z)
- Clickbait vs. Quality: How Engagement-Based Optimization Shapes the Content Landscape in Online Platforms [16.26484874313566]
We study a game between content creators competing on the basis of engagement metrics and analyze the equilibrium decisions about investment in quality and gaming.
We show the content created at equilibrium exhibits a positive correlation between quality and gaming, and we empirically validate this finding on a Twitter dataset.
Perhaps counterintuitively, the average quality of content consumed by users can decrease at equilibrium as gaming tricks become more costly for content creators to employ.
(2024-01-18T08:48:54Z)
- Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms [12.368291979686122]
We propose a model for learning with bandit feedback while accounting for deterministically evolving and unobservable states.
The workhorse applications of our model are learning for recommendation systems and learning for online ads.
(2023-07-21T15:43:32Z)
- Incentivizing High-Quality Content in Online Recommender Systems [80.19930280144123]
We study the game between producers and analyze the content created at equilibrium.
We show that standard online learning algorithms, such as Hedge and EXP3, unfortunately incentivize producers to create low-quality content.
(2023-06-13T00:55:10Z)
- Modeling Content Creator Incentives on Algorithm-Curated Platforms [76.53541575455978]
We study how algorithmic choices affect the existence and character of (Nash) equilibria in exposure games.
We propose tools for numerically finding equilibria in exposure games, and illustrate results of an audit on the MovieLens and LastFM datasets.
(2022-06-27T08:16:59Z)
- Feedback Shaping: A Modeling Approach to Nurture Content Creation [10.31854532203776]
We propose a modeling approach to predict how feedback from content consumers incentivizes creators.
We then leverage this model to optimize the newsfeed experience for content creators by reshaping the feedback distribution.
We present a deployed use case on the LinkedIn newsfeed, where we used this approach to improve content creation significantly without compromising the consumers' experience.
(2021-06-21T22:53:16Z)
- SoMin.ai: Personality-Driven Content Generation Platform [60.49416044866648]
We showcase the world's first personality-driven marketing content generation platform, called SoMin.ai.
The platform combines a deep multi-view personality profiling framework with style generative adversarial networks.
It can be used for the enhancement of the social networking user experience as well as for content marketing routines.
(2020-11-30T08:33:39Z)
- Incentivising Exploration and Recommendations for Contextual Bandits with Payments [2.5966580648312223]
We show how the platform can learn the inherent attributes of items and achieve a sublinear regret while maximizing cumulative social welfare.
Our approach can improve various engagement metrics of users on e-commerce stores, recommendation engines and matching platforms.
(2020-01-22T02:26:22Z)
This list is automatically generated from the titles and abstracts of the papers on this site.