MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding
- URL: http://arxiv.org/abs/2002.07408v2
- Date: Wed, 6 Jul 2022 05:07:10 GMT
- Title: MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding
- Authors: Haolin Zhou, Chaoqi Yang, Xiaofeng Gao, Qiong Chen, Gongshen Liu and
Guihai Chen
- Abstract summary: We propose a Multi-ecTive Actor-Critics algorithm named MoTiAC for the problem of bidding optimization with various goals.
Unlike previous RL models, the proposed MoTiAC can simultaneously fulfill multi-objective tasks in complicated bidding environments.
- Score: 47.555870679348416
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online Real-Time Bidding (RTB) is a complex auction game among which
advertisers struggle to bid for ad impressions when a user request occurs.
Considering display cost, Return on Investment (ROI), and other influential Key
Performance Indicators (KPIs), large ad platforms try to balance the trade-off
among various goals in dynamics. To address the challenge, we propose a
Multi-ObjecTive Actor-Critics algorithm based on reinforcement learning (RL),
named MoTiAC, for the problem of bidding optimization with various goals. In
MoTiAC, objective-specific agents update the global network asynchronously with
different goals and perspectives, leading to a robust bidding policy. Unlike
previous RL models, the proposed MoTiAC can simultaneously fulfill
multi-objective tasks in complicated bidding environments. In addition, we
mathematically prove that our model will converge to Pareto optimality.
Finally, experiments on a large-scale real-world commercial dataset from
Tencent verify the effectiveness of MoTiAC versus a set of recent approaches
Related papers
- MODRL-TA:A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search [13.893431289065997]
This paper propose a multi-objective deep reinforcement learning framework consisting of multi-objective Q-learning (MOQ), a decision fusion algorithm (DFM) based on the cross-entropy method(CEM) and a progressive data augmentation system(PDA)
Experiments on real-world online e-commerce systems demonstrate the significant improvements of MODRL-TA.
arXiv Detail & Related papers (2024-07-22T08:40:27Z) - A Bargaining-based Approach for Feature Trading in Vertical Federated
Learning [54.51890573369637]
We propose a bargaining-based feature trading approach in Vertical Federated Learning (VFL) to encourage economically efficient transactions.
Our model incorporates performance gain-based pricing, taking into account the revenue-based optimization objectives of both parties.
arXiv Detail & Related papers (2024-02-23T10:21:07Z) - Exploring Missing Modality in Multimodal Egocentric Datasets [89.76463983679058]
We introduce a novel concept -Missing Modality Token (MMT)-to maintain performance even when modalities are absent.
Our method mitigates the performance loss, reducing it from its original $sim 30%$ drop to only $sim 10%$ when half of the test set is modal-incomplete.
arXiv Detail & Related papers (2024-01-21T11:55:42Z) - Lightweight In-Context Tuning for Multimodal Unified Models [57.10831399642176]
MultiModal In-conteXt Tuning (M$2$IXT) is a lightweight module to enhance the ICL capabilities of multimodal unified models.
When tuned on as little as 50K multimodal data, M$2$IXT can boost the few-shot ICL performance significantly.
arXiv Detail & Related papers (2023-10-08T10:47:24Z) - UniMatch: A Unified User-Item Matching Framework for the Multi-purpose
Merchant Marketing [27.459774494479227]
We present a unified user-item matching framework to simultaneously conduct item recommendation and user targeting with just one model.
Our framework results in significant performance gains in comparison with the state-of-the-art methods, with greatly reduced cost on computing resources and daily maintenance.
arXiv Detail & Related papers (2023-07-19T13:49:35Z) - Functional Optimization Reinforcement Learning for Real-Time Bidding [14.5826735379053]
Real-time bidding is the new paradigm of programmatic advertising.
Existing approaches are struggling to provide a satisfactory solution for bidding optimization.
This paper proposes a multi-agent reinforcement learning architecture for RTB with functional optimization.
arXiv Detail & Related papers (2022-06-25T06:12:17Z) - Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product
Retrieval [152.3504607706575]
This research aims to conduct weakly-supervised multi-modal instance-level product retrieval for fine-grained product categories.
We first contribute the Product1M datasets, and define two real practical instance-level retrieval tasks.
We exploit to train a more effective cross-modal model which is adaptively capable of incorporating key concept information from the multi-modal data.
arXiv Detail & Related papers (2022-06-17T15:40:45Z) - Single-Modal Entropy based Active Learning for Visual Question Answering [75.1682163844354]
We address Active Learning in the multi-modal setting of Visual Question Answering (VQA)
In light of the multi-modal inputs, image and question, we propose a novel method for effective sample acquisition.
Our novel idea is simple to implement, cost-efficient, and readily adaptable to other multi-modal tasks.
arXiv Detail & Related papers (2021-10-21T05:38:45Z) - Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization
in e-Commercial Sponsored Search [26.117969395228503]
We propose a novel multi-objective cooperative bid optimization formulation called Multi-Agent Cooperative bidding Games (MACG)
A global objective to maximize the overall profit of all advertisements is added in order to encourage better cooperation and also to protect self-bidding advertisers.
offline experiments and online A/B tests conducted on the Taobao platform indicate both single advertiser's objective and global profit have been significantly improved.
arXiv Detail & Related papers (2021-06-08T03:18:28Z) - Multi-Purchase Behavior: Modeling, Estimation and Optimization [0.9337154228221861]
We present a parsimonious multi-purchase family of choice models called the Bundle-MVL-K family.
We develop a binary search based iterative strategy that efficiently computes optimized recommendations for this model.
arXiv Detail & Related papers (2020-06-14T23:47:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.