Related papers: OneRanker: Unified Generation and Ranking with One Model in Industrial Advertising Recommendation

URL: http://arxiv.org/abs/2603.02999v2
Date: Wed, 04 Mar 2026 09:01:13 GMT
Title: OneRanker: Unified Generation and Ranking with One Model in Industrial Advertising Recommendation
Authors: Dekai Sun, Yiming Liu, Jiafan Zhou, Xun Liu, Chenchen Yu, Yi Li, Huan Yu, Jun Zhang,
Abstract summary: We propose OneRanker, achieving architectural-level deep integration of generation and ranking. We construct a coarse-to-fine collaborative target awareness mechanism. The full deployment on Tencent's WeiXin channels advertising system has shown a significant improvement in key business metrics.
Score: 16.27240743307534
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The end-to-end generative paradigm is revolutionizing advertising recommendation systems, driving a shift from traditional cascaded architectures towards unified modeling. However, practical deployment faces three core challenges: the misalignment between interest objectives and business value, the target-agnostic limitation of generative processes, and the disconnection between generation and ranking stages. Existing solutions often fall into a dilemma where single-stage fusion induces optimization tension, while stage decoupling causes irreversible information loss. To address this, we propose OneRanker, achieving architectural-level deep integration of generation and ranking. First, we design a value-aware multi-task decoupling architecture. By leveraging task token sequences and causal mask, we separate interest coverage and value optimization spaces within shared representations, effectively alleviating target conflicts. Second, we construct a coarse-to-fine collaborative target awareness mechanism, utilizing Fake Item Tokens for implicit awareness during generation and a ranking decoder for explicit value alignment at the candidate level. Finally, we propose input-output dual-side consistency guarantees. Through Key/Value pass-through mechanisms and Distribution Consistency (DC) Constraint Loss, we achieve end-to-end collaborative optimization between generation and ranking. The full deployment on Tencent's WeiXin channels advertising system has shown a significant improvement in key business metrics (GMV - Normal +1.34%), providing a new paradigm with industrial feasibility for generative advertising recommendations.

Related papers

PIT: A Dynamic Personalized Item Tokenizer for End-to-End Generative Recommendation [10.959841655014387]
PIT is a dynamic Personalized Item Tokenizer framework for end-to-end generative recommendation. It employs a co-generative architecture that harmonizes collaborative patterns through collaborative signal alignment. Experiments on real-world datasets demonstrate that PIT consistently outperforms competitive baselines.
arXiv Detail & Related papers (2026-02-09T11:28:56Z)
PRISM: Purified Representation and Integrated Semantic Modeling for Generative Sequential Recommendation [28.629759086187352]
We propose a novel generative recommendation framework, PRISM, with Purified Representation and Integrated Semantic Modeling. PRISM consistently outperforms state-of-the-art baselines across four real-world datasets.
arXiv Detail & Related papers (2026-01-23T08:50:16Z)
MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization [56.074760766965085]
Group-Relative Policy Optimization has emerged as an efficient paradigm for aligning Large Language Models (LLMs). We propose MAESTRO, which treats reward scalarization as a dynamic latent policy, leveraging the model's terminal hidden states as a semantic bottleneck. We formulate this as a contextual bandit problem within a bi-level optimization framework, where a lightweight Conductor network co-evolves with the policy by utilizing group-relative advantages as a meta-reward signal.
arXiv Detail & Related papers (2026-01-12T05:02:48Z)
HarmonRank: Ranking-aligned Multi-objective Ensemble for Live-streaming E-commerce Recommendation [17.992877606615533]
Live-streaming e-commerce requires a ranking mechanism that balances purchases and user-streamer interactions. We propose a novel multi-objective ensemble framework, HarmonRank, that achieves both alignment with the ranking task and alignment among objectives. The proposed method has been fully deployed in Kuaishou's live-streaming e-commerce recommendation platform with 400 million DAUs, contributing over 2% purchase gain.
arXiv Detail & Related papers (2026-01-06T11:59:02Z)
Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models [64.92045568376705]
Coherent Contextual Decoding (CCD) is a novel inference framework built upon two core innovations. CCD employs a trajectory rectification mechanism that leverages historical context to enhance sequence coherence. Instead of rigid allocations based on diffusion steps, we introduce an adaptive sampling strategy that dynamically adjusts the unmasking budget for each step.
arXiv Detail & Related papers (2025-11-26T09:49:48Z)
GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation [38.48999566011862]
We propose GPR (Generative Pre-trained Recommender), a one-model framework that redefines advertising recommendation as an end-to-end generative task. We introduce three key innovations spanning unified representation, network architecture, and training strategy. GPR has been fully deployed in the Tencent Weixin Channels advertising system, delivering significant improvements in key business metrics.
arXiv Detail & Related papers (2025-11-13T09:50:53Z)
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models [51.76664843721462]
DeepThinkVLA is a new architecture for Vision-Language-Action models. It generates sequential CoT with causal attention and switches to bidirectional attention for fast decoding of action vectors. It achieves a 97.0% success rate on the LIBERO benchmark.
arXiv Detail & Related papers (2025-10-31T05:26:16Z)
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System [61.12400636463362]
OnePiece is a unified framework that seamlessly integrates LLM-style context engineering and reasoning into both retrieval and ranking models. OnePiece has been deployed in the main personalized search scenario of Shopee and achieves consistent online gains across different key business metrics.
arXiv Detail & Related papers (2025-09-22T17:59:07Z)
Practice on Long Behavior Sequence Modeling in Tencent Advertising [75.65309022911994]
Long-sequence modeling has become an indispensable frontier in recommendation systems for capturing users' long-term preferences. We propose several practical approaches within the two-stage framework for long-sequence modeling. Deployed in production on Tencent's large-scale advertising platforms, our innovations delivered significant performance gains.
arXiv Detail & Related papers (2025-09-10T06:55:57Z)
EGA-V1: Unifying Online Advertising with End-to-End Learning [17.943921299281207]
We present EGA-V1, an end-to-end generative architecture that unifies online advertising ranking as one model. EGA-V1 replaces cascaded stages with a single model to directly generate optimal ad sequences from the full candidate ad corpus.
arXiv Detail & Related papers (2025-05-26T09:33:54Z)
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration [63.112790050749695]
We introduce EAGER, a novel generative recommendation framework that seamlessly integrates both behavioral and semantic information. We validate the effectiveness of EAGER on four public benchmarks, demonstrating its superior performance compared to existing methods.
arXiv Detail & Related papers (2024-06-20T06:21:56Z)

This list is automatically generated from the titles and abstracts of the papers on this site.