Related papers: Foresight Prediction Enhanced Live-Streaming Recommendation

Foresight Prediction Enhanced Live-Streaming Recommendation

URL: http://arxiv.org/abs/2512.06700v1
Date: Sun, 07 Dec 2025 07:25:38 GMT
Title: Foresight Prediction Enhanced Live-Streaming Recommendation
Authors: Jiangxia Cao, Ruochen Yang, Xiang Chen, Changxin Lao, Yueyang Liu, Yusheng Huang, Yuanhao Tian, Xiangyu Wu, Shuang Yang, Zhaojie Liu, Guorui Zhou,
Abstract summary: Live-streaming, due to the dynamics of content and time, poses higher requirements for the recommendation algorithm of the platform.<n>We perform semantic quantization on live-streaming segments to obtain Semantic ids (Sid), encode the historical Sid sequence to capture the author's characteristics, and model Sid evolution trend to enable foresight prediction of future content.
Score: 18.07489662404993
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Live-streaming, as an emerging media enabling real-time interaction between authors and users, has attracted significant attention. Unlike the stable playback time of traditional TV live or the fixed content of short video, live-streaming, due to the dynamics of content and time, poses higher requirements for the recommendation algorithm of the platform - understanding the ever-changing content in real time and push it to users at the appropriate moment. Through analysis, we find that users have a better experience and express more positive behaviors during highlight moments of the live-streaming. Furthermore, since the model lacks access to future content during recommendation, yet user engagement depends on how well subsequent content aligns with their interests, an intuitive solution is to predict future live-streaming content. Therefore, we perform semantic quantization on live-streaming segments to obtain Semantic ids (Sid), encode the historical Sid sequence to capture the author's characteristics, and model Sid evolution trend to enable foresight prediction of future content. This foresight enhances the ranking model through refined features. Extensive offline and online experiments demonstrate the effectiveness of our method.

Related papers

OneLive: Dynamically Unified Generative Framework for Live-Streaming Recommendation [49.95897358060393]
We propose OneLive, a dynamically unified generative recommendation framework tailored for live-streaming scenario.<n>OneLive integrates four key components: (i) A Dynamic Tokenizer that continuously encodes evolving real-time live content fused with behavior signal through residual quantization; (ii) A Time-Aware Gated Attention mechanism that explicitly models temporal dynamics for timely decision making; (iii) An efficient decoder-only generative architecture enhanced with Sequential MTP and QK Norm for stable training and accelerated inference.
arXiv Detail & Related papers (2026-02-09T12:56:39Z)
Harnessing Synthetic Preference Data for Enhancing Temporal Understanding of Video-LLMs [54.502280390499756]
We propose TimeWarp to create a targeted synthetic temporal dataset to fine-tune the model's responses to encourage it to focus on the given input video.<n>We demonstrate that when our method is applied to existing models, it significantly improves performance on temporal understanding benchmarks.
arXiv Detail & Related papers (2025-10-04T21:48:40Z)
Leveraging Scene Context with Dual Networks for Sequential User Behavior Modeling [58.72480539725212]
We propose a novel Dual Sequence Prediction networks (DSPnet) to capture the dynamic interests and interplay between scenes and items for future behavior prediction.<n>DSPnet consists of two parallel networks dedicated to learn users' dynamic interests over items and scenes, and a sequence feature enhancement module to capture the interplay for enhanced future behavior prediction.
arXiv Detail & Related papers (2025-09-30T12:26:57Z)
KuaiLive: A Real-time Interactive Dataset for Live Streaming Recommendation [7.94801228491541]
KuaiLive is the first real-time, interactive dataset collected from Kuaishou, a leading live streaming platform in China.<n>The dataset records the interaction logs of 23,772 users and 452,621 streamers over a 21-day period.<n>It can support a wide range of tasks in the live streaming domain, such as top-K recommendation, click-through rate prediction, watch time prediction, and gift price prediction.
arXiv Detail & Related papers (2025-08-07T17:59:36Z)
StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding [52.55809460075286]
We propose a StreamAgent that anticipates the temporal intervals and spatial regions expected to contain future task-relevant information.<n>We integrate question semantics and historical observations through prompting the anticipatory agent to anticipate the temporal progression of key events.<n>Our method outperforms existing methods in response accuracy and real-time efficiency, highlighting its practical value for real-world streaming scenarios.
arXiv Detail & Related papers (2025-08-03T18:15:42Z)
LLM-Alignment Live-Streaming Recommendation [20.817796284487468]
Integrated short-video and live-streaming platforms have gained massive global adoption, offering dynamic content creation and consumption.<n>The same live-streaming vastly different experiences depending on when a user watching.<n>To optimize recommendations, a RecSys must accurately interpret the real-time semantics of live content and align them with user preferences.
arXiv Detail & Related papers (2025-04-07T16:04:00Z)
Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases [69.46487306858789]
Conditional Autoregressive Slot Attention (CA-SA) is a framework that enhances the temporal consistency of extracted object-centric representations in video-centric vision tasks. We present qualitative and quantitative results showing that our proposed method outperforms the considered baselines on downstream tasks.
arXiv Detail & Related papers (2024-10-21T07:44:44Z)
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams [78.72965584414368]
We present Flash-VStream, a video-language model that simulates the memory mechanism of human. Compared to existing models, Flash-VStream achieves significant reductions in latency inference and VRAM consumption. We propose VStream-QA, a novel question answering benchmark specifically designed for online video streaming understanding.
arXiv Detail & Related papers (2024-06-12T11:07:55Z)
Look into the Future: Deep Contextualized Sequential Recommendation [28.726897673576865]
We propose a novel framework of sequential recommendation called Look into the Future (LIFT) LIFT builds and leverages the contexts of sequential recommendation. In our experiments, LIFT achieves significant performance improvement on click-through rate prediction and rating prediction tasks.
arXiv Detail & Related papers (2024-05-23T09:34:28Z)
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer [31.10377461705053]
We propose a ContentCTR model that leverages multimodal transformer for frame-level CTR prediction. We conduct extensive experiments on both real-world scenarios and public datasets, and our ContentCTR model outperforms traditional recommendation models in capturing real-time content changes.
arXiv Detail & Related papers (2023-06-26T03:04:53Z)

This list is automatically generated from the titles and abstracts of the papers in this site.