SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding
- URL: http://arxiv.org/abs/2507.20185v1
- Date: Sun, 27 Jul 2025 09:04:17 GMT
- Title: SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding
- Authors: Yuqi Yang, Weiqi Wang, Baixuan Xu, Wei Fan, Qing Zong, Chunkit Chan, Zheye Deng, Xin Liu, Yifan Gao, Changlong Yu, Chen Luo, Yang Li, Zheng Li, Qingyu Yin, Bing Yin, Yangqiu Song,
- Abstract summary: We introduce the concept of an intention tree and propose a dataset curation pipeline.<n>We construct a sibling multimodal benchmark, SessionIntentBench, that evaluates L(V)LMs' capability on understanding inter-session intention shift.<n>With 1,952,177 intention entries, 1,132,145 session intention trajectories, and 13,003,664 available tasks mined using 10,905 sessions, we provide a scalable way to exploit the existing session data.
- Score: 64.45047674586671
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Session history is a common way of recording user interacting behaviors throughout a browsing activity with multiple products. For example, if an user clicks a product webpage and then leaves, it might because there are certain features that don't satisfy the user, which serve as an important indicator of on-the-spot user preferences. However, all prior works fail to capture and model customer intention effectively because insufficient information exploitation and only apparent information like descriptions and titles are used. There is also a lack of data and corresponding benchmark for explicitly modeling intention in E-commerce product purchase sessions. To address these issues, we introduce the concept of an intention tree and propose a dataset curation pipeline. Together, we construct a sibling multimodal benchmark, SessionIntentBench, that evaluates L(V)LMs' capability on understanding inter-session intention shift with four subtasks. With 1,952,177 intention entries, 1,132,145 session intention trajectories, and 13,003,664 available tasks mined using 10,905 sessions, we provide a scalable way to exploit the existing session data for customer intention understanding. We conduct human annotations to collect ground-truth label for a subset of collected data to form an evaluation gold set. Extensive experiments on the annotated data further confirm that current L(V)LMs fail to capture and utilize the intention across the complex session setting. Further analysis show injecting intention enhances LLMs' performances.
Related papers
- Hierarchical Intent-guided Optimization with Pluggable LLM-Driven Semantics for Session-based Recommendation [22.653549796453426]
Session-based Recommendation (SBR) aims to predict the next item a user will likely engage with, using their interaction sequence within an anonymous session.<n>Existing SBR models often focus only on single-session information, ignoring inter-session relationships and valuable cross-session insights.<n>We propose a novel hierarchical intent-guided optimization approach with pluggable LLM-driven semantic learning for session-based recommendations, called HIPHOP.
arXiv Detail & Related papers (2025-07-07T02:50:04Z) - Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale [51.9706400130481]
Large Language Models (LLMs) have emerged as personalized assistants for users across a wide range of tasks.<n> PERSONAMEM features curated user profiles with over 180 simulated user-LLM interaction histories.<n>We evaluate LLM chatbots' ability to identify the most suitable response according to the current state of the user's profile.
arXiv Detail & Related papers (2025-04-19T08:16:10Z) - Enhancing User Intent Capture in Session-Based Recommendation with
Attribute Patterns [77.19390850643944]
We propose the Frequent Attribute Pattern Augmented Transformer (FAPAT)
FAPAT characterizes user intents by building attribute transition graphs and matching attribute patterns.
We demonstrate that FAPAT consistently outperforms state-of-the-art methods by an average of 4.5% across various evaluation metrics.
arXiv Detail & Related papers (2023-12-23T03:28:18Z) - Understanding Inter-Session Intentions via Complex Logical Reasoning [50.199811535229045]
We present the task of logical session complex query answering (LS-CQA)
We frame the problem of complex intention understanding as an LS-CQA task on an aggregated hypergraph of sessions, items, and attributes.
We introduce a new model, the Logical Session Graph Transformer (LSGT), which captures interactions among items across different sessions and their logical connections.
arXiv Detail & Related papers (2023-12-21T14:03:30Z) - Context-aware Session-based Recommendation with Graph Neural Networks [6.825493772727133]
We propose CARES, a novel context-aware session-based recommendation model with graph neural networks.
We first construct a multi-relation cross-session graph to connect items according to intra- and cross-session item-level contexts.
To encode the variation of user interests, we design personalized item representations.
arXiv Detail & Related papers (2023-10-14T14:29:52Z) - Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for
Recommendation and Text Generation [127.35910314813854]
We present the Amazon Multi-locale Shopping Session dataset, namely Amazon-M2.
It is the first multilingual dataset consisting of millions of user sessions from six different locales.
Remarkably, the dataset can help us enhance personalization and understanding of user preferences.
arXiv Detail & Related papers (2023-07-19T00:08:49Z) - Fine-Grained Session Recommendations in E-commerce using Deep
Reinforcement Learning [0.028675177318965035]
Sustaining users' interest and keeping them engaged in the platform is very important for the success of an e-commerce business.
In this work, we focus primarily on the unknown intent setting where our objective is to recommend a sequence of products to a user in a session to sustain their interest.
We formulate this problem in the framework of the Markov Decision Process (MDP), a popular mathematical framework for sequential decision making.
arXiv Detail & Related papers (2022-10-20T13:22:13Z) - Multi-Interactive Attention Network for Fine-grained Feature Learning in
CTR Prediction [48.267995749975476]
In the Click-Through Rate (CTR) prediction scenario, user's sequential behaviors are well utilized to capture the user interest.
Existing methods mostly utilize attention on the behavior of users, which is not always suitable for CTR prediction.
We propose a Multi-Interactive Attention Network (MIAN) to comprehensively extract the latent relationship among all kinds of fine-grained features.
arXiv Detail & Related papers (2020-12-13T05:46:19Z) - Incorporating User Micro-behaviors and Item Knowledge into Multi-task
Learning for Session-based Recommendation [18.516121495514007]
Session-based recommendation (SR) aims to predict the next interacted item based on a given session.
Most existing SR models only focus on exploiting the consecutive items in a session interacted by a certain user.
We propose a novel SR model MKM-SR, which incorporates user Micro-behaviors and item Knowledge into Multi-task learning for Session-based Recommendation.
arXiv Detail & Related papers (2020-06-12T03:06:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.