From IDs to Semantics: A Generative Framework for Cross-Domain Recommendation with Adaptive Semantic Tokenization
- URL: http://arxiv.org/abs/2511.08006v2
- Date: Sun, 16 Nov 2025 06:59:56 GMT
- Title: From IDs to Semantics: A Generative Framework for Cross-Domain Recommendation with Adaptive Semantic Tokenization
- Authors: Peiyu Hu, Wayne Lu, Jia Wang,
- Abstract summary: Cross-domain recommendation is crucial for improving recommendation accuracy and generalization.<n>Many efforts have focused on learning disentangled representations through multi-domain joint training to bridge the domain gaps.<n>Recent Large Language Model (LLM)-based approaches show promise, they still face critical challenges.<n>We propose textbfGenCDR, a novel textbfGenerative textbfCross-textbfDomain textbfRecommendation framework.
- Score: 3.4546102059619526
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cross-domain recommendation (CDR) is crucial for improving recommendation accuracy and generalization, yet traditional methods are often hindered by the reliance on shared user/item IDs, which are unavailable in most real-world scenarios. Consequently, many efforts have focused on learning disentangled representations through multi-domain joint training to bridge the domain gaps. Recent Large Language Model (LLM)-based approaches show promise, they still face critical challenges, including: (1) the \textbf{item ID tokenization dilemma}, which leads to vocabulary explosion and fails to capture high-order collaborative knowledge; and (2) \textbf{insufficient domain-specific modeling} for the complex evolution of user interests and item semantics. To address these limitations, we propose \textbf{GenCDR}, a novel \textbf{Gen}erative \textbf{C}ross-\textbf{D}omain \textbf{R}ecommendation framework. GenCDR first employs a \textbf{Domain-adaptive Tokenization} module, which generates disentangled semantic IDs for items by dynamically routing between a universal encoder and domain-specific adapters. Symmetrically, a \textbf{Cross-domain Autoregressive Recommendation} module models user preferences by fusing universal and domain-specific interests. Finally, a \textbf{Domain-aware Prefix-tree} enables efficient and accurate generation. Extensive experiments on multiple real-world datasets demonstrate that GenCDR significantly outperforms state-of-the-art baselines. Our code is available in the supplementary materials.
Related papers
- Generative Data Transformation: From Mixed to Unified Data [57.84692191369066]
textscTaesar is a emphdata-centric framework for textbftarget-textbfal textbfregeneration.<n>It encodes cross-domain context into target sequences, enabling standard models to learn intricate dependencies without complex fusion architectures.
arXiv Detail & Related papers (2026-02-26T08:30:09Z) - LLM-EDT: Large Language Model Enhanced Cross-domain Sequential Recommendation with Dual-phase Training [53.539682966282534]
Cross-domain Sequential Recommendation (CDSR) has been proposed to enrich user-item interactions by incorporating information from various domains.<n>Despite current progress, the imbalance issue and transition issue hinder further development of CDSR.<n>We propose an LLMs Enhanced Cross-domain Sequential Recommendation with Dual-phase Training (LLM-EDT)
arXiv Detail & Related papers (2025-11-25T05:18:04Z) - RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation [78.01030342481246]
RecBase is a domain-agnostic foundational model pretrained with a recommendation-oriented objective.<n>We introduce a unified item tokenizer that encodes items into hierarchical concept identifiers.<n>Our model matches or surpasses the performance of LLM baselines up to 7B parameters in zero-shot and cross-domain recommendation tasks.
arXiv Detail & Related papers (2025-09-03T08:33:43Z) - Generative Multi-Target Cross-Domain Recommendation [48.54929268144516]
This paper introduces GMC, a generative paradigm-based approach for multi-target cross-domain recommendation.<n>The core idea of GMC is to leverage semantically quantized discrete item identifiers as a medium for integrating multi-domain knowledge.<n>Extensive experiments on five public datasets demonstrate the effectiveness of GMC.
arXiv Detail & Related papers (2025-07-17T07:44:05Z) - Text-Driven Causal Representation Learning for Source-Free Domain Generalization [82.75041792888274]
We propose TDCRL, the first method to integrate causal inference into the source-free domain generalization setting.<n>Our approach offers a clear and effective mechanism to achieve robust, domain-invariant features, ensuring robust generalization.
arXiv Detail & Related papers (2025-07-14T06:20:42Z) - Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation [30.116213884571803]
Cross-domain Sequential Recommendation (CDSR) aims to extract the preference from the user's historical interactions across various domains.<n>Existing CDSR methods rely on users who own interactions on all domains to learn cross-domain item relationships.<n>With powerful representation and reasoning abilities, Large Language Models (LLMs) are promising to address these two problems.
arXiv Detail & Related papers (2025-04-25T14:30:25Z) - Exploring User Retrieval Integration towards Large Language Models for Cross-Domain Sequential Recommendation [66.72195610471624]
Cross-Domain Sequential Recommendation aims to mine and transfer users' sequential preferences across different domains.
We propose a novel framework named URLLM, which aims to improve the CDSR performance by exploring the User Retrieval approach.
arXiv Detail & Related papers (2024-06-05T09:19:54Z) - i$^2$VAE: Interest Information Augmentation with Variational Regularizers for Cross-Domain Sequential Recommendation [5.300964409946611]
i$2$VAE is a variational autoencoder that enhances user interest learning with mutual information-based regularizers.<n>Experiments demonstrate that i$2$VAE outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-05-31T09:07:03Z) - Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation [72.73379646418435]
Cross-domain sequential recommendation aims to uncover and transfer users' sequential preferences across domains.
misaligned item representations can potentially lead to sub-optimal sequential modeling and user representation alignment.
We propose a model-agnostic framework called textbfCross-domain item representation textbfAlignment for textbfCross-textbfDomain textbfSequential textbfRecommendation.
arXiv Detail & Related papers (2024-05-21T03:25:32Z) - Review-Based Hyperbolic Cross-Domain Recommendation [3.4498722449655066]
Cross-Domain Recommendation (CDR) captures domain-shareable knowledge and transfers it from a richer domain to a sparser one.<n>This paper advocates a hyperbolic CDR approach based on review texts for modeling user-item relationships.
arXiv Detail & Related papers (2024-03-29T17:15:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.