Multi-task Item-attribute Graph Pre-training for Strict Cold-start Item
Recommendation
- URL: http://arxiv.org/abs/2306.14462v1
- Date: Mon, 26 Jun 2023 07:04:47 GMT
- Title: Multi-task Item-attribute Graph Pre-training for Strict Cold-start Item
Recommendation
- Authors: Yuwei Cao, Liangwei Yang, Chen Wang, Zhiwei Liu, Hao Peng, Chenyu You,
Philip S. Yu
- Abstract summary: ColdGPT models item-attribute correlations into an item-attribute graph by extracting fine-grained attributes from item contents.
ColdGPT transfers knowledge into the item-attribute graph from various available data sources, i.e., item contents, historical purchase sequences, and review texts of the existing items.
Extensive experiments show that ColdGPT consistently outperforms the existing SCS recommenders by large margins.
- Score: 71.5871100348448
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recommendation systems suffer in the strict cold-start (SCS) scenario, where
the user-item interactions are entirely unavailable. The ID-based approaches
completely fail to work. Cold-start recommenders, on the other hand, leverage
item contents to map the new items to the existing ones. However, the existing
SCS recommenders explore item contents in coarse-grained manners that introduce
noise or information loss. Moreover, informative data sources other than item
contents, such as users' purchase sequences and review texts, are ignored. We
explore the role of the fine-grained item attributes in bridging the gaps
between the existing and the SCS items and pre-train a knowledgeable
item-attribute graph for SCS item recommendation. Our proposed framework,
ColdGPT, models item-attribute correlations into an item-attribute graph by
extracting fine-grained attributes from item contents. ColdGPT then transfers
knowledge into the item-attribute graph from various available data sources,
i.e., item contents, historical purchase sequences, and review texts of the
existing items, via multi-task learning. To facilitate the positive transfer,
ColdGPT designs submodules according to the natural forms of the data sources
and coordinates the multiple pre-training tasks via unified
alignment-and-uniformity losses. Our pre-trained item-attribute graph acts as
an implicit, extendable item embedding matrix, which enables the SCS item
embeddings to be easily acquired by inserting these items and propagating their
attributes' embeddings. We carefully process three public datasets, i.e., Yelp,
Amazon-home, and Amazon-sports, to guarantee the SCS setting for evaluation.
Extensive experiments show that ColdGPT consistently outperforms the existing
SCS recommenders by large margins and even surpasses models that are
pre-trained on 75-224 times more, cross-domain data on two out of four
datasets.
Related papers
- Language-Model Prior Overcomes Cold-Start Items [14.370472820496802]
The growth ofRecSys is driven by digitization and the need for personalized content in areas such as e-commerce and video streaming.
Existing solutions for the cold-start problem, such as content-based recommenders and hybrid methods, leverage item metadata to determine item similarities.
This paper introduces a novel approach for cold-start item recommendation that utilizes the language model (LM) to estimate item similarities.
arXiv Detail & Related papers (2024-11-13T22:45:52Z) - Firzen: Firing Strict Cold-Start Items with Frozen Heterogeneous and Homogeneous Graphs for Recommendation [34.414081170244955]
We propose a unified framework incorporating multi-modal content of items and knowledge graphs (KGs) to solve both strict cold-start and warm-start recommendation.
Our model yields significant improvements for strict cold-start recommendation and outperforms or matches the state-of-the-art performance in the warm-start scenario.
arXiv Detail & Related papers (2024-10-10T06:48:27Z) - Enhanced E-Commerce Attribute Extraction: Innovating with Decorative
Relation Correction and LLAMA 2.0-Based Annotation [4.81846973621209]
We propose a pioneering framework that integrates BERT for classification, a Conditional Random Fields (CRFs) layer for attribute value extraction, and Large Language Models (LLMs) for data annotation.
Our approach capitalizes on the robust representation learning of BERT, synergized with the sequence decoding prowess of CRFs, to adeptly identify and extract attribute values.
Our methodology is rigorously validated on various datasets, including Walmart, BestBuy's e-commerce NER dataset, and the CoNLL dataset.
arXiv Detail & Related papers (2023-12-09T08:26:30Z) - Product Information Extraction using ChatGPT [69.12244027050454]
This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions.
Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.
arXiv Detail & Related papers (2023-06-23T09:30:01Z) - CARCA: Context and Attribute-Aware Next-Item Recommendation via
Cross-Attention [7.573586022424399]
In recommender settings, users' context and item attributes play a crucial role in deciding which items to recommend next.
We propose a context and attribute-aware recommender model (CARCA) that can capture the dynamic nature of the user profiles in terms of contextual features and item attributes.
Experiments on four real-world recommender system datasets show that the proposed model significantly outperforms all state-of-the-art models in the task of item recommendation.
arXiv Detail & Related papers (2022-04-04T13:22:28Z) - Sequential Modeling with Multiple Attributes for Watchlist
Recommendation in E-Commerce [67.6615871959902]
We study the watchlist functionality in e-commerce and introduce a novel watchlist recommendation task.
Our goal is to prioritize which watchlist items the user should pay attention to next by predicting the next items the user will click.
Our proposed recommendation model, Trans2D, is built on top of the Transformer architecture.
arXiv Detail & Related papers (2021-10-18T10:02:15Z) - Pre-training Graph Transformer with Multimodal Side Information for
Recommendation [82.4194024706817]
We propose a pre-training strategy to learn item representations by considering both item side information and their relationships.
We develop a novel sampling algorithm named MCNSampling to select contextual neighbors for each item.
The proposed Pre-trained Multimodal Graph Transformer (PMGT) learns item representations with two objectives: 1) graph structure reconstruction, and 2) masked node feature reconstruction.
arXiv Detail & Related papers (2020-10-23T10:30:24Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z) - Joint Item Recommendation and Attribute Inference: An Adaptive Graph
Convolutional Network Approach [61.2786065744784]
In recommender systems, users and items are associated with attributes, and users show preferences to items.
As annotating user (item) attributes is a labor intensive task, the attribute values are often incomplete with many missing attribute values.
We propose an Adaptive Graph Convolutional Network (AGCN) approach for joint item recommendation and attribute inference.
arXiv Detail & Related papers (2020-05-25T10:50:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.