Multi-task Item-attribute Graph Pre-training for Strict Cold-start Item
Recommendation
- URL: http://arxiv.org/abs/2306.14462v1
- Date: Mon, 26 Jun 2023 07:04:47 GMT
- Title: Multi-task Item-attribute Graph Pre-training for Strict Cold-start Item
Recommendation
- Authors: Yuwei Cao, Liangwei Yang, Chen Wang, Zhiwei Liu, Hao Peng, Chenyu You,
Philip S. Yu
- Abstract summary: ColdGPT models item-attribute correlations into an item-attribute graph by extracting fine-grained attributes from item contents.
ColdGPT transfers knowledge into the item-attribute graph from various available data sources, i.e., item contents, historical purchase sequences, and review texts of the existing items.
Extensive experiments show that ColdGPT consistently outperforms the existing SCS recommenders by large margins.
- Score: 71.5871100348448
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recommendation systems suffer in the strict cold-start (SCS) scenario, where
the user-item interactions are entirely unavailable. The ID-based approaches
completely fail to work. Cold-start recommenders, on the other hand, leverage
item contents to map the new items to the existing ones. However, the existing
SCS recommenders explore item contents in coarse-grained manners that introduce
noise or information loss. Moreover, informative data sources other than item
contents, such as users' purchase sequences and review texts, are ignored. We
explore the role of the fine-grained item attributes in bridging the gaps
between the existing and the SCS items and pre-train a knowledgeable
item-attribute graph for SCS item recommendation. Our proposed framework,
ColdGPT, models item-attribute correlations into an item-attribute graph by
extracting fine-grained attributes from item contents. ColdGPT then transfers
knowledge into the item-attribute graph from various available data sources,
i.e., item contents, historical purchase sequences, and review texts of the
existing items, via multi-task learning. To facilitate the positive transfer,
ColdGPT designs submodules according to the natural forms of the data sources
and coordinates the multiple pre-training tasks via unified
alignment-and-uniformity losses. Our pre-trained item-attribute graph acts as
an implicit, extendable item embedding matrix, which enables the SCS item
embeddings to be easily acquired by inserting these items and propagating their
attributes' embeddings. We carefully process three public datasets, i.e., Yelp,
Amazon-home, and Amazon-sports, to guarantee the SCS setting for evaluation.
Extensive experiments show that ColdGPT consistently outperforms the existing
SCS recommenders by large margins and even surpasses models that are
pre-trained on 75-224 times more cross-domain data on two out of four
datasets.
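The abstract says SCS item embeddings are acquired by inserting the new items into the pre-trained item-attribute graph and propagating their attributes' embeddings. The sketch below illustrates that idea only, assuming a single mean-aggregation step; it is not the authors' implementation. The attribute names, embedding values, and the helpers `cold_item_embedding` and `score` are hypothetical.

```python
from typing import Dict, List

# Hypothetical pre-trained attribute embeddings (values are made up for illustration).
attribute_embeddings: Dict[str, List[float]] = {
    "yoga": [1.0, 0.0],
    "mat": [0.0, 1.0],
    "non-slip": [1.0, 1.0],
}


def cold_item_embedding(attributes: List[str],
                        table: Dict[str, List[float]]) -> List[float]:
    """Embed a new item as the mean of its known attributes' embeddings.

    Assumption: one round of mean aggregation stands in for the graph
    propagation described in the abstract.
    """
    # Attributes absent from the pre-trained graph are skipped.
    known = [table[a] for a in attributes if a in table]
    if not known:
        raise ValueError("item shares no attribute with the pre-trained graph")
    dim = len(known[0])
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


def score(user_vec: List[float], item_vec: List[float]) -> float:
    """Dot-product relevance between a user vector and an item embedding."""
    return sum(u * v for u, v in zip(user_vec, item_vec))


# A new item with no interactions: only its extracted attributes are available.
new_item = cold_item_embedding(["yoga", "mat", "unseen-attr"], attribute_embeddings)
user = [2.0, 4.0]  # hypothetical user vector
relevance = score(user, new_item)
```

Because the new item's embedding depends only on attribute embeddings that were already learned, no interaction history for the item is needed, which is what makes the graph behave like an extendable embedding matrix.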
Related papers
- Enhanced E-Commerce Attribute Extraction: Innovating with Decorative Relation Correction and LLAMA 2.0-Based Annotation [4.81846973621209]
We propose a pioneering framework that integrates BERT for classification, a Conditional Random Fields (CRFs) layer for attribute value extraction, and Large Language Models (LLMs) for data annotation.
Our approach capitalizes on the robust representation learning of BERT, synergized with the sequence decoding prowess of CRFs, to adeptly identify and extract attribute values.
Our methodology is rigorously validated on various datasets, including Walmart, BestBuy's e-commerce NER dataset, and the CoNLL dataset.
arXiv Detail & Related papers (2023-12-09T08:26:30Z)
- Product Attribute Value Extraction using Large Language Models [56.96665345570965]
State-of-the-art attribute/value extraction methods based on pre-trained language models (PLMs) face two drawbacks.
We explore the potential of using large language models (LLMs) as a more training data-efficient and more robust alternative to existing attribute/value extraction methods.
arXiv Detail & Related papers (2023-10-19T07:39:00Z)
- Product Information Extraction using ChatGPT [69.12244027050454]
This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions.
Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.
arXiv Detail & Related papers (2023-06-23T09:30:01Z)
- Semi-supervised Adversarial Learning for Complementary Item Recommendation [5.5174379874002435]
In certain online marketplaces, e.g., online auction sites, new items are constantly added to the catalog.
We propose a novel approach that can leverage both item side-information and labeled complementary item pairs.
Experiments on three e-commerce datasets show that our method is highly effective.
arXiv Detail & Related papers (2023-03-10T09:39:18Z)
- M2TRec: Metadata-aware Multi-task Transformer for Large-scale and Cold-start free Session-based Recommendations [9.327321259021236]
Session-based recommender systems (SBRSs) have shown superior performance over conventional methods.
We propose M2TRec, a Metadata-aware Multi-task Transformer model for session-based recommendations.
arXiv Detail & Related papers (2022-09-23T19:34:29Z)
- CARCA: Context and Attribute-Aware Next-Item Recommendation via Cross-Attention [7.573586022424399]
In recommender settings, users' context and item attributes play a crucial role in deciding which items to recommend next.
We propose a context and attribute-aware recommender model (CARCA) that can capture the dynamic nature of the user profiles in terms of contextual features and item attributes.
Experiments on four real-world recommender system datasets show that the proposed model significantly outperforms all state-of-the-art models in the task of item recommendation.
arXiv Detail & Related papers (2022-04-04T13:22:28Z)
- Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce [67.6615871959902]
We study the watchlist functionality in e-commerce and introduce a novel watchlist recommendation task.
Our goal is to prioritize which watchlist items the user should pay attention to next by predicting the next items the user will click.
Our proposed recommendation model, Trans2D, is built on top of the Transformer architecture.
arXiv Detail & Related papers (2021-10-18T10:02:15Z)
- Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
- Joint Item Recommendation and Attribute Inference: An Adaptive Graph Convolutional Network Approach [61.2786065744784]
In recommender systems, users and items are associated with attributes, and users show preferences for items.
As annotating user (item) attributes is a labor-intensive task, the attribute values are often incomplete, with many values missing.
We propose an Adaptive Graph Convolutional Network (AGCN) approach for joint item recommendation and attribute inference.
arXiv Detail & Related papers (2020-05-25T10:50:01Z)
This list is automatically generated from the titles and abstracts of the papers on this site.