M2TRec: Metadata-aware Multi-task Transformer for Large-scale and
Cold-start free Session-based Recommendations
- URL: http://arxiv.org/abs/2209.11824v1
- Date: Fri, 23 Sep 2022 19:34:29 GMT
- Title: M2TRec: Metadata-aware Multi-task Transformer for Large-scale and
Cold-start free Session-based Recommendations
- Authors: Walid Shalaby, Sejoon Oh, Amir Afsharinejad, Srijan Kumar, Xiquan Cui
- Abstract summary: Session-based recommender systems (SBRSs) have shown superior performance over conventional methods.
We propose M2TRec, a Metadata-aware Multi-task Transformer model for session-based recommendations.
- Score: 9.327321259021236
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Session-based recommender systems (SBRSs) have shown superior performance
over conventional methods. However, they show limited scalability on
large-scale industrial datasets since most models learn one embedding per item.
This leads to a large memory requirement (of storing one vector per item) and
poor performance on sparse sessions with cold-start or unpopular items. Using
one public and one large industrial dataset, we experimentally show that
state-of-the-art SBRSs have low performance on sparse sessions with sparse
items. We propose M2TRec, a Metadata-aware Multi-task Transformer model for
session-based recommendations. Our proposed method learns a transformation
function from item metadata to embeddings, and is thus, item-ID free (i.e.,
does not need to learn one embedding per item). It integrates item metadata to
learn shared representations of diverse item attributes. During inference, new
or unpopular items will be assigned identical representations for the
attributes they share with items previously observed during training, and thus
will have similar representations with those items, enabling recommendations of
even cold-start and sparse items. Additionally, M2TRec is trained in a
multi-task setting to predict the next item in the session along with its
primary category and subcategories. Our multi-task strategy makes the model
converge faster and significantly improves the overall performance.
Experimental results show significant performance gains using our proposed
approach on sparse items on the two datasets.
Related papers
- Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential
Recommendations [50.03560306423678]
We propose Ada-Retrieval, an adaptive multi-round retrieval paradigm for recommender systems.
Ada-Retrieval iteratively refines user representations to better capture potential candidates in the full item space.
arXiv Detail & Related papers (2024-01-12T15:26:40Z) - Multi-task Item-attribute Graph Pre-training for Strict Cold-start Item
Recommendation [71.5871100348448]
ColdGPT models item-attribute correlations into an item-attribute graph by extracting fine-grained attributes from item contents.
ColdGPT transfers knowledge into the item-attribute graph from various available data sources, i.e., item contents, historical purchase sequences, and review texts of the existing items.
Extensive experiments show that ColdGPT consistently outperforms the existing SCS recommenders by large margins.
arXiv Detail & Related papers (2023-06-26T07:04:47Z) - Revisiting Neural Retrieval on Accelerators [20.415728886298915]
A key component of retrieval is to model (user, item) similarity.
Despite its popularity, dot products cannot capture complex user-item interactions, which are multifaceted and likely high rank.
We propose textitmixture of logits (MoL), which models (user, item) similarity as an adaptive composition of elementary similarity functions.
arXiv Detail & Related papers (2023-06-06T22:08:42Z) - Language-aware Multiple Datasets Detection Pretraining for DETRs [4.939595148195813]
We propose a framework for utilizing Multiple datasets to pretrain DETR-like detectors, termed METR.
It converts the typical multi-classification in object detection into binary classification by introducing a pre-trained language model.
We show METR achieves extraordinary results on either multi-task joint training or the pretrain & finetune paradigm.
arXiv Detail & Related papers (2023-04-07T10:34:04Z) - The Minority Matters: A Diversity-Promoting Collaborative Metric
Learning Algorithm [154.47590401735323]
Collaborative Metric Learning (CML) has recently emerged as a popular method in recommendation systems.
This paper focuses on a challenging scenario where a user has multiple categories of interests.
We propose a novel method called textitDiversity-Promoting Collaborative Metric Learning (DPCML)
arXiv Detail & Related papers (2022-09-30T08:02:18Z) - Sequential Modeling with Multiple Attributes for Watchlist
Recommendation in E-Commerce [67.6615871959902]
We study the watchlist functionality in e-commerce and introduce a novel watchlist recommendation task.
Our goal is to prioritize which watchlist items the user should pay attention to next by predicting the next items the user will click.
Our proposed recommendation model, Trans2D, is built on top of the Transformer architecture.
arXiv Detail & Related papers (2021-10-18T10:02:15Z) - Self-supervised Learning for Large-scale Item Recommendations [18.19202958502061]
Large scale recommender models find most relevant items from huge catalogs.
With millions to billions of items in the corpus, users tend to provide feedback for a very small set of them.
We propose a multi-task self-supervised learning framework for large-scale item recommendations.
arXiv Detail & Related papers (2020-07-25T06:21:43Z) - Incorporating User Micro-behaviors and Item Knowledge into Multi-task
Learning for Session-based Recommendation [18.516121495514007]
Session-based recommendation (SR) aims to predict the next interacted item based on a given session.
Most existing SR models only focus on exploiting the consecutive items in a session interacted by a certain user.
We propose a novel SR model MKM-SR, which incorporates user Micro-behaviors and item Knowledge into Multi-task learning for Session-based Recommendation.
arXiv Detail & Related papers (2020-06-12T03:06:23Z) - Joint Item Recommendation and Attribute Inference: An Adaptive Graph
Convolutional Network Approach [61.2786065744784]
In recommender systems, users and items are associated with attributes, and users show preferences to items.
As annotating user (item) attributes is a labor intensive task, the attribute values are often incomplete with many missing attribute values.
We propose an Adaptive Graph Convolutional Network (AGCN) approach for joint item recommendation and attribute inference.
arXiv Detail & Related papers (2020-05-25T10:50:01Z) - Selecting Relevant Features from a Multi-domain Representation for
Few-shot Classification [91.67977602992657]
We propose a new strategy based on feature selection, which is both simpler and more effective than previous feature adaptation approaches.
We show that a simple non-parametric classifier built on top of such features produces high accuracy and generalizes to domains never seen during training.
arXiv Detail & Related papers (2020-03-20T15:44:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.