kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest
Candidate Retrieval
- URL: http://arxiv.org/abs/2205.06205v3
- Date: Sat, 5 Aug 2023 19:10:12 GMT
- Title: kNN-Embed: Locally Smoothed Embedding Mixtures For Multi-interest
Candidate Retrieval
- Authors: Ahmed El-Kishky, Thomas Markovich, Kenny Leung, Frank Portman, Aria
Haghighi, Ying Xiao
- Abstract summary: kNN-Embed represents each user as a smoothed mixture over learned item clusters that represent distinct "interests" of the user.
We experimentally compare kNN-Embed to standard ANN candidate retrieval, and show significant improvements in overall recall and improved diversity across three datasets.
- Score: 7.681386867564213
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Candidate retrieval is the first stage in recommendation systems, where a
light-weight system is used to retrieve potentially relevant items for an input
user. These candidate items are then ranked and pruned in later stages of
recommender systems using a more complex ranking model. As the top of the
recommendation funnel, it is important to retrieve a high-recall candidate set
to feed into downstream ranking models. A common approach is to leverage
approximate nearest neighbor (ANN) search from a single dense query embedding;
however, this approach this can yield a low-diversity result set with many near
duplicates. As users often have multiple interests, candidate retrieval should
ideally return a diverse set of candidates reflective of the user's multiple
interests. To this end, we introduce kNN-Embed, a general approach to improving
diversity in dense ANN-based retrieval. kNN-Embed represents each user as a
smoothed mixture over learned item clusters that represent distinct "interests"
of the user. By querying each of a user's mixture component in proportion to
their mixture weights, we retrieve a high-diversity set of candidates
reflecting elements from each of a user's interests. We experimentally compare
kNN-Embed to standard ANN candidate retrieval, and show significant
improvements in overall recall and improved diversity across three datasets.
Accompanying this work, we open source a large Twitter follow-graph dataset
(https://huggingface.co/datasets/Twitter/TwitterFollowGraph), to spur further
research in graph-mining and representation learning for recommender systems.
Related papers
- Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential
Recommendations [50.03560306423678]
We propose Ada-Retrieval, an adaptive multi-round retrieval paradigm for recommender systems.
Ada-Retrieval iteratively refines user representations to better capture potential candidates in the full item space.
arXiv Detail & Related papers (2024-01-12T15:26:40Z) - Revisiting Neural Retrieval on Accelerators [20.415728886298915]
A key component of retrieval is to model (user, item) similarity.
Despite its popularity, dot products cannot capture complex user-item interactions, which are multifaceted and likely high rank.
We propose textitmixture of logits (MoL), which models (user, item) similarity as an adaptive composition of elementary similarity functions.
arXiv Detail & Related papers (2023-06-06T22:08:42Z) - Recommender Systems with Generative Retrieval [58.454606442670034]
We propose a novel generative retrieval approach, where the retrieval model autoregressively decodes the identifiers of the target candidates.
To that end, we create semantically meaningful of codewords to serve as a Semantic ID for each item.
We show that recommender systems trained with the proposed paradigm significantly outperform the current SOTA models on various datasets.
arXiv Detail & Related papers (2023-05-08T21:48:17Z) - Hierarchical Multi-Interest Co-Network For Coarse-Grained Ranking [27.00455925014862]
Large-scale online recommendation services usually consist of three stages: candidate generation, coarse-grained ranking, and fine-grained ranking.
Previous research shows that users' interests are diverse, and one vector is not sufficient to capture users' different preferences.
This paper proposes a Hierarchical Multi-Interest Co-Network (HCN) to capture users' diverse interests in the coarse-grained ranking stage.
arXiv Detail & Related papers (2022-10-19T13:40:16Z) - The Minority Matters: A Diversity-Promoting Collaborative Metric
Learning Algorithm [154.47590401735323]
Collaborative Metric Learning (CML) has recently emerged as a popular method in recommendation systems.
This paper focuses on a challenging scenario where a user has multiple categories of interests.
We propose a novel method called textitDiversity-Promoting Collaborative Metric Learning (DPCML)
arXiv Detail & Related papers (2022-09-30T08:02:18Z) - Everyone's Preference Changes Differently: Weighted Multi-Interest
Retrieval Model [18.109035867113217]
Multi-Interest Preference (MIP) model is an approach that produces multi-interest for users by using the user's sequential engagement more effectively.
Extensive experiments have been done on various industrial-scale datasets to demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2022-07-14T04:29:54Z) - Modeling Dynamic User Preference via Dictionary Learning for Sequential
Recommendation [133.8758914874593]
Capturing the dynamics in user preference is crucial to better predict user future behaviors because user preferences often drift over time.
Many existing recommendation algorithms -- including both shallow and deep ones -- often model such dynamics independently.
This paper considers the problem of embedding a user's sequential behavior into the latent space of user preferences.
arXiv Detail & Related papers (2022-04-02T03:23:46Z) - Sparse-Interest Network for Sequential Recommendation [78.83064567614656]
We propose a novel textbfSparse textbfInterest textbfNEtwork (SINE) for sequential recommendation.
Our sparse-interest module can adaptively infer a sparse set of concepts for each user from the large concept pool.
SINE can achieve substantial improvement over state-of-the-art methods.
arXiv Detail & Related papers (2021-02-18T11:03:48Z) - Controllable Multi-Interest Framework for Recommendation [64.30030600415654]
We formalize the recommender system as a sequential recommendation problem.
We propose a novel controllable multi-interest framework for the sequential recommendation, called ComiRec.
Our framework has been successfully deployed on the offline Alibaba distributed cloud platform.
arXiv Detail & Related papers (2020-05-19T10:18:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.