Related papers: FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement

URL: http://arxiv.org/abs/2409.11699v2
Date: Wed, 05 Mar 2025 23:46:26 GMT
Title: FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement
Authors: Liam Hebert, Marialena Kyriakidi, Hubert Pham, Krishna Sayana, James Pine, Sukhdeep Sodhi, Ambarish Jash,
Abstract summary: Flare is a novel hybrid sequence recommender that integrates a language model with a collaborative filtering model using a Perceiver network. We revisit the often-used Bert4Rec baseline and show that Bert4Rec significantly outperforms previously reported numbers. This paper also showcases Flare's inherent ability to support critiquing, enabling users to provide feedback and refine recommendations.
Score: 0.7372033475418547
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent proposals in recommender systems represent items with their textual description, using a large language model. They show better results on standard benchmarks compared to an item ID-only model, such as Bert4Rec. In this work, we revisit the often-used Bert4Rec baseline and show that with further tuning, Bert4Rec significantly outperforms previously reported numbers, and in some datasets, is competitive with state-of-the-art models. With revised baselines for item ID-only models, this paper also establishes new competitive results for architectures that combine IDs and textual descriptions. We demonstrate this with Flare (Fusing Language models and collaborative Architectures for Recommender Enhancement). Flare is a novel hybrid sequence recommender that integrates a language model with a collaborative filtering model using a Perceiver network. Prior studies focus evaluation on datasets with limited-corpus size, but many commercially-applicable recommender systems common on the web must handle larger corpora. We evaluate Flare on a more realistic dataset with a significantly larger item vocabulary, introducing new baselines for this setting. This paper also showcases Flare's inherent ability to support critiquing, enabling users to provide feedback and refine recommendations. We leverage critiquing as an evaluation method to assess the model's language understanding and its transferability to the recommendation task.

Related papers

REGEN: A Dataset and Benchmarks with Natural Language Critiques and Narratives [4.558818396613368]
We extend the Amazon Product Reviews dataset by inpainting two key natural language features. The narratives include product endorsements, purchase explanations, and summaries of user preferences.
arXiv Detail & Related papers (2025-03-14T23:47:46Z)
Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems [65.75265303064654]
Conversational recommender systems (CRS) aim to provide personalized recommendations via interactive dialogues with users. Large language models (LLMs) enhance CRS with their superior understanding of context-aware user preferences. We propose CRAG, Collaborative Retrieval Augmented Generation for LLM-based CRS.
arXiv Detail & Related papers (2025-02-19T22:47:40Z)
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems [4.912663905306209]
We introduce a new dataset (REGEN) for natural language generation tasks in conversational recommendations. We establish benchmarks using well-known generative metrics, and perform an automated evaluation of the new dataset using a rater LLM. To the best of our knowledge, this represents the first attempt to analyze the capabilities of LLMs in understanding recommender signals and generating rich narratives.
arXiv Detail & Related papers (2024-10-22T07:53:41Z)
EasyRec: Simple yet Effective Language Models for Recommendation [6.311058599430178]
EasyRec is an effective and easy-to-use approach that seamlessly integrates text-based semantic understanding with collaborative signals. EasyRec employs a text-behavior alignment framework, which combines contrastive learning with collaborative language model tuning. The study highlights the potential of seamlessly integrating EasyRec as a plug-and-play component into text-enhanced collaborative filtering frameworks.
arXiv Detail & Related papers (2024-08-16T16:09:59Z)
DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System [83.34921966305804]
Large language models (LLMs) have demonstrated remarkable performance in recommender systems. We propose a novel plug-and-play alignment framework for LLMs and collaborative models. Our method is superior to existing state-of-the-art algorithms.
arXiv Detail & Related papers (2024-08-15T15:56:23Z)
Language Representations Can be What Recommenders Need: Findings and Potentials [57.90679739598295]
We show that item representations, when linearly mapped from advanced LM representations, yield superior recommendation performance. This outcome suggests the possible homomorphism between the advanced language representation space and an effective item representation space for recommendation. Our findings highlight the connection between language modeling and behavior modeling, which can inspire both natural language processing and recommender system communities.
arXiv Detail & Related papers (2024-07-07T17:05:24Z)
CELA: Cost-Efficient Language Model Alignment for CTR Prediction [71.85120354973073]
Click-Through Rate (CTR) prediction holds a paramount position in recommender systems. Recent efforts have sought to mitigate these challenges by integrating Pre-trained Language Models (PLMs). We propose Cost-Efficient Language Model Alignment (CELA) for CTR prediction.
arXiv Detail & Related papers (2024-05-17T07:43:25Z)
RLVF: Learning from Verbal Feedback without Overgeneralization [94.19501420241188]
We study the problem of incorporating verbal feedback without such overgeneralization. We develop a new method, Contextualized Critiques with Constrained Preference Optimization (C3PO). Our approach effectively applies verbal feedback to relevant scenarios while preserving existing behaviors for other contexts.
arXiv Detail & Related papers (2024-02-16T18:50:24Z)
Contextualization Distillation from Large Language Model for Knowledge Graph Completion [51.126166442122546]
We introduce the Contextualization Distillation strategy, a plug-in-and-play approach compatible with both discriminative and generative KGC frameworks. Our method begins by instructing large language models to transform compact, structural triplets into context-rich segments. Comprehensive evaluations across diverse datasets and KGC techniques highlight the efficacy and adaptability of our approach.
arXiv Detail & Related papers (2024-01-28T08:56:49Z)
CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation [60.2700801392527]
We introduce CoLLM, an innovative LLMRec methodology that seamlessly incorporates collaborative information into LLMs for recommendation. CoLLM captures collaborative information through an external traditional model and maps it to the input token embedding space of LLM. Extensive experiments validate that CoLLM adeptly integrates collaborative information into LLMs, resulting in enhanced recommendation performance.
arXiv Detail & Related papers (2023-10-30T12:25:00Z)
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models [70.19437817951673]
We argue that it is hard to judge the large conditional generative models from the simple metrics since these models are often trained on very large datasets with multi-aspect abilities. Our approach involves generating a diverse and comprehensive list of 700 prompts for text-to-video generation. Then, we evaluate the state-of-the-art video generative models on our carefully designed benchmark, in terms of visual qualities, content qualities, motion qualities, and text-video alignment with 17 well-selected objective metrics.
arXiv Detail & Related papers (2023-10-17T17:50:46Z)
Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling [18.297332953450514]
We propose LANCER, which leverages the semantic understanding capabilities of pre-trained language models to generate personalized recommendations. Our approach bridges the gap between language models and recommender systems, resulting in more human-like recommendations.
arXiv Detail & Related papers (2023-09-19T08:54:47Z)
Text Matching Improves Sequential Recommendation by Reducing Popularity Biases [48.272381505993366]
TASTE verbalizes items and user-item interactions using identifiers and attributes of items. Our experiments show that TASTE outperforms the state-of-the-art methods on widely used sequential recommendation datasets.
arXiv Detail & Related papers (2023-08-27T07:44:33Z)
Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation [2.3991565023534083]
We propose a new model named Attentive Graph-based Text-aware Recommendation Model (AGTM). In this work, we aim to further improve top-N recommendation by effectively modeling both item textual content and high-order connectivity in the user-item graph.
arXiv Detail & Related papers (2023-05-22T12:32:06Z)
Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking [56.80065604034095]
We introduce a kNN approach that re-ranks documents based on their similarity with the query and the documents the user considers relevant. To evaluate our different integration strategies, we transform four existing information retrieval datasets into the relevance feedback scenario.
arXiv Detail & Related papers (2022-10-19T16:19:37Z)
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code [161.1761414080574]
Generation, Evaluation, and Metrics Benchmark introduces a modular infrastructure for dataset, model, and metric developers. GEMv2 supports 40 documented datasets in 51 languages. Models for all datasets can be evaluated online and our interactive data card creation and rendering tools make it easier to add new datasets to the living benchmark.
arXiv Detail & Related papers (2022-06-22T17:52:30Z)
Utilizing Textual Reviews in Latent Factor Models for Recommender Systems [1.7361353199214251]
We propose a recommender algorithm that combines a rating modelling technique with a topic modelling method based on textual reviews. We evaluate the performance of the algorithm using Amazon.com datasets with different sizes, corresponding to 23 product categories.
arXiv Detail & Related papers (2021-11-16T15:07:51Z)
Tracing Origins: Coref-aware Machine Reading Comprehension [43.352833140317486]
We imitate the human reading process of connecting anaphoric expressions and leverage coreference information to enhance the word embeddings from the pre-trained model. We demonstrate that explicitly incorporating coreference information in the fine-tuning stage performs better than incorporating it when training a pre-trained language model.
arXiv Detail & Related papers (2021-10-15T09:28:35Z)
A Survey on Neural Recommendation: From Collaborative Filtering to Content and Context Enriched Recommendation [70.69134448863483]
Research in recommendation has shifted to inventing new recommender models based on neural networks. In recent years, we have witnessed significant progress in developing neural recommender models.
arXiv Detail & Related papers (2021-04-27T08:03:52Z)
Self-supervised Learning for Large-scale Item Recommendations [18.19202958502061]
Large scale recommender models find most relevant items from huge catalogs. With millions to billions of items in the corpus, users tend to provide feedback for a very small set of them. We propose a multi-task self-supervised learning framework for large-scale item recommendations.
arXiv Detail & Related papers (2020-07-25T06:21:43Z)
Rich-Item Recommendations for Rich-Users: Exploiting Dynamic and Static Side Information [20.176329366180934]
We study the problem of recommendation system where the users and items to be recommended are rich data structures with multiple entity types. We provide a general formulation for the problem that captures the complexities of modern real-world recommendations. We present two real-world case studies of our formulation and the MEDRES architecture.
arXiv Detail & Related papers (2020-01-28T17:53:38Z)

This list is automatically generated from the titles and abstracts of the papers on this site.