Self-Supervised Bot Play for Conversational Recommendation with
  Justifications
- URL: http://arxiv.org/abs/2112.05197v1
- Date: Thu, 9 Dec 2021 20:07:41 GMT
- Title: Self-Supervised Bot Play for Conversational Recommendation with
  Justifications
- Authors: Shuyang Li, Bodhisattwa Prasad Majumder, Julian McAuley
- Abstract summary: We develop a new two-part framework for training conversational recommender systems.
First, we train a recommender system to jointly suggest items and justify its reasoning with subjective aspects.
We then fine-tune this model to incorporate iterative user feedback via self-supervised bot-play.
- Score: 3.015622397986615
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Conversational recommender systems offer the promise of interactive, engaging
ways for users to find items they enjoy. We seek to improve conversational
recommendation via three dimensions: 1) We aim to mimic a common mode of human
interaction for recommendation: experts justify their suggestions, a seeker
explains why they don't like the item, and both parties iterate through the
dialog to find a suitable item. 2) We leverage ideas from conversational
critiquing to allow users to flexibly interact with natural language
justifications by critiquing subjective aspects. 3) We adapt conversational
recommendation to a wider range of domains where crowd-sourced ground truth
dialogs are not available. We develop a new two-part framework for training
conversational recommender systems. First, we train a recommender system to
jointly suggest items and justify its reasoning with subjective aspects. We
then fine-tune this model to incorporate iterative user feedback via
self-supervised bot-play. Experiments on three real-world datasets demonstrate
that our system can be applied to different recommendation models across
diverse domains to achieve superior performance in conversational
recommendation compared to state-of-the-art methods. We also evaluate our model
on human users, showing that systems trained under our framework provide more
useful, helpful, and knowledgeable recommendations in warm- and cold-start
settings.
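
The interaction loop described in the abstract (the recommender justifies a suggestion, the seeker critiques a subjective aspect, and both iterate) can be illustrated with a toy simulation. This is only a minimal sketch under stated assumptions, not the authors' model or training procedure: the catalog `ITEMS`, the aspect-weight scoring, the fixed `penalty` update, and the function names are all hypothetical, and no model is fine-tuned here. It only shows how a seeker bot with a known target item could generate critique episodes without crowd-sourced dialogs.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical toy catalog: each item is described by a set of subjective aspects.
ITEMS: Dict[str, Set[str]] = {
    "diner": {"cheap", "loud", "casual"},
    "bistro": {"cozy", "quiet", "romantic"},
    "cafe": {"cheap", "quiet", "casual"},
}


def recommend(weights: Dict[str, float], rejected: Set[str]) -> Tuple[str, List[str]]:
    """Suggest the best unrejected item and justify it with its aspects."""
    candidates = sorted(i for i in ITEMS if i not in rejected)
    best = max(candidates, key=lambda i: sum(weights.get(a, 0.0) for a in ITEMS[i]))
    return best, sorted(ITEMS[best])


def seeker_critique(justification: List[str], target: str) -> Optional[str]:
    """Seeker bot: critique the first justified aspect the target item lacks."""
    for aspect in justification:
        if aspect not in ITEMS[target]:
            return aspect
    return None


def bot_play(weights: Dict[str, float], target: str,
             max_turns: int = 5, penalty: float = 2.0) -> List[Tuple[str, Optional[str]]]:
    """Simulate one episode; returns (suggested item, critiqued aspect) per turn."""
    weights = dict(weights)  # do not mutate the caller's preferences
    rejected: Set[str] = set()
    transcript: List[Tuple[str, Optional[str]]] = []
    for _ in range(max_turns):
        if len(rejected) == len(ITEMS):
            break  # nothing left to suggest
        item, justification = recommend(weights, rejected)
        if item == target:
            transcript.append((item, None))  # seeker accepts the suggestion
            break
        critique = seeker_critique(justification, target)
        transcript.append((item, critique))
        rejected.add(item)
        if critique is not None:
            # Assumed update rule: down-weight the critiqued aspect.
            weights[critique] = weights.get(critique, 0.0) - penalty
    return transcript


if __name__ == "__main__":
    history = {"cheap": 1.0, "loud": 0.5, "casual": 0.5}
    print(bot_play(history, target="cafe"))
```

In a learned system, the fixed penalty would presumably be replaced by a trained update of the user representation, and the simulated episodes would serve as the self-supervised signal for fine-tuning.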
 
      
Related papers
- Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation [22.213312621287482]
 This paper proposes a novel model to introduce contextual disentanglement for improving conversational recommender systems.
DisenCRS employs a dual disentanglement framework, including self-supervised contrastive disentanglement and counterfactual inference disentanglement.
 Experimental results on two widely used public datasets demonstrate that DisenCRS significantly outperforms existing conversational recommendation models.
 arXiv (2025-04-24T10:33:26Z)
- Leveraging Knowledge Graph Embedding for Effective Conversational Recommendation [4.079573593766921]
 We propose a knowledge-graph-based conversational recommender system (referred to as KG-CRS).
 Specifically, we first integrate the user-item graph and item-attribute graph into a dynamic graph that changes during the dialogue as negative items or attributes are removed.
We then learn informative embeddings of users, items, and attributes by also considering propagation through neighbors on the graph.
 arXiv (2024-08-02T15:38:55Z)
- ChatGPT for Conversational Recommendation: Refining Recommendations by
  Reprompting with Feedback [1.3654846342364308]
 Large Language Models (LLMs) like ChatGPT have gained popularity due to their ease of use and their ability to adapt dynamically to various tasks while responding to feedback.
We build a rigorous pipeline around ChatGPT to simulate how a user might realistically probe the model for recommendations.
We explore the effect of popularity bias in ChatGPT's recommendations, and compare its performance to baseline models.
 arXiv (2024-01-07T23:17:42Z)
- Large Language Models as Zero-Shot Conversational Recommenders [52.57230221644014]
 We present empirical studies on conversational recommendation tasks using representative large language models in a zero-shot setting.
We construct a new dataset of recommendation-related conversations by scraping a popular discussion website.
We observe that even without fine-tuning, large language models can outperform existing fine-tuned conversational recommendation models.
 arXiv (2023-08-19T15:29:45Z)
- Aligning Recommendation and Conversation via Dual Imitation [56.236932446280825]
 We propose DICR (Dual Imitation for Conversational Recommendation), which designs a dual imitation to explicitly align the recommendation paths and user interest shift paths.
By exchanging alignment signals, DICR achieves bidirectional promotion between recommendation and conversation modules.
Experiments demonstrate that DICR outperforms the state-of-the-art models on recommendation and conversation performance with automatic, human, and novel explainability metrics.
 arXiv (2022-11-05T08:13:46Z)
- Hierarchical Conversational Preference Elicitation with Bandit Feedback [36.507341041113825]
 We formulate a new conversational bandit problem that allows the recommender system to choose either a key-term or an item to recommend at each round.
We conduct a survey and analyze a real-world dataset to find that, unlike assumptions made in prior works, key-term rewards are mainly affected by rewards of representative items.
We propose two bandit algorithms, Hier-UCB and Hier-LinUCB, that leverage this observed relationship and the hierarchical structure between key-terms and items.
 arXiv (2022-09-06T05:35:24Z)
- Customized Conversational Recommender Systems [45.84713970070487]
 Conversational recommender systems (CRS) aim to capture a user's current intentions and provide recommendations through real-time multi-turn conversational interactions.
We propose a novel CRS model, the Customized Conversational Recommender System (CCRS), which customizes the CRS model for users from three perspectives.
To provide personalized recommendations, we extract the user's current fine-grained intentions from the dialogue context, guided by the user's inherent preferences.
 arXiv (2022-06-30T09:45:36Z)
- Improving Conversational Recommender Systems via Knowledge Graph based
  Semantic Fusion [77.21442487537139]
 Conversational recommender systems (CRS) aim to recommend high-quality items to users through interactive conversations.
First, the conversation data itself lacks sufficient contextual information for accurately understanding users' preferences.
Second, there is a semantic gap between natural language expression and item-level user preference.
 arXiv (2020-07-08T11:14:23Z)
- Seamlessly Unifying Attributes and Items: Conversational Recommendation
  for Cold-Start Users [111.28351584726092]
 We consider conversational recommendation for cold-start users, where a system can both ask a user about attributes and recommend items to that user interactively.
Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play.
 arXiv (2020-05-23T08:56:37Z)
- Towards Conversational Recommendation over Multi-Type Dialogs [78.52354759386296]
 We propose a new task of conversational recommendation over multi-type dialogs, where the bots can proactively and naturally lead a conversation from a non-recommendation dialog to a recommendation dialog.
To facilitate the study of this task, we create a human-to-human Chinese dialog dataset, DuRecDial (about 10k dialogs, 156k utterances).
In each dialog, the recommender proactively leads a multi-type dialog to approach recommendation targets and then makes multiple recommendations with rich interaction behavior.
 arXiv (2020-05-08T11:01:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     