Large-scale Hybrid Approach for Predicting User Satisfaction with
Conversational Agents
- URL: http://arxiv.org/abs/2006.07113v1
- Date: Fri, 29 May 2020 16:29:09 GMT
- Title: Large-scale Hybrid Approach for Predicting User Satisfaction with
Conversational Agents
- Authors: Dookun Park, Hao Yuan, Dongmin Kim, Yinglei Zhang, Matsoukas Spyros,
Young-Bum Kim, Ruhi Sarikaya, Edward Guo, Yuan Ling, Kevin Quinn, Pham Hung,
Benjamin Yao, Sungjin Lee
- Abstract summary: Measuring user satisfaction level is a challenging task, and a critical component in developing large-scale conversational agent systems.
Human annotation based approaches are easier to control, but hard to scale.
A novel alternative approach is to collect user's direct feedback via a feedback elicitation system embedded to the conversational agent system.
- Score: 28.668681892786264
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Measuring user satisfaction level is a challenging task, and a critical
component in developing large-scale conversational agent systems serving the
needs of real users. An widely used approach to tackle this is to collect human
annotation data and use them for evaluation or modeling. Human annotation based
approaches are easier to control, but hard to scale. A novel alternative
approach is to collect user's direct feedback via a feedback elicitation system
embedded to the conversational agent system, and use the collected user
feedback to train a machine-learned model for generalization. User feedback is
the best proxy for user satisfaction, but is not available for some ineligible
intents and certain situations. Thus, these two types of approaches are
complementary to each other. In this work, we tackle the user satisfaction
assessment problem with a hybrid approach that fuses explicit user feedback,
user satisfaction predictions inferred by two machine-learned models, one
trained on user feedback data and the other human annotation data. The hybrid
approach is based on a waterfall policy, and the experimental results with
Amazon Alexa's large-scale datasets show significant improvements in inferring
user satisfaction. A detailed hybrid architecture, an in-depth analysis on user
feedback data, and an algorithm that generates data sets to properly simulate
the live traffic are presented in this paper.
Related papers
- Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation [65.16137964758612]
We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books.
Our objective is to test the capabilities of LLMs to analyze, understand, and reason over problems that require a detailed comprehension of long spans of text.
arXiv Detail & Related papers (2024-05-31T20:15:10Z) - CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems [60.27663010453209]
We leverage large language models (LLMs) to generate satisfaction-aware counterfactual dialogues.
We gather human annotations to ensure the reliability of the generated samples.
Our results shed light on the need for data augmentation approaches for user satisfaction estimation in TOD systems.
arXiv Detail & Related papers (2024-03-27T23:45:31Z) - UltraFeedback: Boosting Language Models with Scaled AI Feedback [99.4633351133207]
We present textscUltraFeedback, a large-scale, high-quality, and diversified AI feedback dataset.
Our work validates the effectiveness of scaled AI feedback data in constructing strong open-source chat language models.
arXiv Detail & Related papers (2023-10-02T17:40:01Z) - Latent User Intent Modeling for Sequential Recommenders [92.66888409973495]
Sequential recommender models learn to predict the next items a user is likely to interact with based on his/her interaction history on the platform.
Most sequential recommenders however lack a higher-level understanding of user intents, which often drive user behaviors online.
Intent modeling is thus critical for understanding users and optimizing long-term user experience.
arXiv Detail & Related papers (2022-11-17T19:00:24Z) - Incorporating Relevance Feedback for Information-Seeking Retrieval using
Few-Shot Document Re-Ranking [56.80065604034095]
We introduce a kNN approach that re-ranks documents based on their similarity with the query and the documents the user considers relevant.
To evaluate our different integration strategies, we transform four existing information retrieval datasets into the relevance feedback scenario.
arXiv Detail & Related papers (2022-10-19T16:19:37Z) - Ordinal Graph Gamma Belief Network for Social Recommender Systems [54.9487910312535]
We develop a hierarchical Bayesian model termed ordinal graph factor analysis (OGFA), which jointly models user-item and user-user interactions.
OGFA not only achieves good recommendation performance, but also extracts interpretable latent factors corresponding to representative user preferences.
We extend OGFA to ordinal graph gamma belief network, which is a multi-stochastic-layer deep probabilistic model.
arXiv Detail & Related papers (2022-09-12T09:19:22Z) - Adaptive Summaries: A Personalized Concept-based Summarization Approach
by Learning from Users' Feedback [0.0]
This paper proposes an interactive concept-based summarization model, called Adaptive Summaries.
The system learns from users' provided information gradually while interacting with the system by giving feedback in an iterative loop.
It helps users make high-quality summaries based on their preferences by maximizing the user-desired content in the generated summaries.
arXiv Detail & Related papers (2020-12-24T18:27:50Z) - Self-Supervised Contrastive Learning for Efficient User Satisfaction
Prediction in Conversational Agents [35.2098736872247]
We propose a self-supervised contrastive learning approach to learn user-agent interactions.
We show that the pre-trained models using the self-supervised objective are transferable to the user satisfaction prediction.
We also propose a novel few-shot transfer learning approach that ensures better transferability for very small sample sizes.
arXiv Detail & Related papers (2020-10-21T18:10:58Z) - Presentation of a Recommender System with Ensemble Learning and Graph
Embedding: A Case on MovieLens [3.8848561367220276]
Group classification and the ensemble learning technique were used for increasing prediction accuracy in recommender systems.
This study was performed on the MovieLens datasets, and the obtained results indicated the high efficiency of the presented method.
arXiv Detail & Related papers (2020-07-15T12:52:15Z) - Recommendation system using a deep learning and graph analysis approach [1.2183405753834562]
We propose a novel recommendation method based on Matrix Factorization and graph analysis methods.
In addition, we leverage deep Autoencoders to initialize users and items latent factors, and deep embedding method gathers users' latent factors from the user trust graph.
arXiv Detail & Related papers (2020-04-17T08:05:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.