PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based
cOnversational uNderstanding
- URL: http://arxiv.org/abs/2210.12308v1
- Date: Sat, 22 Oct 2022 00:14:47 GMT
- Title: PENTATRON: PErsonalized coNText-Aware Transformer for Retrieval-based
cOnversational uNderstanding
- Authors: Niranjan Uma Naresh, Ziyan Jiang, Ankit, Sungjin Lee, Jie Hao, Xing
Fan, Chenlei Guo
- Abstract summary: In a large fraction of the global traffic from customers using smart digital assistants, frictions in dialogues may be attributed to incorrect understanding.
We build and evaluate a scalable entity correction system, PENTATRON.
We show a significant upward movement of the key metric (Exact Match) by up to 500.97%.
- Score: 18.788620612619823
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Conversational understanding is an integral part of modern intelligent
devices. In a large fraction of the global traffic from customers using smart
digital assistants, frictions in dialogues may be attributed to incorrect
understanding of the entities in a customer's query due to factors including
ambiguous mentions, mispronunciation, background noise and faulty on-device
signal processing. Such errors are compounded by two common deficiencies from
intelligent devices namely, (1) the device not being tailored to individual
customers, and (2) the device responses being unaware of the context in the
conversation session. Viewing this problem via the lens of retrieval-based
search engines, we build and evaluate a scalable entity correction system,
PENTATRON. The system leverages a parametric transformer-based language model
to learn patterns from in-session customer-device interactions coupled with a
non-parametric personalized entity index to compute the correct query, which
aids downstream components in reasoning about the best response. In addition to
establishing baselines and demonstrating the value of personalized and
context-aware systems, we use multitasking to learn the domain of the correct
entity. We also investigate the utility of language model prompts. Through
extensive experiments, we show a significant upward movement of the key metric
(Exact Match) by up to 500.97% (relative to the baseline).
Related papers
- Robust Interaction-Based Relevance Modeling for Online e-Commerce Search [8.499253194630665]
Traditional text-matching techniques fail to capture the nuances of search intent accurately.
We introduce a robust interaction-based modeling paradigm to address these shortcomings.
To the best of our knowledge, this method is the first interaction-based approach for large e-commerce search relevance calculation.
arXiv Detail & Related papers (2024-06-04T09:24:04Z) - Improving Text Matching in E-Commerce Search with A Rationalizable,
Intervenable and Fast Entity-Based Relevance Model [78.80174696043021]
We propose a novel model called the Entity-Based Relevance Model (EBRM)
The decomposition allows us to use a Cross-encoder QE relevance module for high accuracy.
We also show that pretraining the QE module with auto-generated QE data from user logs can further improve the overall performance.
arXiv Detail & Related papers (2023-07-01T15:44:53Z) - Using Textual Interface to Align External Knowledge for End-to-End
Task-Oriented Dialogue Systems [53.38517204698343]
We propose a novel paradigm that uses a textual interface to align external knowledge and eliminate redundant processes.
We demonstrate our paradigm in practice through MultiWOZ-Remake, including an interactive textual interface built for the MultiWOZ database.
arXiv Detail & Related papers (2023-05-23T05:48:21Z) - Intent Recognition in Conversational Recommender Systems [0.0]
We introduce a pipeline to contextualize the input utterances in conversations.
We then take the next step towards leveraging reverse feature engineering to link the contextualized input and learning model to support intent recognition.
arXiv Detail & Related papers (2022-12-06T11:02:42Z) - Task-Oriented Over-the-Air Computation for Multi-Device Edge AI [57.50247872182593]
6G networks for supporting edge AI features task-oriented techniques that focus on effective and efficient execution of AI task.
Task-oriented over-the-air computation (AirComp) scheme is proposed in this paper for multi-device split-inference system.
arXiv Detail & Related papers (2022-11-02T16:35:14Z) - Analysis of Joint Speech-Text Embeddings for Semantic Matching [3.6423306784901235]
We study a joint speech-text embedding space trained for semantic matching by minimizing the distance between paired utterance and transcription inputs.
We extend our method to incorporate automatic speech recognition through both pretraining and multitask scenarios.
arXiv Detail & Related papers (2022-04-04T04:50:32Z) - Actionable Conversational Quality Indicators for Improving Task-Oriented
Dialog Systems [2.6094079735487994]
This paper introduces and explains the use of Actionable Conversational Quality Indicators (ACQIs)
ACQIs are used both to recognize parts of dialogs that can be improved, and to recommend how to improve them.
We demonstrate the effectiveness of using ACQIs on LivePerson internal dialog systems used in commercial customer service applications.
arXiv Detail & Related papers (2021-09-22T22:41:42Z) - Distantly Supervised Transformers For E-Commerce Product QA [5.460297795256275]
We propose a practical instant question answering (QA) system on product pages of ecommerce services.
For each user query, relevant community question answer (CQA) pairs are retrieved.
Our proposed transformer-based model learns a robust relevance function by jointly learning unified syntactic and semantic representations.
arXiv Detail & Related papers (2021-04-07T06:37:16Z) - End-to-End Object Detection with Transformers [88.06357745922716]
We present a new method that views object detection as a direct set prediction problem.
Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components.
The main ingredients of the new framework, called DEtection TRansformer or DETR, are a set-based global loss.
arXiv Detail & Related papers (2020-05-26T17:06:38Z) - Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term
Importance Estimation and Neural Query Rewriting [56.268862325167575]
We tackle conversational passage retrieval (ConvPR) with query reformulation integrated into a multi-stage ad-hoc IR system.
We propose two conversational query reformulation (CQR) methods: (1) term importance estimation and (2) neural query rewriting.
For the former, we expand conversational queries using important terms extracted from the conversational context with frequency-based signals.
For the latter, we reformulate conversational queries into natural, standalone, human-understandable queries with a pretrained sequence-tosequence model.
arXiv Detail & Related papers (2020-05-05T14:30:20Z) - IART: Intent-aware Response Ranking with Transformers in
Information-seeking Conversation Systems [80.0781718687327]
We analyze user intent patterns in information-seeking conversations and propose an intent-aware neural response ranking model "IART"
IART is built on top of the integration of user intent modeling and language representation learning with the Transformer architecture.
arXiv Detail & Related papers (2020-02-03T05:59:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.