ConCET: Entity-Aware Topic Classification for Open-Domain Conversational
Agents
- URL: http://arxiv.org/abs/2005.13798v1
- Date: Thu, 28 May 2020 06:29:08 GMT
- Title: ConCET: Entity-Aware Topic Classification for Open-Domain Conversational
Agents
- Authors: Ali Ahmadvand, Harshita Sahijwani, Jason Ingyu Choi, Eugene Agichtein
- Abstract summary: We introduce ConCET: a Concurrent Entity-aware conversational Topic classifier.
We propose a simple and effective method for generating synthetic training data.
We evaluate ConCET on a large dataset of human-machine conversations with real users, collected as part of the Amazon Alexa Prize.
- Score: 9.870634472479571
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying the topic (domain) of each user's utterance in open-domain
conversational systems is a crucial step for all subsequent language
understanding and response tasks. In particular, for complex domains, an
utterance is often routed to a single component responsible for that domain.
Thus, correctly mapping a user utterance to the right domain is critical. To
address this problem, we introduce ConCET: a Concurrent Entity-aware
conversational Topic classifier, which incorporates entity-type information
together with the utterance content features. Specifically, ConCET utilizes
entity information to enrich the utterance representation, combining character,
word, and entity-type embeddings into a single representation. However, for
rich domains with millions of available entities, unrealistic amounts of
labeled training data would be required. To complement our model, we propose a
simple and effective method for generating synthetic training data, to augment
the typically limited amounts of labeled training data, using commonly
available knowledge bases to generate additional labeled utterances. We
extensively evaluate ConCET and our proposed training method first on an openly
available human-human conversational dataset called Self-Dialogue, to calibrate
our approach against previous state-of-the-art methods; second, we evaluate
ConCET on a large dataset of human-machine conversations with real users,
collected as part of the Amazon Alexa Prize. Our results show that ConCET
significantly improves topic classification performance on both datasets,
including 8-10% improvements over state-of-the-art deep learning methods. We
complement our quantitative results with detailed analysis of system
performance, which could be used for further improvements of conversational
agents.
Related papers
- Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach [56.55633052479446]
Web-scale visual entity recognition presents significant challenges due to the lack of clean, large-scale training data.
We propose a novel methodology to curate such a dataset, leveraging a multimodal large language model (LLM) for label verification, metadata generation, and rationale explanation.
Experiments demonstrate that models trained on this automatically curated data achieve state-of-the-art performance on web-scale visual entity recognition tasks.
arXiv Detail & Related papers (2024-10-31T06:55:24Z) - Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation [13.322409682814827]
We tackle the challenge of inadequate and costly training data for conversational question answering systems.
In this paper, we propose a robust dialog synthesising method.
We learn the segmentation of data for the dialog task instead of using segmenting at sentence boundaries.
arXiv Detail & Related papers (2024-06-06T02:52:45Z) - A Self-enhancement Approach for Domain-specific Chatbot Training via
Knowledge Mining and Digest [62.63606958140248]
Large Language Models (LLMs) often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains.
This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources.
We train a knowledge miner, namely LLMiner, which autonomously extracts Question-Answer pairs from relevant documents.
arXiv Detail & Related papers (2023-11-17T16:09:10Z) - PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment [58.46761798403072]
A model-based automatic dialogue evaluation metric (ADEM) is expected to perform well across multiple domains.
Despite significant progress, an ADEM that works well in one domain does not necessarily generalize to another.
We propose a Panel of Experts (PoE) network that consists of a shared transformer encoder and a collection of lightweight adapters.
arXiv Detail & Related papers (2022-12-18T02:26:50Z) - A Simple Baseline for Domain Adaptation in End to End ASR Systems Using
Synthetic Data [1.14219428942199]
We propose a simple baseline technique for domain adaptation in end-to-end speech recognition models.
We convert the text-only corpus to audio data using single speaker Text to Speech (TTS) engine.
We show that single speaker synthetic TTS data coupled with final dense layer only fine-tuning provides reasonable improvements in word error rates.
arXiv Detail & Related papers (2022-06-22T12:07:38Z) - A combined approach to the analysis of speech conversations in a contact
center domain [2.575030923243061]
We describe an experimentation with a speech analytics process for an Italian contact center, that deals with call recordings extracted from inbound or outbound flows.
First, we illustrate in detail the development of an in-house speech-to-text solution, based on Kaldi framework.
Then, we evaluate and compare different approaches to the semantic tagging of call transcripts.
Finally, a decision tree inducer, called J48S, is applied to the problem of tagging.
arXiv Detail & Related papers (2022-03-12T10:03:20Z) - On the Use of External Data for Spoken Named Entity Recognition [40.93448412171246]
Recent advances in self-supervised speech representations have made it feasible to consider learning models with limited labeled data.
We draw on a variety of approaches, including self-training, knowledge distillation, and transfer learning, and consider their applicability to both end-to-end models and pipeline approaches.
arXiv Detail & Related papers (2021-12-14T18:49:26Z) - Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and
Context-Aware Auto-Encoders [59.038157066874255]
We propose a novel framework called RankAE to perform chat summarization without employing manually labeled data.
RankAE consists of a topic-oriented ranking strategy that selects topic utterances according to centrality and diversity simultaneously.
A denoising auto-encoder is designed to generate succinct but context-informative summaries based on the selected utterances.
arXiv Detail & Related papers (2020-12-14T07:31:17Z) - Data-Efficient Methods for Dialogue Systems [4.061135251278187]
Conversational User Interface (CUI) has become ubiquitous in everyday life, in consumer-focused products like Siri and Alexa.
Deep learning underlies many recent breakthroughs in dialogue systems but requires very large amounts of training data, often annotated by experts.
In this thesis, we introduce a series of methods for training robust dialogue systems from minimal data.
arXiv Detail & Related papers (2020-12-05T02:51:09Z) - Exploiting Unsupervised Data for Emotion Recognition in Conversations [76.01690906995286]
Emotion Recognition in Conversations (ERC) aims to predict the emotional state of speakers in conversations.
The available supervised data for the ERC task is limited.
We propose a novel approach to leverage unsupervised conversation data.
arXiv Detail & Related papers (2020-10-02T13:28:47Z) - Improving Conversational Recommender Systems via Knowledge Graph based
Semantic Fusion [77.21442487537139]
Conversational recommender systems (CRS) aim to recommend high-quality items to users through interactive conversations.
First, the conversation data itself lacks of sufficient contextual information for accurately understanding users' preference.
Second, there is a semantic gap between natural language expression and item-level user preference.
arXiv Detail & Related papers (2020-07-08T11:14:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.