Intent Mining from past conversations for conversational agent
- URL: http://arxiv.org/abs/2005.11014v4
- Date: Mon, 18 Jan 2021 13:45:27 GMT
- Title: Intent Mining from past conversations for conversational agent
- Authors: Ajay Chatterjee and Shubhashis Sengupta
- Abstract summary: Bots are increasingly being deployed to provide round-the-clock support and to increase customer engagement.
Many of the commercial bot building frameworks follow a standard approach that requires one to build and train an intent model to recognize a user input.
We have introduced a novel density-based clustering algorithm ITERDB-LabelSCAN for unbalanced data clustering.
- Score: 1.9754522186574608
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Conversational systems are of primary interest in the AI community. Chatbots
are increasingly being deployed to provide round-the-clock support and to
increase customer engagement. Many of the commercial bot building frameworks
follow a standard approach that requires one to build and train an intent model
to recognize a user input. Intent models are trained in a supervised setting
with a collection of textual utterance and intent label pairs. Gathering a
substantial and wide coverage of training data for different intent is a
bottleneck in the bot building process. Moreover, the cost of labeling a
hundred to thousands of conversations with intent is a time consuming and
laborious job. In this paper, we present an intent discovery framework that
involves 4 primary steps: Extraction of textual utterances from a conversation
using a pre-trained domain agnostic Dialog Act Classifier (Data Extraction),
automatic clustering of similar user utterances (Clustering), manual annotation
of clusters with an intent label (Labeling) and propagation of intent labels to
the utterances from the previous step, which are not mapped to any cluster
(Label Propagation); to generate intent training data from raw conversations.
We have introduced a novel density-based clustering algorithm ITER-DBSCAN for
unbalanced data clustering. Subject Matter Expert (Annotators with domain
expertise) manually looks into the clustered user utterances and provides an
intent label for discovery. We conducted user studies to validate the
effectiveness of the trained intent model generated in terms of coverage of
intents, accuracy and time saving concerning manual annotation. Although the
system is developed for building an intent model for the conversational system,
this framework can also be used for a short text clustering or as a labeling
framework.
Related papers
- IDAS: Intent Discovery with Abstractive Summarization [16.731183915325584]
We show that recent competitive methods in intent discovery can be outperformed by clustering utterances based on abstractive summaries.
We contribute the IDAS approach, which collects a set of descriptive utterance labels by prompting a Large Language Model.
The utterances and their resulting noisy labels are then encoded by a frozen pre-trained encoder, and subsequently clustered to recover the latent intents.
arXiv Detail & Related papers (2023-05-31T12:19:40Z) - Goal-Driven Explainable Clustering via Language Descriptions [50.980832345025334]
We propose a new task formulation, "Goal-Driven Clustering with Explanations" (GoalEx)
GoalEx represents both the goal and the explanations as free-form language descriptions.
Our method produces more accurate and goal-related explanations than prior methods.
arXiv Detail & Related papers (2023-05-23T07:05:50Z) - Building the Intent Landscape of Real-World Conversational Corpora with
Extractive Question-Answering Transformers [0.0]
We propose an unsupervised pipeline that extracts intents and the taxonomy of intents from real-world dialogues.
Our results demonstrate the generalization ability of an ELECTRA large model fine-tuned on the SQuAD2 dataset to understand dialogues.
arXiv Detail & Related papers (2022-08-26T22:53:19Z) - New Intent Discovery with Pre-training and Contrastive Learning [21.25371293641141]
New intent discovery aims to uncover novel intent categories from user utterances to expand the set of supported intent classes.
Existing approaches typically rely on a large amount of labeled utterances.
We propose a new contrastive loss to exploit self-supervisory signals in unlabeled data for clustering.
arXiv Detail & Related papers (2022-05-25T17:07:25Z) - What is wrong with you?: Leveraging User Sentiment for Automatic Dialog
Evaluation [73.03318027164605]
We propose to use information that can be automatically extracted from the next user utterance as a proxy to measure the quality of the previous system response.
Our model generalizes across both spoken and written open-domain dialog corpora collected from real and paid users.
arXiv Detail & Related papers (2022-03-25T22:09:52Z) - Author Clustering and Topic Estimation for Short Texts [69.54017251622211]
We propose a novel model that expands on the Latent Dirichlet Allocation by modeling strong dependence among the words in the same document.
We also simultaneously cluster users, removing the need for post-hoc cluster estimation.
Our method performs as well as -- or better -- than traditional approaches to problems arising in short text.
arXiv Detail & Related papers (2021-06-15T20:55:55Z) - Open Intent Discovery through Unsupervised Semantic Clustering and
Dependency Parsing [44.99113692679489]
This paper proposes an unsupervised two-stage approach to discover intents and generate intent labels automatically from a collection of unlabeled utterances.
We empirically show that the proposed unsupervised approach can generate meaningful intent labels automatically and achieves high precision and recall in utterance clustering and intent discovery.
arXiv Detail & Related papers (2021-04-25T09:36:23Z) - Discovering New Intents with Deep Aligned Clustering [19.11073686645496]
We propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of limited known intent data.
With an unknown number of new intents, we predict the number of intent categories by eliminating low-confidence intent-wise clusters.
Experiments on two benchmark datasets show that our method is more robust and achieves substantial improvements over the state-of-the-art methods.
arXiv Detail & Related papers (2020-12-16T14:32:06Z) - Learning with Weak Supervision for Email Intent Detection [56.71599262462638]
We propose to leverage user actions as a source of weak supervision to detect intents in emails.
We develop an end-to-end robust deep neural network model for email intent identification.
arXiv Detail & Related papers (2020-05-26T23:41:05Z) - AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent
Detection and Slot Filling [69.59096090788125]
In this paper, we propose an Adaptive Graph-Interactive Framework (AGIF) for joint multiple intent detection and slot filling.
We introduce an intent-slot graph interaction layer to model the strong correlation between the slot and intents.
Such an interaction layer is applied to each token adaptively, which has the advantage to automatically extract the relevant intents information.
arXiv Detail & Related papers (2020-04-21T15:07:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.