Uncovering Customer Issues through Topological Natural Language Analysis
- URL: http://arxiv.org/abs/2403.00804v1
- Date: Sat, 24 Feb 2024 00:15:09 GMT
- Title: Uncovering Customer Issues through Topological Natural Language Analysis
- Authors: Shu-Ting Pi, Sidarth Srinivasan, Yuying Zhu, Michael Yang, Qun Liu
- Abstract summary: We propose a novel machine learning algorithm to monitor emerging and trending customer issues.
Our approach involves an end-to-end deep learning framework that simultaneously tags the primary question sentence of each customer's transcript.
We have validated our results through various methods and found that they are highly consistent with news sources.
- Score: 17.694323151611275
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: E-commerce companies deal with a high volume of customer service requests
daily. While a simple annotation system is often used to summarize the topics
of customer contacts, thoroughly exploring each specific issue can be
challenging. This presents a critical concern, especially during an emerging
outbreak where companies must quickly identify and address specific issues. To
tackle this challenge, we propose a novel machine learning algorithm that
leverages natural language techniques and topological data analysis to monitor
emerging and trending customer issues. Our approach involves an end-to-end deep
learning framework that simultaneously tags the primary question sentence of
each customer's transcript and generates sentence embedding vectors. We then
whiten the embedding vectors and use them to construct an undirected graph.
From there, we define trending and emerging issues based on the topological
properties of each transcript. We have validated our results through various
methods and found that they are highly consistent with news sources.
Related papers
- Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits [23.71250100390303]
"Marking" is a novel grading task that enhances automated grading systems by performing an in-depth analysis of student responses.
We introduce a new dataset meticulously curated by Subject Matter Experts specifically for this task.
arXiv Detail & Related papers (2024-04-22T16:00:46Z) - Context Matters: Pushing the Boundaries of Open-Ended Answer Generation
with Graph-Structured Knowledge Context [4.368725325557961]
This paper introduces a novel framework that combines graph-driven context retrieval in conjunction to knowledge graphs based enhancement.
We conduct experiments on various Large Language Models (LLMs) with different parameter sizes to evaluate their ability to ground knowledge and determine factual accuracy in answers to open-ended questions.
Our methodology GraphContextGen consistently outperforms dominant text-based retrieval systems, demonstrating its robustness and adaptability to a larger number of use cases.
arXiv Detail & Related papers (2024-01-23T11:25:34Z) - DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain
Question Answering over Knowledge Base and Text [73.68051228972024]
Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when relying on their internal knowledge.
Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge.
arXiv Detail & Related papers (2023-10-31T04:37:57Z) - Summarization with Graphical Elements [55.5913491389047]
We propose a new task: summarization with graphical elements.
We collect a high quality human labeled dataset to support research into the task.
arXiv Detail & Related papers (2022-04-15T17:16:41Z) - On Decoding Strategies for Neural Text Generators [73.48162198041884]
We study the interaction between language generation tasks and decoding strategies.
We measure changes in attributes of generated text as a function of both decoding strategy and task.
Our results reveal both previously-observed and surprising findings.
arXiv Detail & Related papers (2022-03-29T16:25:30Z) - A combined approach to the analysis of speech conversations in a contact
center domain [2.575030923243061]
We describe an experimentation with a speech analytics process for an Italian contact center, that deals with call recordings extracted from inbound or outbound flows.
First, we illustrate in detail the development of an in-house speech-to-text solution, based on Kaldi framework.
Then, we evaluate and compare different approaches to the semantic tagging of call transcripts.
Finally, a decision tree inducer, called J48S, is applied to the problem of tagging.
arXiv Detail & Related papers (2022-03-12T10:03:20Z) - Compression, Transduction, and Creation: A Unified Framework for
Evaluating Natural Language Generation [85.32991360774447]
Natural language generation (NLG) spans a broad range of tasks, each of which serves for specific objectives.
We propose a unifying perspective based on the nature of information change in NLG tasks.
We develop a family of interpretable metrics that are suitable for evaluating key aspects of different NLG tasks.
arXiv Detail & Related papers (2021-09-14T01:00:42Z) - An NLP approach to quantify dynamic salience of predefined topics in a
text corpus [0.0]
We use natural language processing techniques to quantify how a set of pre-defined topics of interest change over time across a large corpus of text.
We find that given a predefined topic, we can identify and rank sets of terms, or n-grams, that map to those topics and have usage patterns that deviate from a normal baseline.
arXiv Detail & Related papers (2021-08-16T21:00:06Z) - Inquisitive Question Generation for High Level Text Comprehension [60.21497846332531]
We introduce INQUISITIVE, a dataset of 19K questions that are elicited while a person is reading through a document.
We show that readers engage in a series of pragmatic strategies to seek information.
We evaluate question generation models based on GPT-2 and show that our model is able to generate reasonable questions.
arXiv Detail & Related papers (2020-10-04T19:03:39Z) - Quda: Natural Language Queries for Visual Data Analytics [33.983060903399554]
We present a new dataset, called Quda, that aims to help V-NLIs recognize analytic tasks from free-form natural language.
Our dataset contains $14,035$ diverse user queries, and each is annotated with one or multiple analytic tasks.
This work is the first attempt to construct a large-scale corpus for recognizing analytic tasks.
arXiv Detail & Related papers (2020-05-07T05:35:16Z) - Neural Data-to-Text Generation via Jointly Learning the Segmentation and
Correspondence [48.765579605145454]
We propose to explicitly segment target text into fragment units and align them with their data correspondences.
The resulting architecture maintains the same expressive power as neural attention models.
On both E2E and WebNLG benchmarks, we show the proposed model consistently outperforms its neural attention counterparts.
arXiv Detail & Related papers (2020-05-03T14:28:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.