Multi-turn Dialogue Comprehension from a Topic-aware Perspective
- URL: http://arxiv.org/abs/2309.09666v1
- Date: Mon, 18 Sep 2023 11:03:55 GMT
- Title: Multi-turn Dialogue Comprehension from a Topic-aware Perspective
- Authors: Xinbei Ma, Yi Xu, Hai Zhao, Zhuosheng Zhang
- Abstract summary: This paper proposes to model multi-turn dialogues from a topic-aware perspective.
We use a dialogue segmentation algorithm to split a dialogue passage into topic-concentrated fragments in an unsupervised way.
We also present a novel model, Topic-Aware Dual-Attention Matching (TADAM) Network, which takes topic segments as processing elements.
- Score: 70.37126956655985
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dialogue related Machine Reading Comprehension requires language models to
effectively decouple and model multi-turn dialogue passages. As a dialogue
development goes after the intentions of participants, its topic may not keep
constant through the whole passage. Hence, it is non-trivial to detect and
leverage the topic shift in dialogue modeling. Topic modeling, although has
been widely studied in plain text, deserves far more utilization in dialogue
reading comprehension. This paper proposes to model multi-turn dialogues from a
topic-aware perspective. We start with a dialogue segmentation algorithm to
split a dialogue passage into topic-concentrated fragments in an unsupervised
way. Then we use these fragments as topic-aware language processing units in
further dialogue comprehension. On one hand, the split segments indict specific
topics rather than mixed intentions, thus showing convenient on in-domain topic
detection and location. For this task, we design a clustering system with a
self-training auto-encoder, and we build two constructed datasets for
evaluation. On the other hand, the split segments are an appropriate element of
multi-turn dialogue response selection. For this purpose, we further present a
novel model, Topic-Aware Dual-Attention Matching (TADAM) Network, which takes
topic segments as processing elements and matches response candidates with a
dual cross-attention. Empirical studies on three public benchmarks show great
improvements over baselines. Our work continues the previous studies on
document topic, and brings the dialogue modeling to a novel topic-aware
perspective with exhaustive experiments and analyses.
Related papers
- WavChat: A Survey of Spoken Dialogue Models [66.82775211793547]
Recent advancements in spoken dialogue models, exemplified by systems like GPT-4o, have captured significant attention in the speech domain.
These advanced spoken dialogue models not only comprehend audio, music, and other speech-related features, but also capture stylistic and timbral characteristics in speech.
Despite the progress in spoken dialogue systems, there is a lack of comprehensive surveys that systematically organize and analyze these systems.
arXiv Detail & Related papers (2024-11-15T04:16:45Z) - With a Little Help from my (Linguistic) Friends: Topic Segmentation of
Multi-party Casual Conversations [0.565395466029518]
This paper tries to reach a comparable level of accuracy as recent machine learning based topic segmentation models.
The features we identify as meaningful for this task help us understand better the topical structure of a conversation.
arXiv Detail & Related papers (2024-02-05T09:48:07Z) - Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance
Representation [51.22712675266523]
Dialogue Topic (DTS) plays an essential role in a variety of dialogue modeling tasks.
We propose a novel unsupervised DTS framework, which learns topic-aware utterance representations from unlabeled dialogue data.
arXiv Detail & Related papers (2023-05-04T11:35:23Z) - A Benchmark for Understanding and Generating Dialogue between Characters
in Stories [75.29466820496913]
We present the first study to explore whether machines can understand and generate dialogue in stories.
We propose two new tasks including Masked Dialogue Generation and Dialogue Speaker Recognition.
We show the difficulty of the proposed tasks by testing existing models with automatic and manual evaluation on DialStory.
arXiv Detail & Related papers (2022-09-18T10:19:04Z) - Who says like a style of Vitamin: Towards Syntax-Aware
DialogueSummarization using Multi-task Learning [2.251583286448503]
We focus on the association between utterances from individual speakers and unique syntactic structures.
Speakers have unique textual styles that can contain linguistic information, such as voiceprint.
We employ multi-task learning of both syntax-aware information and dialogue summarization.
arXiv Detail & Related papers (2021-09-29T05:30:39Z) - DialogLM: Pre-trained Model for Long Dialogue Understanding and
Summarization [19.918194137007653]
We present a pre-training framework for long dialogue understanding and summarization.
Considering the nature of long conversations, we propose a window-based denoising approach for generative pre-training.
We conduct extensive experiments on five datasets of long dialogues, covering tasks of dialogue summarization, abstractive question answering and topic segmentation.
arXiv Detail & Related papers (2021-09-06T13:55:03Z) - Multi-View Sequence-to-Sequence Models with Conversational Structure for
Abstractive Dialogue Summarization [72.54873655114844]
Text summarization is one of the most challenging and interesting problems in NLP.
This work proposes a multi-view sequence-to-sequence model by first extracting conversational structures of unstructured daily chats from different views to represent conversations.
Experiments on a large-scale dialogue summarization corpus demonstrated that our methods significantly outperformed previous state-of-the-art models via both automatic evaluations and human judgment.
arXiv Detail & Related papers (2020-10-04T20:12:44Z) - Topic-Aware Multi-turn Dialogue Modeling [91.52820664879432]
This paper presents a novel solution for multi-turn dialogue modeling, which segments and extracts topic-aware utterances in an unsupervised way.
Our topic-aware modeling is implemented by a newly proposed unsupervised topic-aware segmentation algorithm and Topic-Aware Dual-attention Matching (TADAM) Network.
arXiv Detail & Related papers (2020-09-26T08:43:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.