Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance
Representation
- URL: http://arxiv.org/abs/2305.02747v1
- Date: Thu, 4 May 2023 11:35:23 GMT
- Title: Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance
Representation
- Authors: Haoyu Gao, Rui Wang, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang,
Yongbin Li
- Abstract summary: Dialogue Topic (DTS) plays an essential role in a variety of dialogue modeling tasks.
We propose a novel unsupervised DTS framework, which learns topic-aware utterance representations from unlabeled dialogue data.
- Score: 51.22712675266523
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Dialogue Topic Segmentation (DTS) plays an essential role in a variety of
dialogue modeling tasks. Previous DTS methods either focus on semantic
similarity or dialogue coherence to assess topic similarity for unsupervised
dialogue segmentation. However, the topic similarity cannot be fully identified
via semantic similarity or dialogue coherence. In addition, the unlabeled
dialogue data, which contains useful clues of utterance relationships, remains
underexploited. In this paper, we propose a novel unsupervised DTS framework,
which learns topic-aware utterance representations from unlabeled dialogue data
through neighboring utterance matching and pseudo-segmentation. Extensive
experiments on two benchmark datasets (i.e., DialSeg711 and Doc2Dial)
demonstrate that our method significantly outperforms the strong baseline
methods. For reproducibility, we provide our code and data
at:https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/dial-start.
Related papers
- An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting [3.5399864027190366]
This study proposes a novel unsupervised dialog topic segmentation method that combines the Utterance Rewriting (UR) technique with an unsupervised learning algorithm.
The proposed Discourse Rewriting Topic Model (UR-DTS) significantly improves the accuracy of topic segmentation.
arXiv Detail & Related papers (2024-09-12T00:27:31Z) - Multi-turn Dialogue Comprehension from a Topic-aware Perspective [70.37126956655985]
This paper proposes to model multi-turn dialogues from a topic-aware perspective.
We use a dialogue segmentation algorithm to split a dialogue passage into topic-concentrated fragments in an unsupervised way.
We also present a novel model, Topic-Aware Dual-Attention Matching (TADAM) Network, which takes topic segments as processing elements.
arXiv Detail & Related papers (2023-09-18T11:03:55Z) - VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic
Understanding with Scene and Topic Transitions [47.94531693056304]
Video-grounded dialogue understanding is a challenging problem that requires machine to perceive, parse and reason over situated semantics.
We present Video-grounded Scene&Topic AwaRe dialogue dataset, a large scale video-grounded dialogue understanding dataset based on 395 TV series.
arXiv Detail & Related papers (2023-05-30T05:40:37Z) - SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation [55.82577086422923]
We provide a feasible definition of dialogue segmentation points with the help of document-grounded dialogues.
We release a large-scale supervised dataset called SuperDialseg, containing 9,478 dialogues.
We also provide a benchmark including 18 models across five categories for the dialogue segmentation task.
arXiv Detail & Related papers (2023-05-15T06:08:01Z) - Graph Based Network with Contextualized Representations of Turns in
Dialogue [0.0]
Dialogue-based relation extraction (RE) aims to extract relation(s) between two arguments that appear in a dialogue.
We propose the TUrn COntext awaRE Graph Convolutional Network (TUCORE-GCN) modeled by paying attention to the way people understand dialogues.
arXiv Detail & Related papers (2021-09-09T03:09:08Z) - Topic-Aware Multi-turn Dialogue Modeling [91.52820664879432]
This paper presents a novel solution for multi-turn dialogue modeling, which segments and extracts topic-aware utterances in an unsupervised way.
Our topic-aware modeling is implemented by a newly proposed unsupervised topic-aware segmentation algorithm and Topic-Aware Dual-attention Matching (TADAM) Network.
arXiv Detail & Related papers (2020-09-26T08:43:06Z) - Dialogue-Based Relation Extraction [53.2896545819799]
We present the first human-annotated dialogue-based relation extraction (RE) dataset DialogRE.
We argue that speaker-related information plays a critical role in the proposed task, based on an analysis of similarities and differences between dialogue-based and traditional RE tasks.
Experimental results demonstrate that a speaker-aware extension on the best-performing model leads to gains in both the standard and conversational evaluation settings.
arXiv Detail & Related papers (2020-04-17T03:51:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.