Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue
Embeddings
- URL: http://arxiv.org/abs/2210.15332v1
- Date: Thu, 27 Oct 2022 11:14:06 GMT
- Title: Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue
Embeddings
- Authors: Che Liu, Rui Wang, Junfeng Jiang, Yongbin Li, Fei Huang
- Abstract summary: We introduce the task of learning unsupervised dialogue embeddings.
Trivial approaches such as combining pre-trained word or sentence embeddings and encoding through pre-trained language models have been shown to be feasible.
We propose a self-guided contrastive learning approach named dial2vec.
- Score: 41.79937481022846
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we introduce the task of learning unsupervised dialogue
embeddings. Trivial approaches such as combining pre-trained word or sentence
embeddings and encoding through pre-trained language models (PLMs) have been
shown to be feasible for this task. However, these approaches typically ignore
the conversational interactions between interlocutors, resulting in poor
performance. To address this issue, we proposed a self-guided contrastive
learning approach named dial2vec. Dial2vec considers a dialogue as an
information exchange process. It captures the conversational interaction
patterns between interlocutors and leverages them to guide the learning of the
embeddings corresponding to each interlocutor. The dialogue embedding is
obtained by an aggregation of the embeddings from all interlocutors. To verify
our approach, we establish a comprehensive benchmark consisting of six
widely-used dialogue datasets. We consider three evaluation tasks: domain
categorization, semantic relatedness, and dialogue retrieval. Dial2vec achieves
on average 8.7, 9.0, and 13.8 points absolute improvements in terms of purity,
Spearman's correlation, and mean average precision (MAP) over the strongest
baseline on the three tasks respectively. Further analysis shows that dial2vec
obtains informative and discriminative embeddings for both interlocutors under
the guidance of the conversational interactions and achieves the best
performance when aggregating them through the interlocutor-level pooling
strategy. All codes and data are publicly available at
https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/dial2vec.
Related papers
- An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting [3.5399864027190366]
This study proposes a novel unsupervised dialog topic segmentation method that combines the Utterance Rewriting (UR) technique with an unsupervised learning algorithm.
The proposed Discourse Rewriting Topic Model (UR-DTS) significantly improves the accuracy of topic segmentation.
arXiv Detail & Related papers (2024-09-12T00:27:31Z) - Learning to Memorize Entailment and Discourse Relations for
Persona-Consistent Dialogues [8.652711997920463]
Existing works have improved the performance of dialogue systems by intentionally learning interlocutor personas with sophisticated network structures.
This study proposes a method of learning to memorize entailment and discourse relations for persona-consistent dialogue tasks.
arXiv Detail & Related papers (2023-01-12T08:37:00Z) - CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog
Evaluation [75.60156479374416]
CGoDial is a new challenging and comprehensive Chinese benchmark for Goal-oriented Dialog evaluation.
It contains 96,763 dialog sessions and 574,949 dialog turns totally, covering three datasets with different knowledge sources.
To bridge the gap between academic benchmarks and spoken dialog scenarios, we either collect data from real conversations or add spoken features to existing datasets via crowd-sourcing.
arXiv Detail & Related papers (2022-11-21T16:21:41Z) - SPACE-2: Tree-Structured Semi-Supervised Contrastive Pre-training for
Task-Oriented Dialog Understanding [68.94808536012371]
We propose a tree-structured pre-trained conversation model, which learns dialog representations from limited labeled dialogs and large-scale unlabeled dialog corpora.
Our method can achieve new state-of-the-art results on the DialoGLUE benchmark consisting of seven datasets and four popular dialog understanding tasks.
arXiv Detail & Related papers (2022-09-14T13:42:50Z) - DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings [33.89889949577356]
We propose DialogueCSE, a dialogue-based contrastive learning approach to tackle this issue.
We evaluate our model on three multi-turn dialogue datasets: the Microsoft Dialogue Corpus, the Jing Dong Dialogue Corpus, and the E-commerce Dialogue Corpus.
arXiv Detail & Related papers (2021-09-26T13:25:41Z) - Dialogue History Matters! Personalized Response Selectionin Multi-turn
Retrieval-based Chatbots [62.295373408415365]
We propose a personalized hybrid matching network (PHMN) for context-response matching.
Our contributions are two-fold: 1) our model extracts personalized wording behaviors from user-specific dialogue history as extra matching information.
We evaluate our model on two large datasets with user identification, i.e., personalized dialogue Corpus Ubuntu (P- Ubuntu) and personalized Weibo dataset (P-Weibo)
arXiv Detail & Related papers (2021-03-17T09:42:11Z) - DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act
Recognition and Sentiment Classification [77.59549450705384]
In dialog system, dialog act recognition and sentiment classification are two correlative tasks.
Most of the existing systems either treat them as separate tasks or just jointly model the two tasks.
We propose a Deep Co-Interactive Relation Network (DCR-Net) to explicitly consider the cross-impact and model the interaction between the two tasks.
arXiv Detail & Related papers (2020-08-16T14:13:32Z) - Rethinking Dialogue State Tracking with Reasoning [76.0991910623001]
This paper proposes to track dialogue states gradually with reasoning over dialogue turns with the help of the back-end data.
Empirical results demonstrate that our method significantly outperforms the state-of-the-art methods by 38.6% in terms of joint belief accuracy for MultiWOZ 2.1.
arXiv Detail & Related papers (2020-05-27T02:05:33Z) - Matching Questions and Answers in Dialogues from Online Forums [12.64602629459043]
Matching question-answer relations between two turns in conversations is not only the first step in analyzing dialogue structures, but also valuable for training dialogue systems.
This paper presents a QA matching model considering both distance information and dialogue history by two simultaneous attention mechanisms called mutual attention.
arXiv Detail & Related papers (2020-05-19T08:18:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.