Discern: Discourse-Aware Entailment Reasoning Network for Conversational
Machine Reading
- URL: http://arxiv.org/abs/2010.01838v3
- Date: Fri, 16 Oct 2020 10:06:46 GMT
- Title: Discern: Discourse-Aware Entailment Reasoning Network for Conversational
Machine Reading
- Authors: Yifan Gao, Chien-Sheng Wu, Jingjing Li, Shafiq Joty, Steven C.H. Hoi,
Caiming Xiong, Irwin King, Michael R. Lyu
- Abstract summary: Discern is a discourse-aware entailment reasoning network to strengthen the connection and enhance the understanding for both document and dialog.
Our experiments show that Discern achieves state-of-the-art results of 78.3% macro-averaged accuracy on decision making and 64.0 BLEU1 on follow-up question generation.
- Score: 157.14821839576678
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Document interpretation and dialog understanding are the two major challenges
for conversational machine reading. In this work, we propose Discern, a
discourse-aware entailment reasoning network to strengthen the connection and
enhance the understanding for both document and dialog. Specifically, we split
the document into clause-like elementary discourse units (EDU) using a
pre-trained discourse segmentation model, and we train our model in a
weakly-supervised manner to predict whether each EDU is entailed by the user
feedback in a conversation. Based on the learned EDU and entailment
representations, we either reply to the user our final decision
"yes/no/irrelevant" of the initial question, or generate a follow-up question
to inquiry more information. Our experiments on the ShARC benchmark (blind,
held-out test set) show that Discern achieves state-of-the-art results of 78.3%
macro-averaged accuracy on decision making and 64.0 BLEU1 on follow-up question
generation. Code and models are released at
https://github.com/Yifan-Gao/Discern.
Related papers
- Explicit Alignment and Many-to-many Entailment Based Reasoning for
Conversational Machine Reading [8.910847114561191]
Conversational Machine Reading (CMR) requires answering a user's initial question through multi-turn dialogue interactions based on a given document.
Our proposed method achieves state-of-the-art in micro-accuracy and ranks the first place on the public leaderboard of the CMR benchmark dataset ShARC.
arXiv Detail & Related papers (2023-10-20T10:27:24Z) - Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language
Modelling [70.23876429382969]
We propose a benchmark that can evaluate intra-sentence discourse properties across a diverse set of NLP tasks.
Disco-Bench consists of 9 document-level testsets in the literature domain, which contain rich discourse phenomena.
For linguistic analysis, we also design a diagnostic test suite that can examine whether the target models learn discourse knowledge.
arXiv Detail & Related papers (2023-07-16T15:18:25Z) - Pre-training Multi-party Dialogue Models with Latent Discourse Inference [85.9683181507206]
We pre-train a model that understands the discourse structure of multi-party dialogues, namely, to whom each utterance is replying.
To fully utilize the unlabeled data, we propose to treat the discourse structures as latent variables, then jointly infer them and pre-train the discourse-aware model.
arXiv Detail & Related papers (2023-05-24T14:06:27Z) - TANet: Thread-Aware Pretraining for Abstractive Conversational
Summarization [27.185068253347257]
We build a large-scale (11M) pretraining dataset called RCS based on the multi-person discussions in the Reddit community.
We then present TANet, a thread-aware Transformer-based network.
Unlike the existing pre-trained models that treat a conversation as a sequence of sentences, we argue that the inherent contextual dependency plays an essential role in understanding the entire conversation.
arXiv Detail & Related papers (2022-04-09T16:08:46Z) - Open-Retrieval Conversational Machine Reading [80.13988353794586]
In conversational machine reading, systems need to interpret natural language rules, answer high-level questions, and ask follow-up clarification questions.
Existing works assume the rule text is provided for each user question, which neglects the essential retrieval step in real scenarios.
In this work, we propose and investigate an open-retrieval setting of conversational machine reading.
arXiv Detail & Related papers (2021-02-17T08:55:01Z) - Exploiting Unsupervised Data for Emotion Recognition in Conversations [76.01690906995286]
Emotion Recognition in Conversations (ERC) aims to predict the emotional state of speakers in conversations.
The available supervised data for the ERC task is limited.
We propose a novel approach to leverage unsupervised conversation data.
arXiv Detail & Related papers (2020-10-02T13:28:47Z) - Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational
Machine Reading [177.50355465392047]
We present a new framework of conversational machine reading that comprises a novel Explicit Memory Tracker (EMT)
Our framework generates clarification questions by adopting a coarse-to-fine reasoning strategy.
EMT achieves new state-of-the-art results of 74.6% micro-averaged decision accuracy and 49.5 BLEU4.
arXiv Detail & Related papers (2020-05-26T02:21:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.