Incorporating Casual Analysis into Diversified and Logical Response
Generation
- URL: http://arxiv.org/abs/2209.09482v1
- Date: Tue, 20 Sep 2022 05:51:11 GMT
- Title: Incorporating Casual Analysis into Diversified and Logical Response
Generation
- Authors: Jiayi Liu, Wei Wei, Zhixuan Chu, Xing Gao, Ji Zhang, Tan Yan and Yulin
Kang
- Abstract summary: Conditional Variational AutoEncoder (CVAE) model can generate more diversified responses than the traditional Seq2Seq model.
We propose to predict the mediators to preserve relevant information and auto-regressively incorporate the mediators into generating process.
- Score: 14.4586344491264
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although the Conditional Variational AutoEncoder (CVAE) model can generate
more diversified responses than the traditional Seq2Seq model, the responses
often have low relevance with the input words or are illogical with the
question. A causal analysis is carried out to study the reasons behind, and a
methodology of searching for the mediators and mitigating the confounding bias
in dialogues is provided. Specifically, we propose to predict the mediators to
preserve relevant information and auto-regressively incorporate the mediators
into generating process. Besides, a dynamic topic graph guided conditional
variational autoencoder (TGG-CVAE) model is utilized to complement the semantic
space and reduce the confounding bias in responses. Extensive experiments
demonstrate that the proposed model is able to generate both relevant and
informative responses, and outperforms the state-of-the-art in terms of
automatic metrics and human evaluations.
Related papers
- Self-Labeling in Multivariate Causality and Quantification for Adaptive Machine Learning [0.0]
An interactive causality based self-labeling method was proposed to autonomously associate causally related data streams for domain adaptation.
This paper further develops the self-labeling framework and its theoretical foundations to address these research questions.
arXiv Detail & Related papers (2024-04-08T18:16:22Z) - Promoting Open-domain Dialogue Generation through Learning Pattern
Information between Contexts and Responses [5.936682548344234]
This paper improves the quality of generated responses by learning the implicit pattern information between contexts and responses in the training samples.
We also design a response-aware mechanism for mining the implicit pattern information between contexts and responses so that the generated replies are more diverse and approximate to human replies.
arXiv Detail & Related papers (2023-09-06T08:11:39Z) - Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue
Response Generation Models by Causal Discovery [52.95935278819512]
We conduct the first study on spurious correlations for open-domain response generation models based on a corpus CGDIALOG curated in our work.
Inspired by causal discovery algorithms, we propose a novel model-agnostic method for training and inference of response generation model.
arXiv Detail & Related papers (2023-03-02T06:33:48Z) - Are Neural Topic Models Broken? [81.15470302729638]
We study the relationship between automated and human evaluation of topic models.
We find that neural topic models fare worse in both respects compared to an established classical method.
arXiv Detail & Related papers (2022-10-28T14:38:50Z) - ER: Equivariance Regularizer for Knowledge Graph Completion [107.51609402963072]
We propose a new regularizer, namely, Equivariance Regularizer (ER)
ER can enhance the generalization ability of the model by employing the semantic equivariance between the head and tail entities.
The experimental results indicate a clear and substantial improvement over the state-of-the-art relation prediction methods.
arXiv Detail & Related papers (2022-06-24T08:18:05Z) - Using Ontologies for the Formalization and Recognition of Criticality
for Automated Driving [0.0]
Recent advances suggest the ability to leverage relevant knowledge in handling the inherently open and complex context of the traffic world.
This paper demonstrates to be a powerful tool for modeling and formalization of factors associated with criticality in the environment of automated vehicles.
We elaborate on the modular approach, present a publicly available implementation, and evaluate the method by means of a large-scale drone data set of urban traffic scenarios.
arXiv Detail & Related papers (2022-05-03T14:32:11Z) - Is Automated Topic Model Evaluation Broken?: The Incoherence of
Coherence [62.826466543958624]
We look at the standardization gap and the validation gap in topic model evaluation.
Recent models relying on neural components surpass classical topic models according to these metrics.
We use automatic coherence along with the two most widely accepted human judgment tasks, namely, topic rating and word intrusion.
arXiv Detail & Related papers (2021-07-05T17:58:52Z) - Model-Based Counterfactual Synthesizer for Interpretation [40.01787107375103]
We propose a Model-based Counterfactual Synthesizer (MCS) framework for interpreting machine learning models.
We first analyze the model-based counterfactual process and construct a base synthesizer using a conditional generative adversarial net (CGAN)
To better approximate the counterfactual universe for those rare queries, we novelly employ the umbrella sampling technique to conduct the MCS framework training.
arXiv Detail & Related papers (2021-06-16T17:09:57Z) - Local Explanation of Dialogue Response Generation [77.68077106724522]
Local explanation of response generation (LERG) is proposed to gain insights into the reasoning process of a generation model.
LERG views the sequence prediction as uncertainty estimation of a human response and then creates explanations by perturbing the input and calculating the certainty change over the human response.
Our results show that our method consistently improves other widely used methods on proposed automatic- and human- evaluation metrics for this new task by 4.4-12.8%.
arXiv Detail & Related papers (2021-06-11T17:58:36Z) - A Controllable Model of Grounded Response Generation [122.7121624884747]
Current end-to-end neural conversation models inherently lack the flexibility to impose semantic control in the response generation process.
We propose a framework that we call controllable grounded response generation (CGRG)
We show that using this framework, a transformer based model with a novel inductive attention mechanism, trained on a conversation-like Reddit dataset, outperforms strong generation baselines.
arXiv Detail & Related papers (2020-05-01T21:22:08Z) - Hybrid Autoregressive Transducer (hat) [11.70833387055716]
This paper proposes and evaluates the hybrid autoregressive transducer (HAT) model.
It is a time-synchronous encoderdecoder model that preserves the modularity of conventional automatic speech recognition systems.
We evaluate our proposed model on a large-scale voice search task.
arXiv Detail & Related papers (2020-03-12T20:47:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.