Sequential Sentence Matching Network for Multi-turn Response Selection
in Retrieval-based Chatbots
- URL: http://arxiv.org/abs/2005.07923v1
- Date: Sat, 16 May 2020 09:47:19 GMT
- Title: Sequential Sentence Matching Network for Multi-turn Response Selection
in Retrieval-based Chatbots
- Authors: Chao Xiong, Che Liu, Zijun Xu, Junfeng Jiang, Jieping Ye
- Abstract summary: We propose a matching network, called sequential sentence matching network (S2M), to use the sentence-level semantic information to address the problem.
Firstly, we find that by using the sentence-level semantic information, the network successfully addresses the problem and gets a significant improvement on matching, resulting in a state-of-the-art performance.
- Score: 45.920841134523286
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, open domain multi-turn chatbots have attracted much interest from
lots of researchers in both academia and industry. The dominant retrieval-based
methods use context-response matching mechanisms for multi-turn response
selection. Specifically, the state-of-the-art methods perform the
context-response matching by word or segment similarity. However, these models
lack a full exploitation of the sentence-level semantic information, and make
simple mistakes that humans can easily avoid. In this work, we propose a
matching network, called sequential sentence matching network (S2M), to use the
sentence-level semantic information to address the problem. Firstly and most
importantly, we find that by using the sentence-level semantic information, the
network successfully addresses the problem and gets a significant improvement
on matching, resulting in a state-of-the-art performance. Furthermore, we
integrate the sentence matching we introduced here and the usual word
similarity matching reported in the current literature, to match at different
semantic levels. Experiments on three public data sets show that such
integration further improves the model performance.
Related papers
- Relational Sentence Embedding for Flexible Semantic Matching [86.21393054423355]
We present Sentence Embedding (RSE), a new paradigm to discover further the potential of sentence embeddings.
RSE is effective and flexible in modeling sentence relations and outperforms a series of state-of-the-art embedding methods.
arXiv Detail & Related papers (2022-12-17T05:25:17Z) - Unsupervised Mismatch Localization in Cross-Modal Sequential Data [5.932046800902776]
We develop an unsupervised learning algorithm that can infer the relationship between content-mismatched cross-modal data.
We propose a hierarchical Bayesian deep learning model, named mismatch localization variational autoencoder (ML-VAE), that decomposes the generative process of the speech into hierarchically structured latent variables.
Our experimental results show that ML-VAE successfully locates the mismatch between text and speech, without the need for human annotations.
arXiv Detail & Related papers (2022-05-05T14:23:27Z) - DEIM: An effective deep encoding and interaction model for sentence
matching [0.0]
We propose a sentence matching method based on deep encoding and interaction to extract deep semantic information.
In the encoder layer,we refer to the information of another sentence in the process of encoding a single sentence, and later use a algorithm to fuse the information.
In the interaction layer, we use a bidirectional attention mechanism and a self-attention mechanism to obtain deep semantic information.
arXiv Detail & Related papers (2022-03-20T07:59:42Z) - Domain-level Pairwise Semantic Interaction for Aspect-Based Sentiment
Classification [3.1977819149534987]
We propose a plug-and-play Pairwise Semantic Interaction (PSI) module, which takes pairwise sentences as input.
Different gates are generated to effectively highlight the key semantic features of each sentence.
Finally, the adversarial interaction between the vectors is used to make the semantic representation of two sentences more distinguishable.
arXiv Detail & Related papers (2022-02-21T07:59:17Z) - LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence
Semantic Matching [66.65398852962177]
We develop a novel Dynamic Re-read Network (DRr-Net) for sentence semantic matching.
We extend DRr-Net to Locally-Aware Dynamic Re-read Attention Net (LadRa-Net)
Experiments on two popular sentence semantic matching tasks demonstrate that DRr-Net can significantly improve the performance of sentence semantic matching.
arXiv Detail & Related papers (2021-08-06T02:07:04Z) - Match-Ignition: Plugging PageRank into Transformer for Long-form Text
Matching [66.71886789848472]
We propose a novel hierarchical noise filtering model, namely Match-Ignition, to tackle the effectiveness and efficiency problem.
The basic idea is to plug the well-known PageRank algorithm into the Transformer, to identify and filter both sentence and word level noisy information.
Noisy sentences are usually easy to detect because the sentence is the basic unit of a long-form text, so we directly use PageRank to filter such information.
arXiv Detail & Related papers (2021-01-16T10:34:03Z) - Narrative Incoherence Detection [76.43894977558811]
We propose the task of narrative incoherence detection as a new arena for inter-sentential semantic understanding.
Given a multi-sentence narrative, decide whether there exist any semantic discrepancies in the narrative flow.
arXiv Detail & Related papers (2020-12-21T07:18:08Z) - Dynamic Semantic Matching and Aggregation Network for Few-shot Intent
Detection [69.2370349274216]
Few-shot Intent Detection is challenging due to the scarcity of available annotated utterances.
Semantic components are distilled from utterances via multi-head self-attention.
Our method provides a comprehensive matching measure to enhance representations of both labeled and unlabeled instances.
arXiv Detail & Related papers (2020-10-06T05:16:38Z) - Combining word embeddings and convolutional neural networks to detect
duplicated questions [0.0]
We propose a simple approach to identifying semantically similar questions by combining the strengths of word embeddings and Convolutional Neural Networks.
Our network is trained on the Quora dataset, which contains over 400k question pairs.
arXiv Detail & Related papers (2020-06-08T12:30:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.