Where's the Question? A Multi-channel Deep Convolutional Neural Network
for Question Identification in Textual Data
- URL: http://arxiv.org/abs/2010.07816v1
- Date: Thu, 15 Oct 2020 15:11:22 GMT
- Title: Where's the Question? A Multi-channel Deep Convolutional Neural Network
for Question Identification in Textual Data
- Authors: George Michalopoulos, Helen Chen, Alexander Wong
- Abstract summary: We propose a novel multi-channel deep convolutional neural network architecture, namely Quest-CNN, for the purpose of separating real questions.
We conducted a comprehensive performance comparison analysis of the proposed network against other deep neural networks.
The proposed Quest-CNN achieved the best F1 score both on a dataset of data entry-review dialogue in a dialysis care setting, and on a general domain dataset.
- Score: 83.89578557287658
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In most clinical practice settings, there is no rigorous reviewing of the
clinical documentation, resulting in inaccurate information captured in the
patient medical records. The gold standard in clinical data capturing is
achieved via "expert-review", where clinicians can have a dialogue with a
domain expert (reviewers) and ask them questions about data entry rules.
Automatically identifying "real questions" in these dialogues could uncover
ambiguities or common problems in data capturing in a given clinical setting.
In this study, we proposed a novel multi-channel deep convolutional neural
network architecture, namely Quest-CNN, for the purpose of separating real
questions that expect an answer (information or help) about an issue from
sentences that are not questions, as well as from questions referring to an
issue mentioned in a nearby sentence (e.g., can you clarify this?), which we
will refer as "c-questions". We conducted a comprehensive performance
comparison analysis of the proposed multi-channel deep convolutional neural
network against other deep neural networks. Furthermore, we evaluated the
performance of traditional rule-based and learning-based methods for detecting
question sentences. The proposed Quest-CNN achieved the best F1 score both on a
dataset of data entry-review dialogue in a dialysis care setting, and on a
general domain dataset.
Related papers
- Learning to Ask Like a Physician [24.15961995052862]
We present Discharge Summary Clinical Questions (DiSCQ), a newly curated question dataset composed of 2,000+ questions.
The questions are generated by medical experts from 100+ MIMIC-III discharge summaries.
We analyze this dataset to characterize the types of information sought by medical experts.
arXiv Detail & Related papers (2022-06-06T15:50:54Z) - Semi-self-supervised Automated ICD Coding [2.449909275410288]
Clinical Text Notes (CTNs) contain physicians' reasoning process, written in an unstructured free text format, as they examine and interview patients.
This paper presents a method of augmenting a sparsely annotated dataset of Icelandic CTNs with a machine-learned imputation in a semi-self-supervised manner.
We train a neural network on a small set of annotated CTNs and use it to extract clinical features from a set of un-annotated CTNs.
arXiv Detail & Related papers (2022-05-20T11:12:54Z) - Building and Evaluating Open-Domain Dialogue Corpora with Clarifying
Questions [65.60888490988236]
We release a dataset focused on open-domain single- and multi-turn conversations.
We benchmark several state-of-the-art neural baselines.
We propose a pipeline consisting of offline and online steps for evaluating the quality of clarifying questions in various dialogues.
arXiv Detail & Related papers (2021-09-13T09:16:14Z) - Self-supervised Answer Retrieval on Clinical Notes [68.87777592015402]
We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching.
We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders.
We report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages.
arXiv Detail & Related papers (2021-08-02T10:42:52Z) - Attention-based Aspect Reasoning for Knowledge Base Question Answering
on Clinical Notes [12.831807443341214]
We aim at creating knowledge base from clinical notes to link different patients and clinical notes, and performing knowledge base question answering (KBQA)
Based on the expert annotations in n2c2, we first created the ClinicalKBQA dataset that includes 8,952 QA pairs and covers questions about seven medical topics through 322 question templates.
We propose an attention-based aspect reasoning (AAR) method for KBQA and investigated the impact of different aspects of answers for prediction.
arXiv Detail & Related papers (2021-08-01T17:58:46Z) - Neural Response Interpretation through the Lens of Critical Pathways [52.41018985255681]
We discuss the problem of identifying critical pathways and leverage them for interpreting the network's response to an input.
We demonstrate that sparse pathways derived from pruning do not necessarily encode critical input information.
To ensure sparse pathways include critical fragments of the encoded input information, we propose pathway selection via neurons' contribution to the response.
arXiv Detail & Related papers (2021-03-31T08:08:41Z) - Chinese Medical Question Answer Matching Based on Interactive Sentence
Representation Learning [0.0]
Chinese medical question-answer matching is more challenging than the open-domain question answer matching in English.
In this paper, we design a series of interactive sentence representation learning models to tackle this problem.
Our model significantly outperforms all the existing state-of-the-art models of Chinese medical question answer matching.
arXiv Detail & Related papers (2020-11-27T06:13:56Z) - Domain Generalization for Medical Imaging Classification with
Linear-Dependency Regularization [59.5104563755095]
We introduce a simple but effective approach to improve the generalization capability of deep neural networks in the field of medical imaging classification.
Motivated by the observation that the domain variability of the medical images is to some extent compact, we propose to learn a representative feature space through variational encoding.
arXiv Detail & Related papers (2020-09-27T12:30:30Z) - Better Early than Late: Fusing Topics with Word Embeddings for Neural
Question Paraphrase Identification [24.574227630018758]
We propose two ways of merging topics with word embeddings in a new neural architecture for question paraphrase identification.
Our results show that our system outperforms neural baselines on multiple CQA datasets.
arXiv Detail & Related papers (2020-07-22T10:09:26Z) - Boosting Deep Neural Networks with Geometrical Prior Knowledge: A Survey [77.99182201815763]
Deep Neural Networks (DNNs) achieve state-of-the-art results in many different problem settings.
DNNs are often treated as black box systems, which complicates their evaluation and validation.
One promising field, inspired by the success of convolutional neural networks (CNNs) in computer vision tasks, is to incorporate knowledge about symmetric geometrical transformations.
arXiv Detail & Related papers (2020-06-30T14:56:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.