Unsupervised Question Duplicate and Related Questions Detection in
e-learning platforms
- URL: http://arxiv.org/abs/2301.05150v1
- Date: Tue, 20 Dec 2022 11:52:52 GMT
- Title: Unsupervised Question Duplicate and Related Questions Detection in
e-learning platforms
- Authors: Maksimjeet Chowdhary, Sanyam Goyal, Venktesh V, Mukesh Mohania and
Vikram Goyal
- Abstract summary: We propose a tool that can surface near-duplicate and semantically related questions without supervised data.
The proposed tool follows an unsupervised hybrid pipeline of statistical and neural approaches.
We demonstrate that QDup can detect near-duplicate questions and also suggest related questions for practice with remarkable accuracy and speed.
- Score: 1.8749305679160364
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online learning platforms provide diverse questions to gauge the learners'
understanding of different concepts. The repository of questions has to be
constantly updated to ensure a diverse pool of questions to conduct assessments
for learners. However, it is impossible for the academician to manually skim
through the large repository of questions to check for duplicates when
onboarding new questions from external sources. Hence, we propose a tool QDup
in this paper that can surface near-duplicate and semantically related
questions without any supervised data. The proposed tool follows an
unsupervised hybrid pipeline of statistical and neural approaches for
incorporating different nuances in similarity for the task of question
duplicate detection. We demonstrate that QDup can detect near-duplicate
questions and also suggest related questions for practice with remarkable
accuracy and speed from a large repository of questions. The demo video of the
tool can be found at https://www.youtube.com/watch?v=loh0_-7XLW4.
Related papers
- Duplicate Question Retrieval and Confirmation Time Prediction in
Software Communities [4.721583392950402]
Community Question Answering (CQA) in different domains is growing at a large scale because of the availability of several platforms and huge shareable information among users.
With the rapid growth of such online platforms, a massive amount of archived data makes it difficult for moderators to retrieve possible duplicates for a new question.
We propose a Siamese neural network based approach by exploiting both text and network-based features.
arXiv Detail & Related papers (2023-09-10T14:13:54Z) - Answering Ambiguous Questions with a Database of Questions, Answers, and
Revisions [95.92276099234344]
We present a new state-of-the-art for answering ambiguous questions that exploits a database of unambiguous questions generated from Wikipedia.
Our method improves performance by 15% on recall measures and 10% on measures which evaluate disambiguating questions from predicted outputs.
arXiv Detail & Related papers (2023-08-16T20:23:16Z) - Learning with Instance Bundles for Reading Comprehension [61.823444215188296]
We introduce new supervision techniques that compare question-answer scores across multiple related instances.
Specifically, we normalize these scores across various neighborhoods of closely contrasting questions and/or answers.
We empirically demonstrate the effectiveness of training with instance bundles on two datasets.
arXiv Detail & Related papers (2021-04-18T06:17:54Z) - Attention-based model for predicting question relatedness on Stack
Overflow [0.0]
We propose an Attention-based Sentence pair Interaction Model (ASIM) to predict the relatedness between questions on Stack Overflow automatically.
ASIM has made significant improvement over the baseline approaches in Precision, Recall, and Micro-F1 evaluation metrics.
Our model also performs well in the duplicate question detection task of Ask Ubuntu.
arXiv Detail & Related papers (2021-03-19T12:18:03Z) - Knowledge-Routed Visual Question Reasoning: Challenges for Deep
Representation Embedding [140.5911760063681]
We propose a novel dataset named Knowledge-Routed Visual Question Reasoning for VQA model evaluation.
We generate the question-answer pair based on both the Visual Genome scene graph and an external knowledge base with controlled programs.
arXiv Detail & Related papers (2020-12-14T00:33:44Z) - Few-Shot Complex Knowledge Base Question Answering via Meta
Reinforcement Learning [55.08037694027792]
Complex question-answering (CQA) involves answering complex natural-language questions on a knowledge base (KB)
The conventional neural program induction (NPI) approach exhibits uneven performance when the questions have different types.
This paper proposes a meta-reinforcement learning approach to program induction in CQA to tackle the potential distributional bias in questions.
arXiv Detail & Related papers (2020-10-29T18:34:55Z) - Retrieve, Program, Repeat: Complex Knowledge Base Question Answering via
Alternate Meta-learning [56.771557756836906]
We present a novel method that automatically learns a retrieval model alternately with the programmer from weak supervision.
Our system leads to state-of-the-art performance on a large-scale task for complex question answering over knowledge bases.
arXiv Detail & Related papers (2020-10-29T18:28:16Z) - Unsupervised Question Decomposition for Question Answering [102.56966847404287]
We propose an algorithm for One-to-N Unsupervised Sequence Sequence (ONUS) that learns to map one hard, multi-hop question to many simpler, single-hop sub-questions.
We show large QA improvements on HotpotQA over a strong baseline on the original, out-of-domain, and multi-hop dev sets.
arXiv Detail & Related papers (2020-02-22T19:40:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.