Question-type Identification for Academic Questions in Online Learning
Platform
- URL: http://arxiv.org/abs/2211.13727v1
- Date: Thu, 24 Nov 2022 17:28:29 GMT
- Title: Question-type Identification for Academic Questions in Online Learning
Platform
- Authors: Azam Rabiee, Alok Goel, Johnson D'Souza, Saurabh Khanwalkar
- Abstract summary: This paper explores question-type identification as a step in content understanding for an online learning platform.
We have defined twelve question-type classes, including Multiple-Choice Question (MCQ), essay, and others.
We trained a BERT-based ensemble model on this dataset and evaluated this model on a separate human-labeled test set.
- Score: 1.3764085113103222
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Online learning platforms provide learning materials and answers to students'
academic questions by experts, peers, or systems. This paper explores
question-type identification as a step in content understanding for an online
learning platform. The aim of the question-type identifier is to categorize
question types based on their structure and complexity, using the question
text, subject, and structural features. We have defined twelve question-type
classes, including Multiple-Choice Question (MCQ), essay, and others. We have
compiled an internal dataset of students' questions and used a combination of
weak-supervision techniques and manual annotation. We then trained a BERT-based
ensemble model on this dataset and evaluated this model on a separate
human-labeled test set. Our experiments yielded an F1-score of 0.94 for MCQ
binary classification and promising results for 12-class multilabel
classification. We deployed the model in our online learning platform as a
crucial enabler for content understanding to enhance the student learning
experience.
Related papers
- How Do Students Interact with an LLM-powered Virtual Teaching Assistant in Different Educational Settings? [3.9134031118910264]
Jill Watson, a virtual teaching assistant powered by LLMs, answers student questions and engages them in extended conversations on courseware provided by the instructors.
In this paper, we analyze student interactions with Jill across multiple courses and colleges.
We find that, by supporting a wide range of cognitive demands, Jill encourages students to engage in sophisticated, higher-order cognitive questions.
arXiv Detail & Related papers (2024-07-15T01:22:50Z) - SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model [64.92472567841105]
Knowledge Tracing (KT) aims to determine whether students will respond correctly to the next question.
Structure-aware Inductive Knowledge Tracing model with large language model (dubbed SINKT)
SINKT predicts the student's response to the target question by interacting with the student's knowledge state and the question representation.
arXiv Detail & Related papers (2024-07-01T12:44:52Z) - DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain
Question Answering over Knowledge Base and Text [73.68051228972024]
Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when relying on their internal knowledge.
Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge.
arXiv Detail & Related papers (2023-10-31T04:37:57Z) - A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application [7.595948982978669]
Continual learning, also known as incremental learning or life-long learning, stands at the forefront of deep learning and AI systems.
We present a review of Continual semantic segmentation (CSS), of which the dense prediction peculiarity makes it a challenging, intricate and burgeoning task.
We sort out and categorize current CSS models into two main branches including data-replay and data-free sets.
We also introduce four CSS specialities with diverse application scenarios and development tendencies.
arXiv Detail & Related papers (2023-10-22T11:53:56Z) - EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain [20.801638768447948]
This dataset contains 3,397 samples of multiple choice questions, answers (including distractors), and their source documents from the educational domain.
Each question is phrased in two forms, normal and close. Correct answers are linked to source documents with sentence-level annotations.
All questions have been generated by educational experts rather than crowd workers to ensure they are maintaining educational and learning standards.
arXiv Detail & Related papers (2022-10-12T11:28:34Z) - Auxiliary Task Guided Interactive Attention Model for Question
Difficulty Prediction [6.951136079043972]
We propose a multi-task method with an interactive attention mechanism, Qdiff, for jointly predicting Bloom's taxonomy and difficulty levels of academic questions.
The proposed learning method would help learn representations that capture the relationship between Bloom's taxonomy and difficulty labels.
arXiv Detail & Related papers (2022-05-24T19:55:30Z) - Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic
Dataset for Narrative Comprehension [136.82507046638784]
We introduce FairytaleQA, a dataset focusing on narrative comprehension of kindergarten to eighth-grade students.
FairytaleQA consists of 10,580 explicit and implicit questions derived from 278 children-friendly stories.
arXiv Detail & Related papers (2022-03-26T00:20:05Z) - Quality meets Diversity: A Model-Agnostic Framework for Computerized
Adaptive Testing [60.38182654847399]
Computerized Adaptive Testing (CAT) is emerging as a promising testing application in many scenarios.
We propose a novel framework, namely Model-Agnostic Adaptive Testing (MAAT) for CAT solution.
arXiv Detail & Related papers (2021-01-15T06:48:50Z) - Retrieve, Program, Repeat: Complex Knowledge Base Question Answering via
Alternate Meta-learning [56.771557756836906]
We present a novel method that automatically learns a retrieval model alternately with the programmer from weak supervision.
Our system leads to state-of-the-art performance on a large-scale task for complex question answering over knowledge bases.
arXiv Detail & Related papers (2020-10-29T18:28:16Z) - Inquisitive Question Generation for High Level Text Comprehension [60.21497846332531]
We introduce INQUISITIVE, a dataset of 19K questions that are elicited while a person is reading through a document.
We show that readers engage in a series of pragmatic strategies to seek information.
We evaluate question generation models based on GPT-2 and show that our model is able to generate reasonable questions.
arXiv Detail & Related papers (2020-10-04T19:03:39Z) - Educational Question Mining At Scale: Prediction, Analysis and
Personalization [35.42197158180065]
We propose a framework for mining insights from educational questions at scale.
We utilize the state-of-the-art Bayesian deep learning method, in particular partial variational auto-encoders (p-VAE)
We apply our proposed framework to a real-world dataset with tens of thousands of questions and tens of millions of answers from an online education platform.
arXiv Detail & Related papers (2020-03-12T19:07:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.