DBE-KT22: A Knowledge Tracing Dataset Based on Online Student Evaluation
- URL: http://arxiv.org/abs/2208.12651v3
- Date: Sat, 28 Jan 2023 07:56:41 GMT
- Title: DBE-KT22: A Knowledge Tracing Dataset Based on Online Student Evaluation
- Authors: Ghodai Abdelrahman, Sherif Abdelfattah, Qing Wang, Yu Lin
- Abstract summary: We propose a new knowledge tracing dataset named Database Exercises for Knowledge Tracing (DBE-KT22)
It is collected from an online student exercise system in a course taught at the Australian National University in Australia.
- Score: 6.341812549259541
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Online education has gained an increasing importance over the last decade for
providing affordable high-quality education to students worldwide. This has
been further magnified during the global pandemic as more students switched to
study online. The majority of online education tasks, e.g., course
recommendation, exercise recommendation, or automated evaluation, depends on
tracking students' knowledge progress. This is known as the \emph{Knowledge
Tracing} problem in the literature. Addressing this problem requires collecting
student evaluation data that can reflect their knowledge evolution over time.
In this paper, we propose a new knowledge tracing dataset named Database
Exercises for Knowledge Tracing (DBE-KT22) that is collected from an online
student exercise system in a course taught at the Australian National
University in Australia. We discuss the characteristics of the DBE-KT22 dataset
and contrast it with the existing datasets in the knowledge tracing literature.
Our dataset is available for public access through the Australian Data Archive
platform.
Related papers
- Knowledge Tracing Challenge: Optimal Activity Sequencing for Students [0.9814642627359286]
Knowledge tracing is a method used in education to assess and track the acquisition of knowledge by individual learners.
We will present the results of the implementation of two Knowledge Tracing algorithms on a newly released dataset as part of the AAAI2023 Global Knowledge Tracing Challenge.
arXiv Detail & Related papers (2023-11-13T16:28:34Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - Responsible Active Learning via Human-in-the-loop Peer Study [88.01358655203441]
We propose a responsible active learning method, namely Peer Study Learning (PSL), to simultaneously preserve data privacy and improve model stability.
We first introduce a human-in-the-loop teacher-student architecture to isolate unlabelled data from the task learner (teacher) on the cloud-side.
During training, the task learner instructs the light-weight active learner which then provides feedback on the active sampling criterion.
arXiv Detail & Related papers (2022-11-24T13:18:27Z) - Investigating the impact of COVID-19 on Online Learning-based Web
Behavior [0.0]
The study specifically focused on investigating Google Search-based web behavior data as Google is the most popular search engine globally.
The impact of COVID-19 related to online learning-based web behavior on Google was studied for the top 20 worst affected countries in terms of the total number of COVID-19 cases.
arXiv Detail & Related papers (2022-04-27T01:38:10Z) - DCUR: Data Curriculum for Teaching via Samples with Reinforcement
Learning [6.9884912034790405]
We propose a framework, Data CUrriculum for Reinforcement learning (DCUR), which first trains teachers using online deep RL.
Then, students learn by running either offline RL or by using teacher data in combination with a small amount of self-generated data.
arXiv Detail & Related papers (2021-09-15T15:39:46Z) - PEEK: A Large Dataset of Learner Engagement with Educational Videos [20.49299110732228]
We release a large, novel dataset of learners engaging with educational videos in-the-wild.
The dataset, named Personalised Educational Engagement with Knowledge Topics PEEK, is the first publicly available dataset of this nature.
We believe that granular learner engagement signals in unison with rich content representations will pave the way to building powerful personalization algorithms.
arXiv Detail & Related papers (2021-09-03T11:23:02Z) - Online Continual Learning with Natural Distribution Shifts: An Empirical
Study with Visual Data [101.6195176510611]
"Online" continual learning enables evaluating both information retention and online learning efficacy.
In online continual learning, each incoming small batch of data is first used for testing and then added to the training set, making the problem truly online.
We introduce a new benchmark for online continual visual learning that exhibits large scale and natural distribution shifts.
arXiv Detail & Related papers (2021-08-20T06:17:20Z) - Combining Online Learning and Offline Learning for Contextual Bandits
with Deficient Support [53.11601029040302]
Current offline-policy learning algorithms are mostly based on inverse propensity score (IPS) weighting.
We propose a novel approach that uses a hybrid of offline learning with online exploration.
Our approach determines an optimal policy with theoretical guarantees using the minimal number of online explorations.
arXiv Detail & Related papers (2021-07-24T05:07:43Z) - Application of Deep Self-Attention in Knowledge Tracing [2.5852720579998336]
This paper proposed Deep Self-Attentive Knowledge Tracing (DSAKT) based on the data of PTA, an online assessment system used by students in many universities in China.
Experimentation on the data of PTA shows that DSAKT outperforms the other models for knowledge tracing an improvement of 2.1% on average.
arXiv Detail & Related papers (2021-05-17T14:45:38Z) - Peer-inspired Student Performance Prediction in Interactive Online
Question Pools with Graph Neural Network [56.62345811216183]
We propose a novel approach using Graph Neural Networks (GNNs) to achieve better student performance prediction in interactive online question pools.
Specifically, we model the relationship between students and questions using student interactions to construct the student-interaction-question network.
We evaluate the effectiveness of our approach on a real-world dataset consisting of 104,113 mouse trajectories generated in the problem-solving process of over 4000 students on 1631 questions.
arXiv Detail & Related papers (2020-08-04T14:55:32Z) - Drinking from a Firehose: Continual Learning with Web-scale Natural
Language [109.80198763438248]
We study a natural setting for continual learning on a massive scale.
We collect massive datasets of Twitter posts.
We present a rigorous evaluation of continual learning algorithms on an unprecedented scale.
arXiv Detail & Related papers (2020-07-18T05:40:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.