LANA: Towards Personalized Deep Knowledge Tracing Through
Distinguishable Interactive Sequences
- URL: http://arxiv.org/abs/2105.06266v1
- Date: Wed, 21 Apr 2021 02:57:42 GMT
- Title: LANA: Towards Personalized Deep Knowledge Tracing Through
Distinguishable Interactive Sequences
- Authors: Yuhao Zhou, Xihua Li, Yunbo Cao, Xuemin Zhao, Qing Ye and Jiancheng Lv
- Abstract summary: We propose Leveled Attentive KNowledge TrAcing (LANA) to predict students' responses to future questions.
It uses a novel student-related features extractor (SRFE) to distill students' unique inherent properties from their respective interactive sequences.
With pivot module reconstructed the decoder for individual students and leveled learning specialized encoders for groups, personalized DKT was achieved.
- Score: 21.67751919579854
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In educational applications, Knowledge Tracing (KT), the problem of
accurately predicting students' responses to future questions by summarizing
their knowledge states, has been widely studied for decades as it is considered
a fundamental task towards adaptive online learning. Among all the proposed KT
methods, Deep Knowledge Tracing (DKT) and its variants are by far the most
effective ones due to the high flexibility of the neural network. However, DKT
often ignores the inherent differences between students (e.g. memory skills,
reasoning skills, ...), averaging the performances of all students, leading to
the lack of personalization, and therefore was considered insufficient for
adaptive learning. To alleviate this problem, in this paper, we proposed
Leveled Attentive KNowledge TrAcing (LANA), which firstly uses a novel
student-related features extractor (SRFE) to distill students' unique inherent
properties from their respective interactive sequences. Secondly, the pivot
module was utilized to dynamically reconstruct the decoder of the neural
network on attention of the extracted features, successfully distinguishing the
performance between students over time. Moreover, inspired by Item Response
Theory (IRT), the interpretable Rasch model was used to cluster students by
their ability levels, and thereby utilizing leveled learning to assign
different encoders to different groups of students. With pivot module
reconstructed the decoder for individual students and leveled learning
specialized encoders for groups, personalized DKT was achieved. Extensive
experiments conducted on two real-world large-scale datasets demonstrated that
our proposed LANA improves the AUC score by at least 1.00% (i.e. EdNet 1.46%
and RAIEd2020 1.00%), substantially surpassing the other State-Of-The-Art KT
methods.
Related papers
- Ensemble Learning via Knowledge Transfer for CTR Prediction [9.891226177252653]
In this paper, we investigate larger ensemble networks and find three inherent limitations in commonly used ensemble learning method.
We propose a novel model-agnostic Ensemble Knowledge Transfer Framework (EKTF)
Experimental results on five real-world datasets demonstrate the effectiveness and compatibility of EKTF.
arXiv Detail & Related papers (2024-11-25T06:14:20Z) - Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing [59.480951050911436]
We present KCQRL, a framework for automated knowledge concept annotation and question representation learning.
We demonstrate the effectiveness of KCQRL across 15 KT algorithms on two large real-world Math learning datasets.
arXiv Detail & Related papers (2024-10-02T16:37:19Z) - Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data [15.326571438985466]
topological features obtained by topological data analysis (TDA) have been suggested as a potential solution.
There are two significant obstacles to using topological features in deep learning.
We propose to use two teacher networks, one trained on the raw time-series data, and another trained on persistence images generated by TDA methods.
A robust student model is distilled, which uses only the time-series data as an input, while implicitly preserving topological features.
arXiv Detail & Related papers (2024-07-07T10:08:34Z) - BAL: Balancing Diversity and Novelty for Active Learning [53.289700543331925]
We introduce a novel framework, Balancing Active Learning (BAL), which constructs adaptive sub-pools to balance diverse and uncertain data.
Our approach outperforms all established active learning methods on widely recognized benchmarks by 1.20%.
arXiv Detail & Related papers (2023-12-26T08:14:46Z) - DKT-STDRL: Spatial and Temporal Representation Learning Enhanced Deep
Knowledge Tracing for Learning Performance Prediction [11.75131482747055]
The DKT-STDRL model uses CNN to extract the spatial feature information of students' exercise sequences.
The BiLSTM part extracts the temporal features from the joint learning features to obtain the prediction information of whether the students answer correctly at the next time step.
Experiments on the public education datasets ASSISTment2009, ASSISTment2015, Synthetic-5, ASSISTchall, and Statics2011 prove that DKT-STDRL can achieve better prediction effects than DKT and CKT.
arXiv Detail & Related papers (2023-02-15T09:23:21Z) - HiTSKT: A Hierarchical Transformer Model for Session-Aware Knowledge
Tracing [35.02243127325724]
Knowledge tracing (KT) aims to leverage students' learning histories to estimate their mastery levels on a set of pre-defined skills, based on which the corresponding future performance can be accurately predicted.
In practice, a student's learning history comprises answers to sets of massed questions, each known as a session, rather than merely being a sequence of independent answers.
Most existing KT models treat student's learning records as a single continuing sequence, without capturing the sessional shift of students' knowledge state.
arXiv Detail & Related papers (2022-12-23T04:22:42Z) - Distantly-Supervised Named Entity Recognition with Adaptive Teacher
Learning and Fine-grained Student Ensemble [56.705249154629264]
Self-training teacher-student frameworks are proposed to improve the robustness of NER models.
In this paper, we propose an adaptive teacher learning comprised of two teacher-student networks.
Fine-grained student ensemble updates each fragment of the teacher model with a temporal moving average of the corresponding fragment of the student, which enhances consistent predictions on each model fragment against noise.
arXiv Detail & Related papers (2022-12-13T12:14:09Z) - Responsible Active Learning via Human-in-the-loop Peer Study [88.01358655203441]
We propose a responsible active learning method, namely Peer Study Learning (PSL), to simultaneously preserve data privacy and improve model stability.
We first introduce a human-in-the-loop teacher-student architecture to isolate unlabelled data from the task learner (teacher) on the cloud-side.
During training, the task learner instructs the light-weight active learner which then provides feedback on the active sampling criterion.
arXiv Detail & Related papers (2022-11-24T13:18:27Z) - Context-Aware Attentive Knowledge Tracing [21.397976659857793]
We propose attentive knowledge tracing, which couples flexible attention-based neural network models with a series of novel, interpretable model components.
AKT uses a novel monotonic attention mechanism that relates a learner's future responses to assessment questions to their past responses.
We show that AKT outperforms existing KT methods (by up to $6%$ in AUC in some cases) on predicting future learner responses.
arXiv Detail & Related papers (2020-07-24T02:45:43Z) - Mining Implicit Entity Preference from User-Item Interaction Data for
Knowledge Graph Completion via Adversarial Learning [82.46332224556257]
We propose a novel adversarial learning approach by leveraging user interaction data for the Knowledge Graph Completion task.
Our generator is isolated from user interaction data, and serves to improve the performance of the discriminator.
To discover implicit entity preference of users, we design an elaborate collaborative learning algorithms based on graph neural networks.
arXiv Detail & Related papers (2020-03-28T05:47:33Z) - Efficient Crowd Counting via Structured Knowledge Transfer [122.30417437707759]
Crowd counting is an application-oriented task and its inference efficiency is crucial for real-world applications.
We propose a novel Structured Knowledge Transfer framework to generate a lightweight but still highly effective student network.
Our models obtain at least 6.5$times$ speed-up on an Nvidia 1080 GPU and even achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-03-23T08:05:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.