Cognitive Diagnosis with Explicit Student Vector Estimation and
Unsupervised Question Matrix Learning
- URL: http://arxiv.org/abs/2203.03722v1
- Date: Tue, 1 Mar 2022 03:53:19 GMT
- Title: Cognitive Diagnosis with Explicit Student Vector Estimation and
Unsupervised Question Matrix Learning
- Authors: Lu Dong, Zhenhua Ling, Qiang Ling and Zefeng Lai
- Abstract summary: We propose an explicit student vector estimation (ESVE) method to estimate the student vectors of DINA.
We also propose an unsupervised method called bidirectional calibration algorithm (HBCA) to label the Q-matrix automatically.
The experimental results on two real-world datasets show that ESVE-DINA outperforms the DINA model on accuracy and that the Q-matrix labeled automatically by HBCA can achieve performance comparable to that obtained with the manually labeled Q-matrix.
- Score: 53.79108239032941
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cognitive diagnosis is an essential task in many educational applications.
Many solutions have been designed in the literature. The deterministic input,
noisy "and" gate (DINA) model is a classical cognitive diagnosis model and can
provide interpretable cognitive parameters, e.g., student vectors. However, the
assumption of the probabilistic part of DINA is too strong, because it assumes
that the slip and guess rates of questions are student-independent. Besides,
the question matrix (i.e., Q-matrix) recording the skill distribution of the
questions in the cognitive diagnosis domain often requires precise labels given
by domain experts. Thus, we propose an explicit student vector estimation
(ESVE) method to estimate the student vectors of DINA with a local
self-consistent test, which does not rely on any assumptions for the
probabilistic part of DINA. Then, based on the estimated student vectors, the
probabilistic part of DINA can be modified to a student dependent model that
the slip and guess rates are related to student vectors. Furthermore, we
propose an unsupervised method called heuristic bidirectional calibration
algorithm (HBCA) to label the Q-matrix automatically, which connects the
question difficulty relation and the answer results for initialization and uses
the fault tolerance of ESVE-DINA for calibration. The experimental results on
two real-world datasets show that ESVE-DINA outperforms the DINA model on
accuracy and that the Q-matrix labeled automatically by HBCA can achieve
performance comparable to that obtained with the manually labeled Q-matrix when
using the same model structure.
Related papers
- Uncertainty-preserving deep knowledge tracing with state-space models [1.3791394805787949]
A central goal of knowledge tracing and traditional assessment is to quantify student knowledge and skills at a given point in time.
We introduce Dynamic LENS, a modeling paradigm that combines the flexible uncertainty-preserving properties of variational autoencoders with the principled information integration of Bayesian state-space models.
arXiv Detail & Related papers (2024-07-09T13:40:28Z) - MalAlgoQA: Pedagogical Evaluation of Counterfactual Reasoning in Large Language Models and Implications for AI in Education [2.872215065231376]
This paper introduces MalAlgoQA, a dataset designed to evaluate the counterfactual reasoning capabilities of Large Language Models.
At the heart of MalAlgoQA are malgorithms'' - rationales behind incorrect answer choices that represent flawed yet logically coherent reasoning paths.
arXiv Detail & Related papers (2024-07-01T03:39:13Z) - Querying Easily Flip-flopped Samples for Deep Active Learning [63.62397322172216]
Active learning is a machine learning paradigm that aims to improve the performance of a model by strategically selecting and querying unlabeled data.
One effective selection strategy is to base it on the model's predictive uncertainty, which can be interpreted as a measure of how informative a sample is.
This paper proposes the it least disagree metric (LDM) as the smallest probability of disagreement of the predicted label.
arXiv Detail & Related papers (2024-01-18T08:12:23Z) - Uncertainty-aware Language Modeling for Selective Question Answering [107.47864420630923]
We present an automatic large language model (LLM) conversion approach that produces uncertainty-aware LLMs.
Our approach is model- and data-agnostic, is computationally-efficient, and does not rely on external models or systems.
arXiv Detail & Related papers (2023-11-26T22:47:54Z) - Topological Interpretability for Deep-Learning [0.30806551485143496]
Deep learning (DL) models cannot quantify the certainty of their predictions.
This work presents a method to infer prominent features in two DL classification models trained on clinical and non-clinical text.
arXiv Detail & Related papers (2023-05-15T13:38:13Z) - Attentive Q-Matrix Learning for Knowledge Tracing [4.863310073296471]
We propose Q-matrix-based Attentive Knowledge Tracing (QAKT) as an end-to-end style model.
QAKT is capable of modeling problems hierarchically and learning the q-matrix efficiently based on students' sequences.
Results of further experiments suggest that the q-matrix learned by QAKT is highly model-agnostic and more information-sufficient than the one labeled by human experts.
arXiv Detail & Related papers (2023-04-06T12:31:34Z) - General stochastic separation theorems with optimal bounds [68.8204255655161]
Phenomenon of separability was revealed and used in machine learning to correct errors of Artificial Intelligence (AI) systems and analyze AI instabilities.
Errors or clusters of errors can be separated from the rest of the data.
The ability to correct an AI system also opens up the possibility of an attack on it, and the high dimensionality induces vulnerabilities caused by the same separability.
arXiv Detail & Related papers (2020-10-11T13:12:41Z) - Verification of ML Systems via Reparameterization [6.482926592121413]
We show how a probabilistic program can be automatically represented in a theorem prover.
We also prove that the null model used in a Bayesian hypothesis test satisfies a fairness criterion called demographic parity.
arXiv Detail & Related papers (2020-07-14T02:19:25Z) - Towards Causality-Aware Inferring: A Sequential Discriminative Approach
for Medical Diagnosis [142.90770786804507]
Medical diagnosis assistant (MDA) aims to build an interactive diagnostic agent to sequentially inquire about symptoms for discriminating diseases.
This work attempts to address these critical issues in MDA by taking advantage of the causal diagram.
We propose a propensity-based patient simulator to effectively answer unrecorded inquiry by drawing knowledge from the other records.
arXiv Detail & Related papers (2020-03-14T02:05:54Z) - Progressive Identification of True Labels for Partial-Label Learning [112.94467491335611]
Partial-label learning (PLL) is a typical weakly supervised learning problem, where each training instance is equipped with a set of candidate labels among which only one is the true label.
Most existing methods elaborately designed as constrained optimizations that must be solved in specific manners, making their computational complexity a bottleneck for scaling up to big data.
This paper proposes a novel framework of classifier with flexibility on the model and optimization algorithm.
arXiv Detail & Related papers (2020-02-19T08:35:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.