Measuring the Credibility of Student Attendance Data in Higher Education
for Data Mining
- URL: http://arxiv.org/abs/2009.00679v1
- Date: Tue, 1 Sep 2020 20:21:46 GMT
- Title: Measuring the Credibility of Student Attendance Data in Higher Education
for Data Mining
- Authors: Mohammed Alsuwaiket, Christian Dawson, Firat Batmaz
- Abstract summary: Student attendance in higher education has always been dealt with in a classical way.
This study tries to formulate the extracted knowledge in a way that guarantees achieving accurate and credible results.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Educational Data Mining (EDM) is a developing discipline, concerned with
expanding the classical Data Mining (DM) methods and developing new methods for
discovering the data that originate from educational systems. Student
attendance in higher education has always been dealt with in a classical way,
educators rely on counting the occurrence of attendance or absence building
their knowledge about students as well as modules based on this count. This
method is neither credible nor does it necessarily provide a real indication of
a student performance. This study tries to formulate the extracted knowledge in
a way that guarantees achieving accurate and credible results. Student
attendance data, gathered from the educational system, were first cleaned in
order to remove any randomness and noise, then various attributes were studied
so as to highlight the most significant ones that affect the real attendance of
students. The next step was to derive an equation that measures the Student
Attendance Credibility (SAC) considering the attributes chosen in the previous
step. The reliability of the newly developed measure was then evaluated in
order to examine its consistency. Finally, the J48 DM classification technique
was utilized in order to classify modules based on the strength of their SAC
values. Results of this study were promising, and credibility values achieved
using the newly derived formula gave accurate, credible, and real indicators of
student attendance, as well as accurate classification of modules based on the
credibility of student attendance on those modules.
Related papers
- Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling [81.00825302340984]
We introduce Speculative Knowledge Distillation (SKD) to generate high-quality training data on-the-fly.
In SKD, the student proposes tokens, and the teacher replaces poorly ranked ones based on its own distribution.
We evaluate SKD on various text generation tasks, including translation, summarization, math, and instruction following.
arXiv Detail & Related papers (2024-10-15T06:51:25Z) - A Predictive Model using Machine Learning Algorithm in Identifying
Students Probability on Passing Semestral Course [0.0]
This study employs classification for data mining techniques, and decision tree for algorithm.
With the utilization of the newly discovered predictive model, the prediction of students probabilities to pass the current courses they take gives 0.7619 accuracy, 0.8333 precision, 0.8823 recall, and 0.8571 f1 score.
arXiv Detail & Related papers (2023-04-12T01:57:08Z) - Distantly-Supervised Named Entity Recognition with Adaptive Teacher
Learning and Fine-grained Student Ensemble [56.705249154629264]
Self-training teacher-student frameworks are proposed to improve the robustness of NER models.
In this paper, we propose an adaptive teacher learning comprised of two teacher-student networks.
Fine-grained student ensemble updates each fragment of the teacher model with a temporal moving average of the corresponding fragment of the student, which enhances consistent predictions on each model fragment against noise.
arXiv Detail & Related papers (2022-12-13T12:14:09Z) - Responsible Active Learning via Human-in-the-loop Peer Study [88.01358655203441]
We propose a responsible active learning method, namely Peer Study Learning (PSL), to simultaneously preserve data privacy and improve model stability.
We first introduce a human-in-the-loop teacher-student architecture to isolate unlabelled data from the task learner (teacher) on the cloud-side.
During training, the task learner instructs the light-weight active learner which then provides feedback on the active sampling criterion.
arXiv Detail & Related papers (2022-11-24T13:18:27Z) - Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy
Labels [21.374707481630697]
We propose PATE++, which combines the current advanced noisy label training mechanisms with the original PATE framework to enhance its accuracy.
A novel structure of Generative Adversarial Nets (GANs) is developed in order to integrate them effectively.
In addition, we develop a novel noisy label detection mechanism for semi-supervised model training to further improve student model performance when training with noisy labels.
arXiv Detail & Related papers (2022-11-03T07:33:49Z) - A Machine Learning system to monitor student progress in educational
institutes [0.0]
We propose a data driven approach that makes use of Machine Learning techniques to generate a classifier called credit score.
The proposal to use credit score as progress indicator is well suited to be used in a Learning Management System.
arXiv Detail & Related papers (2022-11-02T08:24:08Z) - Exploring Inconsistent Knowledge Distillation for Object Detection with
Data Augmentation [66.25738680429463]
Knowledge Distillation (KD) for object detection aims to train a compact detector by transferring knowledge from a teacher model.
We propose inconsistent knowledge distillation (IKD) which aims to distill knowledge inherent in the teacher model's counter-intuitive perceptions.
Our method outperforms state-of-the-art KD baselines on one-stage, two-stage and anchor-free object detectors.
arXiv Detail & Related papers (2022-09-20T16:36:28Z) - Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge
Distillation [70.92135839545314]
We propose the dynamic prior knowledge (DPK), which integrates part of teacher's features as the prior knowledge before the feature distillation.
Our DPK makes the performance of the student model positively correlated with that of the teacher model, which means that we can further boost the accuracy of students by applying larger teachers.
arXiv Detail & Related papers (2022-06-13T11:52:13Z) - SLADE: A Self-Training Framework For Distance Metric Learning [75.54078592084217]
We present a self-training framework, SLADE, to improve retrieval performance by leveraging additional unlabeled data.
We first train a teacher model on the labeled data and use it to generate pseudo labels for the unlabeled data.
We then train a student model on both labels and pseudo labels to generate final feature embeddings.
arXiv Detail & Related papers (2020-11-20T08:26:10Z) - Refining Student Marks based on Enrolled Modules Assessment Methods
using Data Mining Techniques [0.0]
We propose a different data preparation process by investigating more than 230000 student records for the preparation of scores.
The effect of Module Assessment Index on the prediction process using Random Forest and Naive Bayes classification techniques were investigated.
arXiv Detail & Related papers (2020-08-30T19:47:45Z) - Formulating Module Assessment for Improved Academic Performance
Predictability in Higher Education [0.0]
This paper proposes a different data preparation process through investigating more than 230000 student records.
The effect of CAR on prediction process using the random forest classification technique has been investigated.
arXiv Detail & Related papers (2020-08-30T19:42:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.