Automated Scoring of Graphical Open-Ended Responses Using Artificial
Neural Networks
- URL: http://arxiv.org/abs/2201.01783v1
- Date: Wed, 5 Jan 2022 19:00:18 GMT
- Title: Automated Scoring of Graphical Open-Ended Responses Using Artificial
Neural Networks
- Authors: Matthias von Davier, Lillian Tyack, Lale Khorramdel
- Abstract summary: convolutional neural networks (CNNs) outperform feedforward neural networks in both loss and accuracy.
As an additional innovation, we outline a method to select human rated responses for the training sample based on an application of the expected response function.
This paper argues that CNN-based automated scoring of image responses is a highly accurate procedure that could potentially replace the workload and cost of second human raters for large scale assessments.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Automated scoring of free drawings or images as responses has yet to be
utilized in large-scale assessments of student achievement. In this study, we
propose artificial neural networks to classify these types of graphical
responses from a computer based international mathematics and science
assessment. We are comparing classification accuracy of convolutional and
feedforward approaches. Our results show that convolutional neural networks
(CNNs) outperform feedforward neural networks in both loss and accuracy. The
CNN models classified up to 97.71% of the image responses into the appropriate
scoring category, which is comparable to, if not more accurate, than typical
human raters. These findings were further strengthened by the observation that
the most accurate CNN models correctly classified some image responses that had
been incorrectly scored by the human raters. As an additional innovation, we
outline a method to select human rated responses for the training sample based
on an application of the expected response function derived from item response
theory. This paper argues that CNN-based automated scoring of image responses
is a highly accurate procedure that could potentially replace the workload and
cost of second human raters for large scale assessments, while improving the
validity and comparability of scoring complex constructed-response items.
Related papers
- Graph Neural Networks for Learning Equivariant Representations of Neural Networks [55.04145324152541]
We propose to represent neural networks as computational graphs of parameters.
Our approach enables a single model to encode neural computational graphs with diverse architectures.
We showcase the effectiveness of our method on a wide range of tasks, including classification and editing of implicit neural representations.
arXiv Detail & Related papers (2024-03-18T18:01:01Z) - Adversarial training with informed data selection [53.19381941131439]
Adrial training is the most efficient solution to defend the network against these malicious attacks.
This work proposes a data selection strategy to be applied in the mini-batch training.
The simulation results show that a good compromise can be obtained regarding robustness and standard accuracy.
arXiv Detail & Related papers (2023-01-07T12:09:50Z) - Increasing the Accuracy of a Neural Network Using Frequency Selective
Mesh-to-Grid Resampling [4.211128681972148]
We propose the use of keypoint frequency selective mesh-to-grid resampling (FSMR) for the processing of input data for neural networks.
We show that depending on the network architecture and classification task the application of FSMR during training aids learning process.
The classification accuracy can be increased by up to 4.31 percentage points for ResNet50 and the Oxflower17 dataset.
arXiv Detail & Related papers (2022-09-28T21:34:47Z) - SATBench: Benchmarking the speed-accuracy tradeoff in object recognition
by humans and dynamic neural networks [0.45438205344305216]
People show a flexible tradeoff between speed and accuracy.
We present the first large-scale dataset of the speed-accuracy tradeoff (SAT) in recognizing ImageNet images.
We compare networks with humans on curve-fit error, category-wise correlation, and curve steepness.
arXiv Detail & Related papers (2022-06-16T20:03:31Z) - Lost Vibration Test Data Recovery Using Convolutional Neural Network: A
Case Study [0.0]
This paper proposes a CNN algorithm for the Alamosa Canyon Bridge as a real structure.
Three different CNN models were considered to predict one and two malfunctioned sensors.
The accuracy of the model was increased by adding a convolutional layer.
arXiv Detail & Related papers (2022-04-11T23:24:03Z) - Efficient and Robust Classification for Sparse Attacks [34.48667992227529]
We consider perturbations bounded by the $ell$--norm, which have been shown as effective attacks in the domains of image-recognition, natural language processing, and malware-detection.
We propose a novel defense method that consists of "truncation" and "adrial training"
Motivated by the insights we obtain, we extend these components to neural network classifiers.
arXiv Detail & Related papers (2022-01-23T21:18:17Z) - Image Quality Assessment using Contrastive Learning [50.265638572116984]
We train a deep Convolutional Neural Network (CNN) using a contrastive pairwise objective to solve the auxiliary problem.
We show through extensive experiments that CONTRIQUE achieves competitive performance when compared to state-of-the-art NR image quality models.
Our results suggest that powerful quality representations with perceptual relevance can be obtained without requiring large labeled subjective image quality datasets.
arXiv Detail & Related papers (2021-10-25T21:01:00Z) - Medulloblastoma Tumor Classification using Deep Transfer Learning with
Multi-Scale EfficientNets [63.62764375279861]
We propose an end-to-end MB tumor classification and explore transfer learning with various input sizes and matching network dimensions.
Using a data set with 161 cases, we demonstrate that pre-trained EfficientNets with larger input resolutions lead to significant performance improvements.
arXiv Detail & Related papers (2021-09-10T13:07:11Z) - BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by
Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks.
Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z) - Neural Networks with Recurrent Generative Feedback [61.90658210112138]
We instantiate this design on convolutional neural networks (CNNs)
In the experiments, CNN-F shows considerably improved adversarial robustness over conventional feedforward CNNs on standard benchmarks.
arXiv Detail & Related papers (2020-07-17T19:32:48Z) - A Neuro-AI Interface for Evaluating Generative Adversarial Networks [11.570606002205153]
evaluation metric called Neuroscore, for evaluating the performance of GANs.
Neuroscore has superior performance to the current evaluation metrics in that: (1) It is more consistent with human judgment; (2) The evaluation process needs much smaller numbers of samples; and (3) It is able to rank the quality of images on a per GAN basis.
arXiv Detail & Related papers (2020-03-05T17:53:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.