Automatic Classification of Pedagogical Materials against CS Curriculum Guidelines
- URL: http://arxiv.org/abs/2602.03962v1
- Date: Tue, 03 Feb 2026 19:24:18 GMT
- Title: Automatic Classification of Pedagogical Materials against CS Curriculum Guidelines
- Authors: Erik Saule, Kalpathi Subramanian, Razvan Bunescu,
- Abstract summary: Professional societies often publish curriculum guidelines to help programs align their content to international standards.<n>In Computer Science, the primary standard is published by ACM and IEEE and provide detailed guidelines for what should be and could be included in a Computer Science program.<n>It is difficult for program administrators to assess how much of the guidelines is being covered by a CS program.<n>We propose using Natural Language Processing techniques to accelerate the process.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Professional societies often publish curriculum guidelines to help programs align their content to international standards. In Computer Science, the primary standard is published by ACM and IEEE and provide detailed guidelines for what should be and could be included in a Computer Science program. While very helpful, it remains difficult for program administrators to assess how much of the guidelines is being covered by a CS program. This is in particular due to the extensiveness of the guidelines, containing thousands of individual items. As such, it is time consuming and cognitively demanding to audit every course to confidently mark everything that is actually being covered. Our preliminary work indicated that it takes about a day of work per course. In this work, we propose using Natural Language Processing techniques to accelerate the process. We explore two kinds of techniques, the first relying on traditional tools for parsing, tagging, and embeddings, while the second leverages the power of Large Language Models. We evaluate the application of these techniques to classify a corpus of pedagogical materials and show that we can meaningfully classify documents automatically.
Related papers
- No More Manual Guides: Automatic and Scalable Generation of High-Quality Excel Tutorials [63.10037761131196]
Existing tutorials are manually authored by experts, require frequent updates after each software release, and incur substantial labor costs.<n>We present the first framework for automatically generating Excel tutorials directly from natural language task descriptions.<n>Our framework improves task execution success rates by 8.5% over state-of-the-art baselines.
arXiv Detail & Related papers (2025-09-26T03:21:39Z) - Designing a Syllabus for a Course on Empirical Software Engineering [2.518416353853374]
This chapter attempts to support educators in the first and most crucial step in their course design: creating the syllabus.<n>It offers a list of the fundamental building blocks for a syllabus, namely course aims, course topics, and practical assignments.<n>The course topics are also linked to the subsequent chapters of this book, so that readers can dig deeper into those chapters and get support on teaching specific research methods or cross-cutting topics.
arXiv Detail & Related papers (2025-03-14T10:58:29Z) - Integrating Human-Centric Approaches into Undergraduate Software Engineering Education: A Scoping Review and Curriculum Analysis in the Australian Context [0.0]
Human-Centric Software Engineering refers to the software engineering processes that put human needs and requirements as core practice.
A large majority of software projects fail to cater to human needs and consequently run into budget, delivery, and usability issues.
This paper presents a scoping review to identify the topics and curriculum approaches suitable for teaching HCSE to undergraduate software engineering students.
arXiv Detail & Related papers (2024-07-10T02:34:58Z) - Teaching and Learning Ethnography for Software Engineering Contexts [1.0992151305603264]
This chapter provides an introduction to teaching and learning ethnography for faculty teaching ethnography to software engineering graduate students.
The contents of the chapter focus on what we think is the core basic knowledge for newbies to ethnography as a research method.
The chapter is designed to support part of a course on empirical software engineering and provides pointers and literature for further reading.
arXiv Detail & Related papers (2024-07-05T15:43:02Z) - Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for
Language Models [153.14575887549088]
We introduce Generalized Instruction Tuning (called GLAN), a general and scalable method for instruction tuning of Large Language Models (LLMs)
GLAN exclusively utilizes a pre-curated taxonomy of human knowledge and capabilities as input and generates large-scale synthetic instruction data across all disciplines.
With the fine-grained key concepts detailed in every class session of the syllabus, we are able to generate diverse instructions with a broad coverage across the entire spectrum of human knowledge and skills.
arXiv Detail & Related papers (2024-02-20T15:00:35Z) - Teacher Perception of Automatically Extracted Grammar Concepts for L2
Language Learning [66.79173000135717]
We apply this work to teaching two Indian languages, Kannada and Marathi, which do not have well-developed resources for second language learning.
We extract descriptions from a natural text corpus that answer questions about morphosyntax (learning of word order, agreement, case marking, or word formation) and semantics (learning of vocabulary).
We enlist the help of language educators from schools in North America to perform a manual evaluation, who find the materials have potential to be used for their lesson preparation and learner evaluation.
arXiv Detail & Related papers (2023-10-27T18:17:29Z) - Teacher Perception of Automatically Extracted Grammar Concepts for L2
Language Learning [91.49622922938681]
We present an automatic framework that automatically discovers and visualizing descriptions of different aspects of grammar.
Specifically, we extract descriptions from a natural text corpus that answer questions about morphosyntax and semantics.
We apply this method for teaching the Indian languages, Kannada and Marathi, which, unlike English, do not have well-developed pedagogical resources.
arXiv Detail & Related papers (2022-06-10T14:52:22Z) - Building an Effective Automated Assessment System for C/C++ Introductory
Programming Courses in ODL Environment [0.0]
Traditional ways of assessing students' work are becoming insufficient in terms of both time and effort.
In distance education environment, such assessments become additionally more challenging in terms of hefty remuneration for hiring large number of tutors.
We identify different components that we believe are necessary in building an effective automated assessment system.
arXiv Detail & Related papers (2022-05-24T09:20:43Z) - Global Contentious Politics Database (GLOCON) Annotation Manuals [0.0]
The GLOCON Gold Standard Corpus (GSC) contains news articles from multiple sources from each focus country.
The articles in the GSC were manually coded by skilled annotators in both classification and extraction tasks.
This document lays out the rules according to which annotators code the news articles.
arXiv Detail & Related papers (2022-05-17T13:16:50Z) - CLICKER: A Computational LInguistics Classification Scheme for
Educational Resources [47.48935730905393]
A classification scheme of a scientific subject gives an overview of its body of knowledge.
A comprehensive classification system like CCS or Mathematics Subject Classification (MSC) does not exist for Computational Linguistics (CL) and Natural Language Processing (NLP)
We propose a classification scheme -- CLICKER for CL/NLP based on the analysis of online lectures from 77 university courses on this subject.
arXiv Detail & Related papers (2021-12-16T02:40:43Z) - ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback [54.142719510638614]
In this paper, we frame the problem of providing feedback as few-shot classification.
A meta-learner adapts to give feedback to student code on a new programming question from just a few examples by instructors.
Our approach was successfully deployed to deliver feedback to 16,000 student exam-solutions in a programming course offered by a tier 1 university.
arXiv Detail & Related papers (2021-07-23T22:41:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.