Fugu-MT Paper Translation (Summary): Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance

Paper summary: Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance

arxiv url: http://arxiv.org/abs/2603.13562v1
Date: Fri, 13 Mar 2026 19:59:08 GMT
Status: Translation complete
Last updated in system: 2026-03-17 16:19:35.275052
Title: Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance
Authors: Brecht Verbeken, Joke Van den Broeck, Inge De Cleyn, Steven Van Luchene, Nadine Engels, Andres Algaba, Vincent Ginis
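Abstract summary: Higher education institutions face increasing pressure to audit course designs for generative AI (GenAI) integration. This paper presents an end-to-end method for using large language models (LLMs) to scan course information sheets at scale.
Reference score (independently computed attention measure): 3.706350695479005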
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Purpose: Higher education institutions face increasing pressure to audit course designs for generative AI (GenAI) integration. This paper presents an end-to-end method for using large language models (LLMs) to scan course information sheets at scale, identify where assessments may be vulnerable to student use of GenAI tools, validate system performance through iterative refinement, and operationalise results through direct stakeholder communication and effort. Method: We developed a four-phase pipeline: (0) manual pilot sampling, (1) iterative prompt engineering with multi-model comparison, (2) full production scan of 4,684 Bachelor and Master course information sheets (Academic Year 2024-2025) from the Vrije Universiteit Brussel (VUB) with automated report generation and email distribution to teaching teams (91.4% address-matched) using a three-tier risk taxonomy (Clear risk, Potential risk, Low risk), and (3) longitudinal re-scan of 4,675 sheets after the next catalogue release. Results: Five iterations of prompt refinement achieved 87% agreement with expert labels. GPT-4o was selected for production based on superior handling of ambiguous cases involving internships and practical components. The Year 1 scan classified 60.3% of courses as Clear risk, 15.2% as Potential risk, and 24.5% as Low risk. Year 2 comparison revealed substantial shifts in risk distributions, with improvements most pronounced in practice-oriented programmes. Implications: The method enables institutions to rapidly transform heterogeneous catalogue data into structured and actionable intelligence. The approach is transferable to other audit domains (sustainability, accessibility, pedagogical alignment) and provides a template for responsible LLM deployment in higher education governance.
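The abstract reports two kinds of figures: agreement between model labels and expert labels (87%), and the share of courses in each tier of the three-tier risk taxonomy. The following is a minimal sketch of how such figures could be computed from label lists. It is not the authors' implementation: the function names `agreement_rate` and `risk_distribution` and the toy label lists are hypothetical, and simple percent agreement is assumed because the abstract does not say which agreement measure was used.

```python
from collections import Counter

# Tier names taken from the abstract's three-tier risk taxonomy.
RISK_TIERS = ("Clear risk", "Potential risk", "Low risk")


def agreement_rate(model_labels, expert_labels):
    """Fraction of sheets where the model label equals the expert label.

    Assumption: plain percent agreement, not a chance-corrected measure.
    """
    if len(model_labels) != len(expert_labels):
        raise ValueError("label lists must have the same length")
    if not model_labels:
        raise ValueError("need at least one labelled sheet")
    matches = sum(m == e for m, e in zip(model_labels, expert_labels))
    return matches / len(model_labels)


def risk_distribution(labels):
    """Percentage share of each risk tier, rounded to one decimal place."""
    if not labels:
        raise ValueError("need at least one label")
    counts = Counter(labels)
    unknown = set(counts) - set(RISK_TIERS)
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    total = len(labels)
    return {tier: round(100 * counts[tier] / total, 1) for tier in RISK_TIERS}


# Hypothetical toy data for illustration only; not from the paper.
expert = ["Clear risk", "Clear risk", "Potential risk", "Low risk", "Low risk"]
model = ["Clear risk", "Potential risk", "Potential risk", "Low risk", "Low risk"]

print(agreement_rate(model, expert))
print(risk_distribution(model))
```

Running the same two functions on the labels from successive prompt iterations, or on the Year 1 and Year 2 scans, would give comparable numbers for tracking refinement and year-over-year shifts.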
