Fugu-MT 論文翻訳(概要): Large Language Models for Software Testing Education: an Experience Report

論文の概要: Large Language Models for Software Testing Education: an Experience Report

arxiv url: http://arxiv.org/abs/2603.26329v1
Date: Fri, 27 Mar 2026 11:46:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-30 21:49:48.47799
Title: Large Language Models for Software Testing Education: an Experience Report
Title（参考訳）: ソフトウェアテスティング教育のための大規模言語モデル:経験報告
Authors: Peng Yang, Yunfeng Zhu, Chao Chang, Shengcheng Yu, Zhenyu Chen, Yong Tang,
Abstract要約: ソフトウェアエンジニアリングプラクティスへのLLM(Large Language Models)の迅速な統合は、ソフトウェアテスト活動の実施方法を変えている。本稿では,ソフトウェアテスティング教育における人間とLLMのコラボレーションに関する2段階の混合手法の探索的研究について述べる。
参考スコア（独自算出の注目度）: 9.310369854735944
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The rapid integration of Large Language Models (LLMs) into software engineering practice is reshaping how software testing activities are performed. LLMs are increasingly used to support software testing. Consequently, software testing education must evolve to prepare students for this new paradigm. However, while students have already begun to use LLMs in an ad hoc manner for testing tasks, there is limited empirical understanding of how such usage influences their testing behaviors, judgment, and learning outcomes. It is necessary to conduct a systematic investigation into how students learn to evaluate, control, and refine LLM-assisted testing results. This paper presents a mixed-methods, two-phase exploratory study on human-LLM collaboration in software testing education. In Phase I, we analyze classroom learning artifacts and interaction records from 15 students, together with a large-scale survey conducted in a national software testing competition (337 valid responses), to identify recurring prompt-related difficulties across testing tasks. The results reveal systematic interaction breakdowns, including missing contextual information, insufficient constraints, rigid one-shot prompting, and limited strategy-driven iteration, with automated test script generation emerging as a particularly heterogeneous and effort-intensive interaction context. Building on these findings, Phase II conducts an illustrative classroom practice that operationalizes the observed breakdowns into a lightweight, stage-aware prompt scaffold for test script generation, guiding students to explicitly articulate execution-relevant information such as environmental assumptions, interaction grounding, synchronization, and validation intent, and reporting descriptive shifts in students' testing-related articulation when interacting with LLMs.
Abstract（参考訳）: ソフトウェアエンジニアリングプラクティスへのLLM(Large Language Models)の迅速な統合は、ソフトウェアテスト活動の実施方法を変えている。 LLMは、ソフトウェアテストをサポートするためにますます使われています。結果として、この新しいパラダイムのために学生を準備するために、ソフトウェアテスティング教育を進化させなければならない。しかし, 学生はすでに, テスト作業において, LLMを副産物的に使用し始めたが, その使用がテスト行動, 判断, 学習結果にどのような影響を及ぼすか, 経験的理解は限られている。学生がLCM支援試験の結果を評価し, 制御し, 精査する方法について, 体系的な調査を行うことが必要である。本稿では,ソフトウェアテスティング教育における人間とLLMのコラボレーションに関する2段階の混合手法の探索的研究について述べる。第1フェーズでは,15人の生徒の授業学習成果とインタラクション記録を分析し,全国ソフトウェアテストコンペティション(337件の有効回答)で実施した大規模調査と合わせて,テスト課題間で繰り返し発生する即時的困難を同定した。その結果、コンテキスト情報不足、制約不足、厳格なワンショットプロンプト、限定的な戦略駆動イテレーション、特に異種で努力集約的なインタラクションコンテキストとして自動テストスクリプト生成が出現するなど、系統的なインタラクションのブレークダウンが明らかになった。これらの知見に基づいて、第2相は、観察されたブレークスルーを、テストスクリプト生成のための軽量でステージ対応のプロンプト足場に運用し、環境仮定、相互作用基盤、同期、検証意図などの実行関連情報を明示的に記述するよう学生に指示し、LLMと対話する際のテスト関連記述における記述的変化を報告する、図示的な教室の実践を行っている。

論文の概要: Large Language Models for Software Testing Education: an Experience Report

関連論文リスト