Fugu-MT Paper Translation (Abstract): Generating High-Level Test Cases from Requirements using LLM: An Industry Study

Paper overview: Generating High-Level Test Cases from Requirements using LLM: An Industry Study

arxiv url: http://arxiv.org/abs/2510.03641v1
Date: Sat, 04 Oct 2025 03:05:45 GMT
Status: Translation complete
Last updated in system: 2025-10-07 16:52:59.172472
Title: Generating High-Level Test Cases from Requirements using LLM: An Industry Study
Title (reference translation): Generating High-Level Test Cases Using LLMs: An Industry Study
Authors: Satoshi Masuda, Satoshi Kouzawa, Kyousuke Sezai, Hidetoshi Suhara, Yasuaki Hiruta, Kunihiro Kudou
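Abstract summary: Currently, high-level test cases described in natural language are created manually from requirement documents. In some cases, retrieval-augmented generation (RAG) is used to generate high-level test cases with large language models (LLMs). This paper proposes a method for generating high-level (GHL) test cases from requirement documents without creating a RAG.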
Reference score (independently computed attention measure): 0.2257707034197163
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Currently, generating high-level test cases described in natural language from requirement documents is performed manually. In the industry, including companies specializing in software testing, there is a significant demand for the automatic generation of high-level test cases from requirement documents using Large Language Models (LLMs). Efforts to utilize LLMs for requirement analysis are underway. In some cases, retrieval-augmented generation (RAG) is employed for generating high-level test cases using LLMs. However, in practical applications, it is necessary to create a RAG tailored to the knowledge system of each specific application, which is labor-intensive. Moreover, when applying high-level test case generation as a prompt, there is no established method for instructing the generation of high-level test cases at a level applicable to other specifications without using RAG. It is required to establish a method for the automatic generation of high-level test cases that can be generalized across a wider range of requirement documents. In this paper, we propose a method for generating high-level (GHL) test cases from requirement documents using only prompts, without creating RAGs. In the proposed method, first, the requirement document is input into the LLM to generate test design techniques corresponding to the requirement document. Then, high-level test cases are generated for each of the generated test design techniques. Furthermore, we verify an evaluation method based on semantic similarity of the generated high-level test cases. In the experiments, we confirmed the method using datasets from Bluetooth and Mozilla, where requirement documents and high-level test cases are available, achieving macro-recall measurement of 0.81 and 0.37, respectively. We believe that the method is feasible for practical application in generating high-level test cases without using RAG.
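Abstract (reference translation): Currently, high-level test cases described in natural language are created manually from requirement documents. In industry, including companies that specialize in software testing, there is demand for automatically generating high-level test cases from requirement documents using Large Language Models (LLMs). Efforts to apply LLMs to requirement analysis are underway. In some cases, retrieval-augmented generation (RAG) is used to generate high-level test cases with LLMs. In practice, however, a RAG must be built to fit the knowledge system of each individual application, which is labor-intensive. Furthermore, when high-level test case generation is applied as a prompt, there is no established way to instruct the generation of high-level test cases at a level applicable to other specifications without using RAG. A method for automatically generating high-level test cases that generalizes across a wide range of requirement documents needs to be established. This paper proposes a method for generating high-level (GHL) test cases from requirement documents using only prompts, without creating a RAG. In the proposed method, the requirement document is first input into the LLM to generate the test design techniques that correspond to it. High-level test cases are then generated for each of the generated test design techniques. In addition, an evaluation method based on the semantic similarity of the generated high-level test cases is verified. In the experiments, the method was confirmed on datasets from Bluetooth and Mozilla, for which requirement documents and high-level test cases are available, yielding macro-recall values of 0.81 and 0.37, respectively. The authors believe the method is feasible for practical use in generating high-level test cases without using RAG.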
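The two-stage prompting flow and the similarity-based recall described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the prompt wording, the `llm` callable, the token-overlap `jaccard` similarity (a stand-in for a real semantic similarity measure such as embedding cosine similarity), the 0.5 threshold, and the exact definition of macro-recall (mean of per-document recall) are all assumptions made for this example.

```python
from typing import Callable, Dict, List, Sequence, Tuple

LLM = Callable[[str], str]  # hypothetical: takes a prompt, returns raw text


def generate_high_level_tests(requirement: str, llm: LLM) -> Dict[str, List[str]]:
    """Stage 1: ask for test design techniques. Stage 2: ask for cases per technique."""
    technique_prompt = (
        "List the test design techniques suitable for the following "
        "requirement, one per line.\n\n" + requirement
    )
    techniques = [t.strip() for t in llm(technique_prompt).splitlines() if t.strip()]

    tests: Dict[str, List[str]] = {}
    for technique in techniques:
        case_prompt = (
            f"Using the test design technique '{technique}', write high-level "
            "test cases, one per line, for the following requirement.\n\n" + requirement
        )
        tests[technique] = [c.strip() for c in llm(case_prompt).splitlines() if c.strip()]
    return tests


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity; an assumed placeholder for semantic similarity."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def recall(
    reference: Sequence[str],
    generated: Sequence[str],
    similarity: Callable[[str, str], float] = jaccard,
    threshold: float = 0.5,
) -> float:
    """Fraction of reference test cases matched by at least one generated case."""
    if not reference:
        return 0.0
    hits = sum(
        1 for ref in reference
        if any(similarity(ref, gen) >= threshold for gen in generated)
    )
    return hits / len(reference)


def macro_recall(
    pairs: Sequence[Tuple[Sequence[str], Sequence[str]]],
    similarity: Callable[[str, str], float] = jaccard,
    threshold: float = 0.5,
) -> float:
    """Mean of per-document recall over (reference, generated) pairs."""
    scores = [recall(ref, gen, similarity, threshold) for ref, gen in pairs]
    return sum(scores) / len(scores) if scores else 0.0
```

Passing the model as a plain callable keeps the sketch independent of any particular LLM client; a real evaluation would swap `jaccard` for an embedding-based similarity and tune the threshold.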


Related papers