Related papers: Exploring LLMs for Verifying Technical System Specifications Against Requirements

Exploring LLMs for Verifying Technical System Specifications Against Requirements

URL: http://arxiv.org/abs/2411.11582v1
Date: Mon, 18 Nov 2024 13:59:29 GMT
Title: Exploring LLMs for Verifying Technical System Specifications Against Requirements
Authors: Lasse M. Reinpold, Marvin Schieseck, Lukas P. Wagner, Felix Gehlhoff, Alexander Fay,
Abstract summary: The field of knowledge-based requirements engineering (KBRE) aims to support engineers by providing knowledge to assist in the elicitation, validation, and management of system requirements. The advent of large language models (LLMs) opens new opportunities in the field of KBRE. This work experimentally investigates the potential of LLMs in requirements verification.
Score: 41.19948826527649
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Requirements engineering is a knowledge intensive process and crucial for the success of engineering projects. The field of knowledge-based requirements engineering (KBRE) aims to support engineers by providing knowledge to assist in the elicitation, validation, and management of system requirements. The advent of large language models (LLMs) opens new opportunities in the field of KBRE. This work experimentally investigates the potential of LLMs in requirements verification. Therein, LLMs are provided with a set of requirements and a textual system specification and are prompted to assess which requirements are fulfilled by the system specification. Different experimental variables such as system specification complexity, the number of requirements, and prompting strategies were analyzed. Formal rule-based systems serve as a benchmark to compare LLM performance to. Requirements and system specifications are derived from the smart-grid domain. Results show that advanced LLMs, like GPT-4o and Claude 3.5 Sonnet, achieved f1-scores between 79 % and 94 % in identifying non-fulfilled requirements, indicating potential for LLMs to be leveraged for requirements verification.

Related papers

Exploring the Use of LLMs for Requirements Specification in an IT Consulting Company [0.39563752273706504]
This paper reports our experience using large language models (LLMs) to automate the requirements specification process.<n>We show that LLMs can help automate and standardize the requirements specification, reducing time and human effort.<n>However, the quality of LLM-generated FDS highly depends on inputs and often requires human revision.
arXiv Detail & Related papers (2025-07-25T09:49:37Z)
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications [0.0]
LLMs can be used to solve complex tasks by combining reasoning techniques, code generation, and software execution across multiple, potentially specialized LLMs.<n>This paper introduces an agent schema language and the execution and evaluation of the specifications through a multi-agent system architecture and prototype.<n>Test cases involving cybersecurity tasks indicate the feasibility of the architecture and evaluation approach.
arXiv Detail & Related papers (2025-06-12T08:16:17Z)
Learnware of Language Models: Specialized Small Language Models Can Do Big [50.285859986475394]
This paper presents a preliminary attempt to apply the learnware paradigm to language models.<n>We simulated a learnware system comprising approximately 100 learnwares of specialized SLMs with 8B parameters.<n>By selecting one suitable learnware for each task-specific inference, the system outperforms the base SLMs on all benchmarks.
arXiv Detail & Related papers (2025-05-19T17:54:35Z)
Extracting Formal Specifications from Documents Using LLMs for Automated Testing [11.129512305353055]
The main approach to defining formal specifications is through manual analysis of software documents. System update further increases the human labor cost to maintain a corresponding formal specification. Recent advances in Large Language Models have demonstrated promising capabilities in natural language understanding.
arXiv Detail & Related papers (2025-04-02T01:58:11Z)
Analysis of LLMs vs Human Experts in Requirements Engineering [0.0]
Large Language Models (LLM) application to software development has been on the subject of code generation. This study compares LLM's ability to elicit requirements of a software system, as compared to that of a human expert in a time-boxed and prompt-boxed study.
arXiv Detail & Related papers (2025-01-31T16:55:17Z)
Digital requirements engineering with an INCOSE-derived SysML meta-model [0.0]
We extend the Model-Based Structured Requirement SysML Profile to comply with the INCOSE Guide to Writing Requirements. The resulting SysML Profile was applied in two system architecture models at NASA Jet Propulsion Laboratory.
arXiv Detail & Related papers (2024-10-12T03:06:13Z)
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More? [54.667202878390526]
Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. We introduce LOFT, a benchmark of real-world tasks requiring context up to millions of tokens designed to evaluate LCLMs' performance on in-context retrieval and reasoning. Our findings reveal LCLMs' surprising ability to rival state-of-the-art retrieval and RAG systems, despite never having been explicitly trained for these tasks.
arXiv Detail & Related papers (2024-06-19T00:28:58Z)
Efficient Prompting for LLM-based Generative Internet of Things [88.84327500311464]
Large language models (LLMs) have demonstrated remarkable capacities on various tasks, and integrating the capacities of LLMs into the Internet of Things (IoT) applications has drawn much research attention recently. Due to security concerns, many institutions avoid accessing state-of-the-art commercial LLM services, requiring the deployment and utilization of open-source LLMs in a local network setting. We propose a LLM-based Generative IoT (GIoT) system deployed in the local network setting in this study.
arXiv Detail & Related papers (2024-06-14T19:24:00Z)
Requirements are All You Need: From Requirements to Code with LLMs [0.0]
Large language models (LLMs) can be applied to software engineering tasks. This paper introduces a tailored LLM for automating the generation of code snippets from well-structured requirements documents. We demonstrate the LLM's proficiency in comprehending intricate user requirements and producing robust design and code solutions.
arXiv Detail & Related papers (2024-06-14T14:57:35Z)
Using LLMs in Software Requirements Specifications: An Empirical Evaluation [0.2812395851874055]
We assess the performance of GPT-4 and CodeLlama in drafting an Software Requirements Specification. Our results suggest that LLMs can match the output quality of an entry-level software engineer to generate an SRS. We conclude that the LLMs can be gainfully used by software engineers to increase productivity.
arXiv Detail & Related papers (2024-04-27T09:37:00Z)
Tapping the Potential of Large Language Models as Recommender Systems: A Comprehensive Framework and Empirical Analysis [91.5632751731927]
Large Language Models such as ChatGPT have showcased remarkable abilities in solving general tasks. We propose a general framework for utilizing LLMs in recommendation tasks, focusing on the capabilities of LLMs as recommenders. We analyze the impact of public availability, tuning strategies, model architecture, parameter scale, and context length on recommendation results.
arXiv Detail & Related papers (2024-01-10T08:28:56Z)
When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks [54.71034943526973]
In-context learning (ICL) has become the default method for using large language models (LLMs) We find that ICL falls short of handling specification-heavy tasks, which are tasks with complicated and extensive task specifications. We identify three primary reasons: inability to specifically understand context, misalignment in task schema comprehension with humans, and inadequate long-text understanding ability.
arXiv Detail & Related papers (2023-11-15T14:26:30Z)
Identifying Concerns When Specifying Machine Learning-Enabled Systems: A Perspective-Based Approach [1.2184324428571227]
PerSpecML is a perspective-based approach for specifying ML-enabled systems. It helps practitioners identify which attributes, including ML and non-ML components, are important to contribute to the overall system's quality.
arXiv Detail & Related papers (2023-09-14T18:31:16Z)
How Can Recommender Systems Benefit from Large Language Models: A Survey [82.06729592294322]
Large language models (LLM) have shown impressive general intelligence and human-like capabilities. We conduct a comprehensive survey on this research direction from the perspective of the whole pipeline in real-world recommender systems.
arXiv Detail & Related papers (2023-06-09T11:31:50Z)
Augmented Large Language Models with Parametric Knowledge Guiding [72.71468058502228]
Large Language Models (LLMs) have significantly advanced natural language processing (NLP) with their impressive language understanding and generation capabilities. Their performance may be suboptimal for domain-specific tasks that require specialized knowledge due to limited exposure to the related data. We propose the novel Parametric Knowledge Guiding (PKG) framework, which equips LLMs with a knowledge-guiding module to access relevant knowledge.
arXiv Detail & Related papers (2023-05-08T15:05:16Z)

This list is automatically generated from the titles and abstracts of the papers in this site.