Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
- URL: http://arxiv.org/abs/2512.05507v1
- Date: Fri, 05 Dec 2025 08:09:13 GMT
- Title: Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
- Authors: Masoud Sadrnezhaad, José Antonio Hernández López, Torvald Mårtensson, Daniel Varro,
- Abstract summary: Quality assurance for large-scale cyber-physical systems relies on sophisticated test activities.<n>Recent advances in generative AI have led to tools that can produce executable test cases for software systems.<n>The application of generative AI techniques to simulation-based testing of large-scale cyber-physical systems remains underexplored.
- Score: 2.432409923443071
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Quality assurance for large-scale cyber-physical systems relies on sophisticated test activities using complex test environments investigated with the help of numerous types of simulators. As these systems grow, extensive resources are required to develop and maintain simulation models of hardware and software components, as well as physical environments. Meanwhile, recent advances in generative AI have led to tools that can produce executable test cases for software systems, offering potential benefits such as reducing manual efforts or increasing test coverage. However, the application of generative AI techniques to simulation-based testing of large-scale cyber-physical systems remains underexplored. To better understand this gap, this study captures practitioners' perspectives on leveraging generative AI, based on a cross-company workshop with six organizations. Our contribution is twofold: (1) detailed, experience-based insights into challenges faced by engineers, and (2) a research agenda comprising three high-priority directions: (a) AI-generated scenarios and environment models, (b) simulators and AI in CI/CD pipelines, and (c) trustworthiness in generative AI for simulation. While participants acknowledged substantial potential, they also highlighted unresolved challenges. By detailing these issues, the paper aims to guide future academia-industry collaboration towards the responsible adoption of generative AI in simulation-based testing.
Related papers
- Generative AI in Software Testing: Current Trends and Future Directions [1.0312968200748118]
This paper investigates current software testing systems and explores how artificial intelligence, specifically Generative AI, can be integrated to enhance these systems.<n>It focuses on the potential of Generative AI to transform software testing processes by improving test coverage, increasing efficiency, and reducing costs.
arXiv Detail & Related papers (2026-03-02T18:01:43Z) - Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey [59.3507264893654]
Issue resolution is a complex Software Engineering task integral to real-world development.<n> benchmarks like SWE-bench revealed this task as profoundly difficult for large language models.<n>This paper presents a systematic survey of this emerging domain.
arXiv Detail & Related papers (2026-01-15T18:55:03Z) - Agentic Pipelines in Embedded Software Engineering: Emerging Practices and Challenges [2.0769172070951067]
A new transformation is underway in software engineering, driven by the rapid adoption of generative AI in development.<n>For embedded software engineering organizations, however, this marks their first experience integrating AI into safety-critical and resource-constrained environments.<n>The strict demands for determinism, reliability, and traceability pose unique challenges for adopting generative technologies.
arXiv Detail & Related papers (2026-01-15T09:30:46Z) - The Software Engineering Simulations Lab: Agentic AI for RE Quality Simulations [0.0]
Quality in Requirements Engineering (RE) is still predominantly anecdotal and intuition-driven.<n>With the advent of AI-based development, the requirements quality factors may change.<n>This paper contributes a first concept, a research roadmap, a prototype, and a first feasibility study for RE simulations with agentic AI.
arXiv Detail & Related papers (2025-11-21T20:19:08Z) - Dyna-Mind: Learning to Simulate from Experience for Better AI Agents [62.21219817256246]
We argue that current AI agents need ''vicarious trial and error'' - the capacity to mentally simulate alternative futures before acting.<n>We introduce Dyna-Mind, a two-stage training framework that explicitly teaches (V)LM agents to integrate such simulation into their reasoning.
arXiv Detail & Related papers (2025-10-10T17:30:18Z) - AI Simulation by Digital Twins: Systematic Survey, Reference Framework, and Mapping to a Standardized Architecture [9.087189607749094]
Insufficient data volume and quality are pressing challenges in the adoption of modern subsymbolic AI.<n>To alleviate these challenges, AI simulation uses virtual training environments in which AI agents can be safely and efficiently developed with simulated, synthetic data.<n>Digital twins open new avenues in AI simulation, as these high-fidelity virtual replicas of physical systems are equipped with state-of-the-art simulators.
arXiv Detail & Related papers (2025-06-06T23:13:38Z) - YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language Models [50.35333054932747]
We introduce a novel social simulator called YuLan-OneSim.<n>Users can simply describe and refine their simulation scenarios through natural language interactions with our simulator.<n>We implement 50 default simulation scenarios spanning 8 domains, including economics, sociology, politics, psychology, organization, demographics, law, and communication.
arXiv Detail & Related papers (2025-05-12T14:05:17Z) - Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey [58.50944604905037]
Edge-cloud collaborative computing (ECCC) has emerged as a pivotal paradigm for addressing the computational demands of modern intelligent applications.<n>Recent advancements in AI, particularly deep learning and large language models (LLMs), have dramatically enhanced the capabilities of these distributed systems.<n>This survey provides a structured tutorial on fundamental architectures, enabling technologies, and emerging applications.
arXiv Detail & Related papers (2025-05-03T13:55:38Z) - EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents [53.717918131568936]
Embodied artificial intelligence (EAI) integrates advanced AI models into physical entities for real-world interaction.<n>Foundation models as the "brain" of EAI agents for high-level task planning have shown promising results.<n>However, the deployment of these agents in physical environments presents significant safety challenges.<n>This study introduces EARBench, a novel framework for automated physical risk assessment in EAI scenarios.
arXiv Detail & Related papers (2024-08-08T13:19:37Z) - AI Agents and Education: Simulated Practice at Scale [0.0]
This paper explores the potential of generative AI in creating adaptive educational simulations.
By leveraging a system of multiple AI agents, simulations can provide personalized learning experiences.
We describe a prototype, PitchQuest, a venture capital pitching simulator that showcases the capabilities of AI in delivering instruction.
arXiv Detail & Related papers (2024-06-20T05:26:04Z) - A Roadmap for Simulation-Based Testing of Autonomous Cyber-Physical Systems: Challenges and Future Direction [5.742965094549775]
This paper pioneers a strategic roadmap for simulation-based testing of autonomous systems.
Our paper discusses the relevant challenges and obstacles of ACPSs, focusing on test automation and quality assurance.
arXiv Detail & Related papers (2024-05-02T07:42:33Z) - Software Testing of Generative AI Systems: Challenges and Opportunities [5.634825161148484]
I will explore the challenges posed by generative AI systems and discuss potential opportunities for future research in the field of testing.
I will touch on the specific characteristics of GenAI systems that make traditional testing techniques inadequate or insufficient.
arXiv Detail & Related papers (2023-09-07T08:35:49Z) - RoboTHOR: An Open Simulation-to-Real Embodied AI Platform [56.50243383294621]
We introduce RoboTHOR to democratize research in interactive and embodied visual AI.
We show there exists a significant gap between the performance of models trained in simulation when they are tested in both simulations and their carefully constructed physical analogs.
arXiv Detail & Related papers (2020-04-14T20:52:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.