On the Benefits of Semi-Supervised Test Case Generation for Simulation
Models
- URL: http://arxiv.org/abs/2305.03714v2
- Date: Fri, 1 Dec 2023 17:47:29 GMT
- Title: On the Benefits of Semi-Supervised Test Case Generation for Simulation
Models
- Authors: Xiao Ling, Tim Menzies
- Abstract summary: Testing complex simulation models can be expensive and time consuming.
Current state-of-the-art methods that explore this problem are fully-supervised.
We introduce GenClu, which takes a semi-supervised approach.
- Score: 16.28839850314951
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Testing complex simulation models can be expensive and time consuming.
Current state-of-the-art methods that explore this problem are
fully-supervised; i.e. they require that all examples are labeled. On the other
hand, the GenClu system (introduced in this paper) takes a semi-supervised
approach; i.e. (a) only a small subset of information is actually labeled (via
simulation) and (b) those labels are then spread across the rest of the data.
When applied to five open-source simulation models of cyber-physical systems,
GenClu's test generation can be multiple orders of magnitude faster than the
prior state of the art. Further, when assessed via mutation testing, tests
generated by GenClu were as good or better than anything else tested here.
Hence, we recommend semi-supervised methods over prior methods (evolutionary
search and fully-supervised learning).
Related papers
- Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Fusing Neural and Physical: Augment Protein Conformation Sampling with
Tractable Simulations [27.984190594059868]
generative models have been leveraged as a surrogate sampler to obtain conformation ensembles with orders of magnitude faster.
In this work, we explore the few-shot setting of such pre-trained generative sampler which incorporates MD simulations in a tractable manner.
arXiv Detail & Related papers (2024-02-16T03:48:55Z) - A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts [117.72709110877939]
Test-time adaptation (TTA) has the potential to adapt a pre-trained model to unlabeled data during testing, before making predictions.
We categorize TTA into several distinct groups based on the form of test data, namely, test-time domain adaptation, test-time batch adaptation, and online test-time adaptation.
arXiv Detail & Related papers (2023-03-27T16:32:21Z) - Lamarr: LHCb ultra-fast simulation based on machine learning models deployed within Gauss [0.0]
We discuss Lamarr, a framework to speed-up the simulation production parameterizing both the detector response and the reconstruction algorithms of the LHCb experiment.
Deep Generative Models powered by several algorithms and strategies are employed to effectively parameterize the high-level response of the single components of the LHCb detector.
arXiv Detail & Related papers (2023-03-20T20:18:04Z) - Boosting Low-Data Instance Segmentation by Unsupervised Pre-training
with Saliency Prompt [103.58323875748427]
This work offers a novel unsupervised pre-training solution for low-data regimes.
Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models.
Experimental results show that our method significantly boosts several QEIS models on three datasets.
arXiv Detail & Related papers (2023-02-02T15:49:03Z) - Construction of a Surrogate Model: Multivariate Time Series Prediction
with a Hybrid Model [2.198430261120653]
Automotive groups rely on simulators to perform most tests.
The reliability of these simulators for constantly refined tasks is becoming an issue.
To increase the number of tests, the industry is now developing surrogate models.
arXiv Detail & Related papers (2022-12-15T15:52:18Z) - MRCLens: an MRC Dataset Bias Detection Toolkit [82.44296974850639]
We introduce MRCLens, a toolkit that detects whether biases exist before users train the full model.
For the convenience of introducing the toolkit, we also provide a categorization of common biases in MRC.
arXiv Detail & Related papers (2022-07-18T21:05:39Z) - Efficient Learning of Accurate Surrogates for Simulations of Complex Systems [0.0]
We introduce an online learning method empowered by sampling-driven sampling.
It ensures that all turning points on the model response surface are included in the training data.
We apply our method to simulations of nuclear matter to demonstrate that highly accurate surrogates can be reliably auto-generated.
arXiv Detail & Related papers (2022-07-11T20:51:11Z) - TTAPS: Test-Time Adaption by Aligning Prototypes using Self-Supervision [70.05605071885914]
We propose a novel modification of the self-supervised training algorithm SwAV that adds the ability to adapt to single test samples.
We show the success of our method on the common benchmark dataset CIFAR10-C.
arXiv Detail & Related papers (2022-05-18T05:43:06Z) - Machine Learning Testing in an ADAS Case Study Using
Simulation-Integrated Bio-Inspired Search-Based Testing [7.5828169434922]
Deeper generates failure-revealing test scenarios for testing a deep neural network-based lane-keeping system.
In the newly proposed version, we utilize a new set of bio-inspired search algorithms, genetic algorithm (GA), $(mu+lambda)$ and $(mu,lambda)$ evolution strategies (ES), and particle swarm optimization (PSO)
Our evaluation shows the newly proposed test generators in Deeper represent a considerable improvement on the previous version.
arXiv Detail & Related papers (2022-03-22T20:27:40Z) - Evaluating Explainable AI: Which Algorithmic Explanations Help Users
Predict Model Behavior? [97.77183117452235]
We carry out human subject tests to isolate the effect of algorithmic explanations on model interpretability.
Clear evidence of method effectiveness is found in very few cases.
Our results provide the first reliable and comprehensive estimates of how explanations influence simulatability.
arXiv Detail & Related papers (2020-05-04T20:35:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.