A Micro Architectural Events Aware Real-Time Embedded System Fault Injector
- URL: http://arxiv.org/abs/2401.08397v2
- Date: Tue, 11 Jun 2024 08:44:00 GMT
- Title: A Micro Architectural Events Aware Real-Time Embedded System Fault Injector
- Authors: Enrico Magliano, Alessio Carpegna, Alessadro Savino, Stefano Di Carlo,
- Abstract summary: This paper introduces a novel fault injector designed to facilitate the monitoring, aggregation, and examination of micro-architectural events.
The methodology targets bit-flipping within the memory system, affecting CPU registers and RAM.
The outcomes of these fault injections enable a thorough analysis of the impact of soft errors and establish a robust correlation between the identified faults and the essential timing predictability demanded by SACRES.
- Score: 0.12187048691454236
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In contemporary times, the increasing complexity of the system poses significant challenges to the reliability, trustworthiness, and security of the SACRES. Key issues include the susceptibility to phenomena such as instantaneous voltage spikes, electromagnetic interference, neutron strikes, and out-of-range temperatures. These factors can induce switch state changes in transistors, resulting in bit-flipping, soft errors, and transient corruption of stored data in memory. The occurrence of soft errors, in turn, may lead to system faults that can propel the system into a hazardous state. Particularly in critical sectors like automotive, avionics, or aerospace, such malfunctions can have real-world implications, potentially causing harm to individuals. This paper introduces a novel fault injector designed to facilitate the monitoring, aggregation, and examination of micro-architectural events. This is achieved by harnessing the microprocessor's PMU and the debugging interface, specifically focusing on ensuring the repeatability of fault injections. The fault injection methodology targets bit-flipping within the memory system, affecting CPU registers and RAM. The outcomes of these fault injections enable a thorough analysis of the impact of soft errors and establish a robust correlation between the identified faults and the essential timing predictability demanded by SACRES.
Related papers
- Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems [8.869030580266799]
We introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement.
We propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement.
We present a novel, two-layer simulation-based analysis framework and a novel search for finding small tolerance violations.
arXiv Detail & Related papers (2024-06-24T18:33:45Z) - Analyzing Adversarial Inputs in Deep Reinforcement Learning [53.3760591018817]
We present a comprehensive analysis of the characterization of adversarial inputs, through the lens of formal verification.
We introduce a novel metric, the Adversarial Rate, to classify models based on their susceptibility to such perturbations.
Our analysis empirically demonstrates how adversarial inputs can affect the safety of a given DRL system with respect to such perturbations.
arXiv Detail & Related papers (2024-02-07T21:58:40Z) - Causal Disentanglement Hidden Markov Model for Fault Diagnosis [55.90917958154425]
We propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism.
Specifically, we make full use of the time-series data and progressively disentangle the vibration signal into fault-relevant and fault-irrelevant factors.
To expand the scope of the application, we adopt unsupervised domain adaptation to transfer the learned disentangled representations to other working environments.
arXiv Detail & Related papers (2023-08-06T05:58:45Z) - Forecasting Particle Accelerator Interruptions Using Logistic LASSO
Regression [62.997667081978825]
Unforeseen particle accelerator interruptions, also known as interlocks, lead to abrupt operational changes despite being necessary safety measures.
We propose a simple yet powerful binary classification model aiming to forecast such interruptions.
The model is formulated as logistic regression penalized by at least absolute shrinkage and selection operator.
arXiv Detail & Related papers (2023-03-15T23:11:30Z) - Deep Reinforcement Learning for Online Error Detection in Cyber-Physical
Systems [1.2074552857379273]
This paper proposes a new error detection approach based on Deep Reinforcement Learning (DRL)
The proposed approach can categorize different types of errors from normal data and predict whether the system will fail.
The evaluation results illustrate that the proposed approach has improved more than 2x in terms of accuracy and more than 5x in terms of inference time compared to other approaches.
arXiv Detail & Related papers (2023-02-03T06:28:54Z) - A Robust and Explainable Data-Driven Anomaly Detection Approach For
Power Electronics [56.86150790999639]
We present two anomaly detection and classification approaches, namely the Matrix Profile algorithm and anomaly transformer.
The Matrix Profile algorithm is shown to be well suited as a generalizable approach for detecting real-time anomalies in streaming time-series data.
A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
arXiv Detail & Related papers (2022-09-23T06:09:35Z) - Ranking-Based Physics-Informed Line Failure Detection in Power Grids [66.0797334582536]
Real-time and accurate detecting of potential line failures is the first step to mitigating the extreme weather impact and activating emergency controls.
Power balance equations nonlinearity, increased uncertainty in generation during extreme events, and lack of grid observability compromise the efficiency of traditional data-driven failure detection methods.
This paper proposes a Physics-InformEd Line failure Detector (FIELD) that leverages grid topology information to reduce sample and time complexities and improve localization accuracy.
arXiv Detail & Related papers (2022-08-31T18:19:25Z) - Fast and Accurate Error Simulation for CNNs against Soft Errors [64.54260986994163]
We present a framework for the reliability analysis of Conal Neural Networks (CNNs) via an error simulation engine.
These error models are defined based on the corruption patterns of the output of the CNN operators induced by faults.
We show that our methodology achieves about 99% accuracy of the fault effects w.r.t. SASSIFI, and a speedup ranging from 44x up to 63x w.r.t.FI, that only implements a limited set of error models.
arXiv Detail & Related papers (2022-06-04T19:45:02Z) - Monitoring and Diagnosability of Perception Systems [21.25149064251918]
We propose a mathematical model for runtime monitoring and fault detection and identification in perception systems.
We demonstrate our monitoring system, dubbed PerSyS, in realistic simulations using the LGSVL self-driving simulator and the Apollo Auto autonomy software stack.
arXiv Detail & Related papers (2020-11-11T23:03:14Z) - A Machine Learning Approach to Online Fault Classification in HPC
Systems [4.642153471124352]
We propose a fault classification method for HPC systems based on machine learning.
The novelty of our approach rests with the fact that it can be operated on streamed data in an online manner.
We introduce a high-level, easy-to-use fault injection tool called FINJ, with a focus on the management of complex experiments.
arXiv Detail & Related papers (2020-07-27T15:36:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.