A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with
Human Operators in the Loop: A Case Study in Water Supply Networks
- URL: http://arxiv.org/abs/2006.03899v1
- Date: Sat, 6 Jun 2020 15:51:52 GMT
- Title: A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with
Human Operators in the Loop: A Case Study in Water Supply Networks
- Authors: Maria Grammatopoulou, Aris Kanellopoulos, Kyriakos G.~Vamvoudakis,
Nathan Lau
- Abstract summary: We consider the problem of recommending resilient and predictive actions for an IoT network in the presence of faulty components.
We use anonymized data from Arlington County, Virginia, to compute predictive and resilient scheduling policies for a smart water supply system.
- Score: 4.9550706407171585
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of recommending resilient and predictive actions for
an IoT network in the presence of faulty components, considering the presence
of human operators manipulating the information of the environment the agent
sees for containment purposes. The IoT network is formulated as a directed
graph with a known topology whose objective is to maintain a constant and
resilient flow between a source and a destination node. The optimal route
through this network is evaluated via a predictive and resilient Q-learning
algorithm which takes into account historical data about irregular operation,
due to faults, as well as the feedback from the human operators that are
considered to have extra information about the status of the network concerning
locations likely to be targeted by attacks. To showcase our method, we utilize
anonymized data from Arlington County, Virginia, to compute predictive and
resilient scheduling policies for a smart water supply system, while avoiding
(i) all the locations indicated to be attacked according to human operators
(ii) as many as possible neighborhoods detected to have leaks or other faults.
This method incorporates both the adaptability of the human and the computation
capability of the machine to achieve optimal implementation containment and
recovery actions in water distribution.
Related papers
- Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks [94.2860766709971]
We address the challenge of sampling and remote estimation for autoregressive Markovian processes in a wireless network with statistically-identical agents.
Our goal is to minimize time-average estimation error and/or age of information with decentralized scalable sampling and transmission policies.
arXiv Detail & Related papers (2024-04-04T06:24:11Z) - Privacy-Preserving Distributed Learning for Residential Short-Term Load
Forecasting [11.185176107646956]
Power system load data can inadvertently reveal the daily routines of residential users, posing a risk to their property security.
We introduce a Markovian Switching-based distributed training framework, the convergence of which is substantiated through rigorous theoretical analysis.
Case studies employing real-world power system load data validate the efficacy of our proposed algorithm.
arXiv Detail & Related papers (2024-02-02T16:39:08Z) - Efficient Network Representation for GNN-based Intrusion Detection [2.321323878201932]
The last decades have seen a growth in the number of cyber-attacks with severe economic and privacy damages.
We propose a novel network representation as a graph of flows that aims to provide relevant topological information for the intrusion detection task.
We present a Graph Neural Network (GNN) based framework responsible for exploiting the proposed graph structure.
arXiv Detail & Related papers (2023-09-11T16:10:12Z) - Implicit Occupancy Flow Fields for Perception and Prediction in
Self-Driving [68.95178518732965]
A self-driving vehicle (SDV) must be able to perceive its surroundings and predict the future behavior of other traffic participants.
Existing works either perform object detection followed by trajectory of the detected objects, or predict dense occupancy and flow grids for the whole scene.
This motivates our unified approach to perception and future prediction that implicitly represents occupancy and flow over time with a single neural network.
arXiv Detail & Related papers (2023-08-02T23:39:24Z) - Compressed Regression over Adaptive Networks [58.79251288443156]
We derive the performance achievable by a network of distributed agents that solve, adaptively and in the presence of communication constraints, a regression problem.
We devise an optimized allocation strategy where the parameters necessary for the optimization can be learned online by the agents.
arXiv Detail & Related papers (2023-04-07T13:41:08Z) - Adversarial training with informed data selection [53.19381941131439]
Adrial training is the most efficient solution to defend the network against these malicious attacks.
This work proposes a data selection strategy to be applied in the mini-batch training.
The simulation results show that a good compromise can be obtained regarding robustness and standard accuracy.
arXiv Detail & Related papers (2023-01-07T12:09:50Z) - A Reinforcement Learning Approach for Scheduling in mmWave Networks [24.46948464551684]
We consider a source that wishes to communicate with a destination at a desired rate, over a mmWave network.
To achieve resilience to link and node failures, we here explore a state-of-the-art Soft Actor-Critic (SAC) deep reinforcement learning algorithm.
arXiv Detail & Related papers (2021-08-01T21:47:47Z) - Resilient Identification of Distribution Network Topology [0.0]
This paper develops a network TI function that relies only on the measurements available to DERMS.
The propounded method is able to identify the network switching configuration, as well as the status of protective devices.
Having a low computational burden, this approach is fast-track and can be applied in real-time applications.
arXiv Detail & Related papers (2020-11-16T14:23:56Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - ARC-Net: Activity Recognition Through Capsules [0.0]
We introduce ARC-Net and propose the utilization of capsules to fuse the information from multiple inertial measurement units (IMUs) to predict the activity performed by the subject.
We provide heatmaps of the priors, learned by the network, to visualize the utilization of each of the data sources by the trained network.
arXiv Detail & Related papers (2020-07-06T21:03:04Z) - Forgetting Outside the Box: Scrubbing Deep Networks of Information
Accessible from Input-Output Observations [143.3053365553897]
We describe a procedure for removing dependency on a cohort of training data from a trained deep network.
We introduce a new bound on how much information can be extracted per query about the forgotten cohort.
We exploit the connections between the activation and weight dynamics of a DNN inspired by Neural Tangent Kernels to compute the information in the activations.
arXiv Detail & Related papers (2020-03-05T23:17:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.