Anomaly Detection Under Controlled Sensing Using Actor-Critic
Reinforcement Learning
- URL: http://arxiv.org/abs/2006.01044v1
- Date: Tue, 26 May 2020 22:53:17 GMT
- Title: Anomaly Detection Under Controlled Sensing Using Actor-Critic
Reinforcement Learning
- Authors: Geethu Joseph, M. Cenk Gursoy, Pramod K. Varshney
- Abstract summary: We consider the problem of detecting anomalies among a given set of processes using noisy binary sensor measurements.
The noiseless sensor measurement corresponding to a normal process is 0, and the measurement is 1 if the process is anomalous.
We propose a sequential sensor selection policy that dynamically determines which processes to observe at each time and when to terminate the detection algorithm.
- Score: 31.841289319809807
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of detecting anomalies among a given set of processes
using their noisy binary sensor measurements. The noiseless sensor measurement
corresponding to a normal process is 0, and the measurement is 1 if the process
is anomalous. The decision-making algorithm is assumed to have no knowledge of
the number of anomalous processes. The algorithm is allowed to choose a subset
of the sensors at each time instant until the confidence level on the decision
exceeds the desired value. Our objective is to design a sequential sensor
selection policy that dynamically determines which processes to observe at each
time and when to terminate the detection algorithm. The selection policy is
designed such that the anomalous processes are detected with the desired
confidence level while incurring minimum cost which comprises the delay in
detection and the cost of sensing. We cast this problem as a sequential
hypothesis testing problem within the framework of Markov decision processes,
and solve it using the actor-critic deep reinforcement learning algorithm. This
deep neural network-based algorithm offers a low complexity solution with good
detection accuracy. We also study the effect of statistical dependence between
the processes on the algorithm performance. Through numerical experiments, we
show that our algorithm is able to adapt to any unknown statistical dependence
pattern of the processes.
Related papers
- Bagged Regularized $k$-Distances for Anomaly Detection [9.899763598214122]
We propose a new distance-based algorithm called bagged regularized $k$-distances for anomaly detection (BRDAD)
Our BRDAD algorithm selects the weights by minimizing the surrogate risk, i.e., the finite sample bound of the empirical risk of the bagged weighted $k$-distances for density estimation (BWDDE)
On the theoretical side, we establish fast convergence rates of the AUC regret of our algorithm and demonstrate that the bagging technique significantly reduces the computational complexity.
arXiv Detail & Related papers (2023-12-02T07:00:46Z) - Anomaly Detection via Learning-Based Sequential Controlled Sensing [25.282033825977827]
We address the problem of detecting anomalies among a set of binary processes via learning-based controlled sensing.
To identify the anomalies, the decision-making agent is allowed to observe a subset of the processes at each time instant.
Our objective is to design a sequential selection policy that dynamically determines which processes to observe at each time.
arXiv Detail & Related papers (2023-11-30T07:49:33Z) - Scalable and Decentralized Algorithms for Anomaly Detection via
Learning-Based Controlled Sensing [40.14838268469627]
We develop an anomaly detection algorithm that chooses the processes to be observed at a given time instant.
The objective of the detection algorithm is to identify the anomalies with an accuracy exceeding the desired value.
arXiv Detail & Related papers (2021-12-08T11:20:36Z) - Machine Learning for Online Algorithm Selection under Censored Feedback [71.6879432974126]
In online algorithm selection (OAS), instances of an algorithmic problem class are presented to an agent one after another, and the agent has to quickly select a presumably best algorithm from a fixed set of candidate algorithms.
For decision problems such as satisfiability (SAT), quality typically refers to the algorithm's runtime.
In this work, we revisit multi-armed bandit algorithms for OAS and discuss their capability of dealing with the problem.
We adapt them towards runtime-oriented losses, allowing for partially censored data while keeping a space- and time-complexity independent of the time horizon.
arXiv Detail & Related papers (2021-09-13T18:10:52Z) - Anomaly Detection via Controlled Sensing and Deep Active Inference [43.07302992747749]
In this paper, we address the anomaly detection problem where the objective is to find the anomalous processes among a given set of processes.
We develop a sequential selection algorithm that decides which processes to be probed at every instant to detect the anomalies.
Our algorithm is based on active inference which is a general framework to make sequential decisions in order to maximize the notion of free energy.
arXiv Detail & Related papers (2021-05-12T17:54:02Z) - A Scalable Algorithm for Anomaly Detection via Learning-Based Controlled
Sensing [37.78306297797]
We develop an anomaly detection algorithm that chooses the process to be observed at a given time instant.
The objective of the detection algorithm is to arrive at a decision with an accuracy exceeding a desired value.
Unlike prior work on this topic that has exponential complexity in the number of processes, our algorithm has computational and memory requirements that are both in the number of processes.
arXiv Detail & Related papers (2021-05-12T17:46:01Z) - Optimal Sequential Detection of Signals with Unknown Appearance and
Disappearance Points in Time [64.26593350748401]
The paper addresses a sequential changepoint detection problem, assuming that the duration of change may be finite and unknown.
We focus on a reliable maximin change detection criterion of maximizing the minimal probability of detection in a given time (or space) window.
The FMA algorithm is applied to detecting faint streaks of satellites in optical images.
arXiv Detail & Related papers (2021-02-02T04:58:57Z) - Learned Block Iterative Shrinkage Thresholding Algorithm for
Photothermal Super Resolution Imaging [52.42007686600479]
We propose a learned block-sparse optimization approach using an iterative algorithm unfolded into a deep neural network.
We show the benefits of using a learned block iterative shrinkage thresholding algorithm that is able to learn the choice of regularization parameters.
arXiv Detail & Related papers (2020-12-07T09:27:16Z) - Run2Survive: A Decision-theoretic Approach to Algorithm Selection based
on Survival Analysis [75.64261155172856]
survival analysis (SA) naturally supports censored data and offers appropriate ways to use such data for learning distributional models of algorithm runtime.
We leverage such models as a basis of a sophisticated decision-theoretic approach to algorithm selection, which we dub Run2Survive.
In an extensive experimental study with the standard benchmark ASlib, our approach is shown to be highly competitive and in many cases even superior to state-of-the-art AS approaches.
arXiv Detail & Related papers (2020-07-06T15:20:17Z) - Active Model Estimation in Markov Decision Processes [108.46146218973189]
We study the problem of efficient exploration in order to learn an accurate model of an environment, modeled as a Markov decision process (MDP)
We show that our Markov-based algorithm outperforms both our original algorithm and the maximum entropy algorithm in the small sample regime.
arXiv Detail & Related papers (2020-03-06T16:17:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.