Robustifying Reinforcement Learning Agents via Action Space Adversarial
  Training
        - URL: http://arxiv.org/abs/2007.07176v1
- Date: Tue, 14 Jul 2020 16:50:02 GMT
- Title: Robustifying Reinforcement Learning Agents via Action Space Adversarial
  Training
- Authors: Kai Liang Tan, Yasaman Esfandiari, Xian Yeow Lee, Aakanksha, Soumik
  Sarkar
- Abstract summary: Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are becoming prevalent in various sectors of modern society.
Recent studies in deep reinforcement learning (DRL) have demonstrated its benefits in a large variety of data-driven decisions and control applications.
We show that a well-performing DRL agent that is initially susceptible to action space perturbations can be robustified against similar perturbations through adversarial training.
- Score: 23.284452331353894
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are
becoming prevalent in various sectors of modern society such as transportation,
industrial, and power grids. Recent studies in deep reinforcement learning
(DRL) have demonstrated its benefits in a large variety of data-driven
decisions and control applications. As reliance on ML-enabled systems grows, it
is imperative to study the performance of these systems under malicious state
and actuator attacks. Traditional control systems employ
resilient/fault-tolerant controllers that counter these attacks by correcting
the system via error observations. However, in some applications, a resilient
controller may not be sufficient to avoid a catastrophic failure. Ideally, a
robust approach is more useful in these scenarios where a system is inherently
robust (by design) to adversarial attacks. While robust control has a long
history of development, robust ML is an emerging research area that has already
demonstrated its relevance and urgency. However, the majority of robust ML
research has focused on perception tasks and not on decision and control tasks,
although the ML (specifically RL) models used for control applications are
equally vulnerable to adversarial attacks. In this paper, we show that a
well-performing DRL agent that is initially susceptible to action space
perturbations (e.g. actuator attacks) can be robustified against similar
perturbations through adversarial training.
 
      
        Related papers
        - Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering   Target Atoms [71.85633762642125]
 The vast number of parameters in models often results in highly intertwined internal representations.<n>Recent research has explored the use of sparse autoencoders (SAE) to disentangle knowledge in high-dimensional spaces for steering.<n>We propose Steering Target Atoms (STA), a novel method that isolates and manipulates disentangled knowledge components to enhance safety.
 arXiv  Detail & Related papers  (2025-05-23T17:59:18Z)
- A Novel Bifurcation Method for Observation Perturbation Attacks on   Reinforcement Learning Agents: Load Altering Attacks on a Cyber Physical   Power System [1.7887848708497243]
 This work proposes a novel attack technique for continuous control using Group Difference Logits loss with a bifurcation layer.
We demonstrate the impacts of powerful gradient-based attacks in a realistic smart energy environment.
 arXiv  Detail & Related papers  (2024-07-06T20:55:24Z)
- Rethinking Robustness Assessment: Adversarial Attacks on Learning-based   Quadrupedal Locomotion Controllers [33.50779001548997]
 Legged locomotion has recently achieved remarkable success with the progress of machine learning techniques.
We propose a computational method that leverages sequential adversarial attacks to identify weaknesses in learned locomotion controllers.
Our research demonstrates that, even state-of-the-art robust controllers can fail significantly under well-designed, low-magnitude adversarial sequence.
 arXiv  Detail & Related papers  (2024-05-21T00:26:11Z)
- Analyzing Adversarial Inputs in Deep Reinforcement Learning [53.3760591018817]
 We present a comprehensive analysis of the characterization of adversarial inputs, through the lens of formal verification.
We introduce a novel metric, the Adversarial Rate, to classify models based on their susceptibility to such perturbations.
Our analysis empirically demonstrates how adversarial inputs can affect the safety of a given DRL system with respect to such perturbations.
 arXiv  Detail & Related papers  (2024-02-07T21:58:40Z)
- Vulnerability of Machine Learning Approaches Applied in IoT-based Smart   Grid: A Review [51.31851488650698]
 Machine learning (ML) sees an increasing prevalence of being used in the internet-of-things (IoT)-based smart grid.
 adversarial distortion injected into the power signal will greatly affect the system's normal control and operation.
It is imperative to conduct vulnerability assessment for MLsgAPPs applied in the context of safety-critical power systems.
 arXiv  Detail & Related papers  (2023-08-30T03:29:26Z)
- Improving Robustness of Reinforcement Learning for Power System Control
  with Adversarial Training [71.7750435554693]
 We show that several state-of-the-art RL agents proposed for power system control are vulnerable to adversarial attacks.
Specifically, we use an adversary Markov Decision Process to learn an attack policy, and demonstrate the potency of our attack.
We propose to use adversarial training to increase the robustness of RL agent against attacks and avoid infeasible operational decisions.
 arXiv  Detail & Related papers  (2021-10-18T00:50:34Z)
- A Practical Adversarial Attack on Contingency Detection of Smart Energy
  Systems [0.0]
 We propose an innovative adversarial attack model that can practically compromise dynamical controls of energy system.
We also optimize the deployment of the proposed adversarial attack model by employing deep reinforcement learning (RL) techniques.
 arXiv  Detail & Related papers  (2021-09-13T23:11:56Z)
- An RL-Based Adaptive Detection Strategy to Secure Cyber-Physical Systems [0.0]
 Increased dependence on software based control has escalated the vulnerabilities of Cyber Physical Systems.
We propose a Reinforcement Learning (RL) based framework which adaptively sets the parameters of such detectors based on experience learned from attack scenarios.
 arXiv  Detail & Related papers  (2021-03-04T07:38:50Z)
- Query-based Targeted Action-Space Adversarial Policies on Deep
  Reinforcement Learning Agents [23.580682320064714]
 This work investigates targeted attacks in the action-space domain, also commonly known as actuation attacks in CPS literature.
We show that a query-based black-box attack model that generates optimal perturbations with respect to an adversarial goal can be formulated as another reinforcement learning problem.
 Experimental results showed that adversarial policies that only observe the nominal policy's output generate stronger attacks than adversarial policies that observe the nominal policy's input and output.
 arXiv  Detail & Related papers  (2020-11-13T20:25:48Z)
- Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations [88.94162416324505]
 A deep reinforcement learning (DRL) agent observes its states through observations, which may contain natural measurement errors or adversarial noises.
Since the observations deviate from the true states, they can mislead the agent into making suboptimal actions.
We show that naively applying existing techniques on improving robustness for classification tasks, like adversarial training, is ineffective for many RL tasks.
 arXiv  Detail & Related papers  (2020-03-19T17:59:59Z)
- Enhanced Adversarial Strategically-Timed Attacks against Deep
  Reinforcement Learning [91.13113161754022]
 We introduce timing-based adversarial strategies against a DRL-based navigation system by jamming in physical noise patterns on the selected time frames.
Our experimental results show that the adversarial timing attacks can lead to a significant performance drop.
 arXiv  Detail & Related papers  (2020-02-20T21:39:25Z)
- Challenges and Countermeasures for Adversarial Attacks on Deep
  Reinforcement Learning [48.49658986576776]
 Deep Reinforcement Learning (DRL) has numerous applications in the real world thanks to its outstanding ability in adapting to the surrounding environments.
Despite its great advantages, DRL is susceptible to adversarial attacks, which precludes its use in real-life critical systems and applications.
This paper presents emerging attacks in DRL-based systems and the potential countermeasures to defend against these attacks.
 arXiv  Detail & Related papers  (2020-01-27T10:53:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.