PAMMELA: Policy Administration Methodology using Machine Learning
- URL: http://arxiv.org/abs/2111.07060v1
- Date: Sat, 13 Nov 2021 07:05:22 GMT
- Title: PAMMELA: Policy Administration Methodology using Machine Learning
- Authors: Varun Gumma, Barsha Mitra, Soumyadeep Dey, Pratik Shashikantbhai
Patel, Sourabh Suman, Saptarshi Das
- Abstract summary: PAMMELA is a policy administration methodology using Machine Learning.
It generates a new policy by learning the rules of a policy currently enforced in a similar organization.
For policy augmentation, PAMMELA can infer new rules based on the knowledge gathered from the existing rules.
- Score: 1.1744028458220428
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, Attribute-Based Access Control (ABAC) has become quite
popular and effective for enforcing access control in dynamic and collaborative
environments. Implementation of ABAC requires the creation of a set of
attribute-based rules which cumulatively form a policy. Designing an ABAC
policy ab initio demands a substantial amount of effort from the system
administrator. Moreover, organizational changes may necessitate the inclusion
of new rules in an already deployed policy. In such a case, re-mining the
entire ABAC policy will require a considerable amount of time and
administrative effort. Instead, it is better to incrementally augment the
policy. Keeping these aspects of reducing administrative overhead in mind, in
this paper, we propose PAMMELA, a Policy Administration Methodology using
Machine Learning to help system administrators in creating new ABAC policies as
well as augmenting existing ones. PAMMELA can generate a new policy for an
organization by learning the rules of a policy currently enforced in a similar
organization. For policy augmentation, PAMMELA can infer new rules based on the
knowledge gathered from the existing rules. Experimental results show that our
proposed approach provides a reasonably good performance in terms of the
various machine learning evaluation metrics as well as execution time.
Related papers
- Quantifying Policy Administration Cost in an Active Learning Framework [4.106460421493345]
This paper proposes a computational model for policy administration.
A well-designed access control model must anticipate such changes so that the administration cost does not become prohibitive when the organization scales up.
arXiv Detail & Related papers (2023-12-29T22:12:53Z) - IOB: Integrating Optimization Transfer and Behavior Transfer for
Multi-Policy Reuse [50.90781542323258]
Reinforcement learning (RL) agents can transfer knowledge from source policies to a related target task.
Previous methods introduce additional components, such as hierarchical policies or estimations of source policies' value functions.
We propose a novel transfer RL method that selects the source policy without training extra components.
arXiv Detail & Related papers (2023-08-14T09:22:35Z) - Diverse Policy Optimization for Structured Action Space [59.361076277997704]
We propose Diverse Policy Optimization (DPO) to model the policies in structured action space as the energy-based models (EBM)
A novel and powerful generative model, GFlowNet, is introduced as the efficient, diverse EBM-based policy sampler.
Experiments on ATSC and Battle benchmarks demonstrate that DPO can efficiently discover surprisingly diverse policies.
arXiv Detail & Related papers (2023-02-23T10:48:09Z) - Constructing a Good Behavior Basis for Transfer using Generalized Policy
Updates [63.58053355357644]
We study the problem of learning a good set of policies, so that when combined together, they can solve a wide variety of unseen reinforcement learning tasks.
We show theoretically that having access to a specific set of diverse policies, which we call a set of independent policies, can allow for instantaneously achieving high-level performance.
arXiv Detail & Related papers (2021-12-30T12:20:46Z) - Cautious Policy Programming: Exploiting KL Regularization in Monotonic
Policy Improvement for Reinforcement Learning [11.82492300303637]
We propose a novel value-based reinforcement learning (RL) algorithm that can ensure monotonic policy improvement during learning.
We demonstrate that the proposed algorithm can trade o? performance and stability in both didactic classic control problems and challenging high-dimensional Atari games.
arXiv Detail & Related papers (2021-07-13T01:03:10Z) - Goal-Conditioned Reinforcement Learning with Imagined Subgoals [89.67840168694259]
We propose to incorporate imagined subgoals into policy learning to facilitate learning of complex tasks.
Imagined subgoals are predicted by a separate high-level policy, which is trained simultaneously with the policy and its critic.
We evaluate our approach on complex robotic navigation and manipulation tasks and show that it outperforms existing methods by a large margin.
arXiv Detail & Related papers (2021-07-01T15:30:59Z) - Adaptive ABAC Policy Learning: A Reinforcement Learning Approach [2.5997274006052544]
We propose an adaptive ABAC policy learning approach to automate the authorization management task.
In particular, we propose a contextual bandit system, in which an authorization engine adapts an ABAC model through a feedback control loop.
We focus on developing an adaptive ABAC policy learning model for a home IoT environment as a running example.
arXiv Detail & Related papers (2021-05-18T15:18:02Z) - Imitation Learning from MPC for Quadrupedal Multi-Gait Control [63.617157490920505]
We present a learning algorithm for training a single policy that imitates multiple gaits of a walking robot.
We use and extend MPC-Net, which is an Imitation Learning approach guided by Model Predictive Control.
We validate our approach on hardware and show that a single learned policy can replace its teacher to control multiple gaits.
arXiv Detail & Related papers (2021-03-26T08:48:53Z) - An Automatic Attribute Based Access Control Policy Extraction from
Access Logs [5.142415132534397]
An attribute-based access control (ABAC) model provides a more flexible approach for addressing the authorization needs of complex and dynamic systems.
We present a methodology for automatically learning ABAC policy rules from access logs of a system to simplify the policy development process.
arXiv Detail & Related papers (2020-03-16T15:08:54Z) - BRPO: Batch Residual Policy Optimization [79.53696635382592]
In batch reinforcement learning, one often constrains a learned policy to be close to the behavior (data-generating) policy.
We propose residual policies, where the allowable deviation of the learned policy is state-action-dependent.
We derive a new for RL method, BRPO, which learns both the policy and allowable deviation that jointly maximize a lower bound on policy performance.
arXiv Detail & Related papers (2020-02-08T01:59:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.