Related papers: One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control

URL: http://arxiv.org/abs/2007.04976v1
Date: Thu, 9 Jul 2020 17:59:35 GMT
Title: One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Authors: Wenlong Huang, Igor Mordatch, Deepak Pathak
Abstract summary: We investigate whether there exists a single global policy that can generalize to control a wide variety of agent morphologies. We propose to express this global policy as a collection of identical modular neural networks. We show that a single modular policy can successfully generate locomotion behaviors for several planar agents.
Score: 47.78262874364569
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Reinforcement learning is typically concerned with learning control policies tailored to a particular agent. We investigate whether there exists a single global policy that can generalize to control a wide variety of agent morphologies -- ones in which even dimensionality of state and action spaces changes. We propose to express this global policy as a collection of identical modular neural networks, dubbed as Shared Modular Policies (SMP), that correspond to each of the agent's actuators. Every module is only responsible for controlling its corresponding actuator and receives information from only its local sensors. In addition, messages are passed between modules, propagating information between distant modules. We show that a single modular policy can successfully generate locomotion behaviors for several planar agents with different skeletal structures such as monopod hoppers, quadrupeds, bipeds, and generalize to variants not seen during training -- a process that would normally require training and manual hyperparameter tuning for each morphology. We observe that a wide variety of drastically diverse locomotion styles across morphologies as well as centralized coordination emerges via message passing between decentralized modules purely from the reinforcement learning objective. Videos and code at https://huangwl18.github.io/modular-rl/
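
The architecture the abstract describes (one set of shared weights reused by every module, local observations only, messages passed between neighbouring modules) can be illustrated with a minimal sketch. This is not the authors' implementation: the tree encoding via a `parents` list, the sizes `OBS` and `MSG`, the single-layer tanh modules, the random untrained weights, and the simplification that every node (including the root) emits one action are all assumptions made for illustration.

```python
import math
import random

OBS = 3  # per-limb local observation size (assumed for illustration)
MSG = 4  # message vector size (assumed for illustration)


def make_linear(n_in, n_out, rng):
    """Random weight matrix with n_out rows and n_in columns."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]


def apply_linear(weights, x):
    """One tanh layer: tanh(W x), so every output lies in [-1, 1]."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]


class SharedModule:
    """A single set of weights reused by every limb of every morphology."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        # Bottom-up: local observation + summed child messages -> message to parent.
        self.up = make_linear(OBS + MSG, MSG, rng)
        # Top-down: own bottom-up message + parent message -> action + message to children.
        self.down = make_linear(MSG + MSG, 1 + MSG, rng)


def act(module, parents, observations):
    """Compute one action per limb.

    parents[i] is the parent index of limb i (-1 for the root).
    Assumes parents[i] < i, so reversed index order visits children first.
    """
    n = len(parents)
    children = [[] for _ in range(n)]
    for i, p in enumerate(parents):
        if p >= 0:
            children[p].append(i)

    # Bottom-up pass: leaves to root.
    up_msgs = [None] * n
    for i in reversed(range(n)):
        agg = [0.0] * MSG
        for c in children[i]:
            agg = [a + b for a, b in zip(agg, up_msgs[c])]
        up_msgs[i] = apply_linear(module.up, list(observations[i]) + agg)

    # Top-down pass: root to leaves, emitting actions along the way.
    actions = [0.0] * n
    down_msgs = [None] * n
    for i in range(n):
        incoming = down_msgs[parents[i]] if parents[i] >= 0 else [0.0] * MSG
        out = apply_linear(module.down, up_msgs[i] + incoming)
        actions[i] = out[0]
        down_msgs[i] = out[1:]
    return actions


# The same module drives two hypothetical morphologies of different size.
module = SharedModule(seed=0)
chain = [-1, 0, 1]        # three limbs in a chain (hopper-like)
star = [-1, 0, 0, 0, 0]   # torso with four legs (quadruped-like)
chain_actions = act(module, chain, [[0.1] * OBS for _ in chain])
star_actions = act(module, star, [[0.1] * OBS for _ in star])
```

Because the weights live in one `SharedModule` and the number of actions follows the length of `parents`, the same policy object handles state and action spaces of different dimensionality, which is the property the abstract highlights. Training with a reinforcement learning objective is outside the scope of this sketch.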

Related papers

Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation [59.37775534633868]
We present an extremely straightforward approach to transferring pre-trained, task-specific PEFT modules between same-family PLMs. We also propose a method that allows the transfer of modules between incompatible PLMs without any change in the inference complexity.
arXiv Detail & Related papers (2024-03-27T17:50:00Z)
FedYolo: Augmenting Federated Learning with Pretrained Transformers [61.56476056444933]
In this work, we investigate pretrained transformers (PTF) to achieve on-device learning goals. We show that larger scale shrinks the accuracy gaps between alternative approaches and improves robustness. Finally, it enables clients to solve multiple unrelated tasks simultaneously using a single PTF.
arXiv Detail & Related papers (2023-07-10T21:08:52Z)
Modular Deep Learning [120.36599591042908]
Transfer learning has recently become the dominant paradigm of machine learning. It remains unclear how to develop models that specialise towards multiple tasks without incurring negative interference. Modular deep learning has emerged as a promising solution to these challenges.
arXiv Detail & Related papers (2023-02-22T18:11:25Z)
DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body [126.52031472297413]
We introduce DMAP, a biologically-inspired, attention-based policy network architecture. We show that a control policy based on the proprioceptive state performs poorly with highly variable body configurations. DMAP can be trained end-to-end in all the considered environments, overall matching or surpassing the performance of an oracle agent.
arXiv Detail & Related papers (2022-09-28T16:45:35Z)
Behavior Trees in Robot Control Systems [0.0]
The key idea underlying behavior trees is to make use of modularity, hierarchies, and feedback. A hierarchy of such modules is natural, since robot tasks can often be decomposed into a hierarchy of sub-tasks. Feedback control is a fundamental tool for handling uncertainties and disturbances in any low-level control system.
arXiv Detail & Related papers (2022-03-24T14:16:15Z)
Learning Modular Robot Control Policies [10.503109190599828]
We construct a modular control policy that handles a broad class of designs. As the modules are physically re-configured, the policy automatically re-configures to match the kinematic structure. We show that the policy can then generalize to a larger set of designs not seen during training.
arXiv Detail & Related papers (2021-05-20T21:54:37Z)
Neural Function Modules with Sparse Arguments: A Dynamic Approach to Integrating Information across Layers [84.57980167400513]
Neural Function Modules (NFM) aims to introduce the same structural capability into deep learning. Most of the work in the context of feed-forward networks combining top-down and bottom-up feedback is limited to classification problems. The key contribution of our work is to combine attention, sparsity, top-down and bottom-up feedback, in a flexible algorithm.
arXiv Detail & Related papers (2020-10-15T20:43:17Z)
Adapting to Unseen Environments through Explicit Representation of Context [16.8615211682877]
In order to deploy autonomous agents to domains such as autonomous driving, infrastructure management, health care, and finance, they must be able to adapt safely to unseen situations. This paper proposes a principled approach where a context module is coevolved with a skill module. The Context+Skill approach leads to significantly more robust behavior in environments with previously unseen effects.
arXiv Detail & Related papers (2020-02-13T17:15:47Z)

This list is automatically generated from the titles and abstracts of the papers in this site.