Temporally Layered Architecture for Adaptive, Distributed and Continuous
Control
- URL: http://arxiv.org/abs/2301.00723v1
- Date: Sun, 25 Dec 2022 08:46:22 GMT
- Title: Temporally Layered Architecture for Adaptive, Distributed and Continuous
Control
- Authors: Devdhar Patel, Joshua Russell, Francesca Walsh, Tauhidur Rahman,
Terrence Sejnowski, Hava Siegelmann
- Abstract summary: We present temporally layered architecture (TLA), a biologically inspired system for temporally adaptive distributed control.
TLA layers a fast and a slow controller together to achieve temporal abstraction that allows each layer to focus on a different time-scale.
Our design is biologically inspired and draws on the architecture of the human brain which executes actions at different timescales depending on the environment's demands.
- Score: 2.1700103865910503
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: We present temporally layered architecture (TLA), a biologically inspired
system for temporally adaptive distributed control. TLA layers a fast and a
slow controller together to achieve temporal abstraction that allows each layer
to focus on a different time-scale. Our design is biologically inspired and
draws on the architecture of the human brain which executes actions at
different timescales depending on the environment's demands. Such distributed
control design is widespread across biological systems because it increases
survivability and accuracy in certain and uncertain environments. We
demonstrate that TLA can provide many advantages over existing approaches,
including persistent exploration, adaptive control, explainable temporal
behavior, compute efficiency and distributed control. We present two different
algorithms for training TLA: (a) closed-loop control, where the fast controller
is trained over a pre-trained slow controller, allowing better exploration for
the fast controller; the fast controller then decides whether to "act-or-not"
at each timestep; and (b) partially open-loop control, where the slow
controller is trained over a pre-trained fast controller; the slow controller
either picks a temporally extended action or defers the next n actions to the
fast controller. We evaluate our method on a suite of continuous control tasks
and demonstrate the advantages of TLA over several strong baselines.
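The closed-loop "act-or-not" idea above can be sketched in a few lines. This is an illustrative stand-in, not the paper's implementation: `slow_policy` and `fast_gate` are hypothetical placeholders for what would be learned networks, and the gating rule here is a hand-written heuristic.

```python
import numpy as np

def slow_policy(obs):
    """Hypothetical pre-trained slow controller: a fixed linear map (placeholder)."""
    W = np.ones((1, obs.shape[0]))  # stand-in weights
    return W @ obs

def fast_gate(obs, prev_action):
    """Hypothetical fast controller: decides whether to act or repeat.
    Here: act only when the observation is 'large enough' (heuristic stand-in
    for a learned act-or-not gate)."""
    return np.linalg.norm(obs) > 0.5

def tla_rollout(observations):
    """Closed-loop TLA: at each timestep the fast gate chooses between
    recomputing an action (costly path) and repeating the last one (cheap path)."""
    actions, compute_steps = [], 0
    prev_action = np.zeros(1)
    for obs in observations:
        if fast_gate(obs, prev_action):
            prev_action = slow_policy(obs)  # recompute: slow controller fires
            compute_steps += 1
        # else: repeat the previous action, saving compute
        actions.append(prev_action.copy())
    return actions, compute_steps
```

The returned `compute_steps` count makes the compute-efficiency claim concrete: the slow controller only runs on the timesteps where the gate fires.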
Related papers
- Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution [51.83951489847344]
In robotics applications, smooth control signals are commonly preferred to reduce system wear and energy consumption.
In this work, we aim to bridge this performance gap by growing discrete action spaces from coarse to fine control resolution.
Our work indicates that adaptive control resolution, combined with value decomposition, produces simple critic-only algorithms with surprisingly strong performance on continuous control tasks.
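The coarse-to-fine idea can be illustrated with a toy 1-D discretization where each level roughly doubles the resolution while keeping every coarser action available. This is a hypothetical sketch of the growing-action-space principle, not the paper's actual scheme:

```python
import numpy as np

def action_grid(level, low=-1.0, high=1.0):
    """Discretize a 1-D continuous action range [low, high].
    Each level roughly doubles the resolution: 3, 5, 9, ... bins.
    Every coarse-level action remains present at finer levels, so a policy
    learned at a coarse resolution stays expressible after growing."""
    n = 2 ** (level + 1) + 1
    return np.linspace(low, high, n)
```

The nesting property (coarse grids embedded in fine ones) is what lets training start with easy, coarse exploration and refine control resolution later.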
arXiv Detail & Related papers (2024-04-05T17:58:37Z)
- Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control [112.66677641636299]
This paper presents a study on using deep reinforcement learning to create dynamic locomotion controllers for bipedal robots.
We develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing.
This work pushes the limits of agility for bipedal robots through extensive real-world experiments.
arXiv Detail & Related papers (2024-01-30T10:48:43Z)
- Learning to Fly in Seconds [7.259696592534715]
We show how curriculum learning and a highly optimized simulator improve sample efficiency and lead to fast training times.
Our framework enables Simulation-to-Reality (Sim2Real) transfer for direct control after only 18 seconds of training on a consumer-grade laptop.
arXiv Detail & Related papers (2023-11-22T01:06:45Z)
- Temporally Layered Architecture for Efficient Continuous Control [1.933681537640272]
We present a temporally layered architecture (TLA) for temporally adaptive control with minimal energy expenditure.
Our design draws on the energy-saving mechanism of the human brain, which executes actions at different timescales depending on the environment's demands.
arXiv Detail & Related papers (2023-05-30T02:59:06Z)
- Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking [52.44199258132215]
We present an open-source library of differentiable physics and robotics environments.
The library features several popular environments, including classical control settings from OpenAI Gym.
We give several use-cases of new scientific results obtained using the library.
arXiv Detail & Related papers (2021-02-19T15:06:47Z)
- Machine Learning for Mechanical Ventilation Control [52.65490904484772]
We consider the problem of controlling an invasive mechanical ventilator for pressure-controlled ventilation.
A PID controller must let air in and out of a sedated patient's lungs according to a trajectory of airway pressures specified by a clinician.
We show that our controllers are able to track target pressure waveforms significantly better than PID controllers.
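A minimal PID tracking loop makes the baseline concrete. This is a generic textbook sketch, not the paper's controller: the plant here is a toy first-order lag, not a lung model, and the gains are arbitrary illustrative values.

```python
def pid_track(targets, kp=2.0, ki=0.5, kd=0.1, dt=0.02):
    """Minimal PID loop tracking a target pressure trajectory.
    Plant: toy first-order lag p' = u - p (explicit Euler), not a lung model."""
    pressure, integral, prev_err = 0.0, 0.0, 0.0
    out = []
    for target in targets:
        err = target - pressure
        integral += err * dt                 # I term accumulates error
        deriv = (err - prev_err) / dt        # D term damps fast changes
        u = kp * err + ki * integral + kd * deriv
        pressure += dt * (u - pressure)      # plant step
        prev_err = err
        out.append(pressure)
    return out
```

The integral term drives steady-state error to zero on a constant target, which is the behavior the learned controllers in the paper are compared against on clinician-specified pressure waveforms.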
arXiv Detail & Related papers (2021-02-12T21:23:33Z)
- Regularizing Action Policies for Smooth Control with Reinforcement Learning [47.312768123967025]
Conditioning for Action Policy Smoothness (CAPS) is an effective yet intuitive regularization on action policies.
CAPS offers consistent improvement in the smoothness of the learned state-to-action mappings of neural network controllers.
Tested on a real system, improvements in controller smoothness on a quadrotor drone resulted in an almost 80% reduction in power consumption.
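The CAPS idea is a pair of penalties added to the usual policy loss: a temporal term (actions at consecutive states should be close) and a spatial term (actions at nearby, perturbed states should be close). A rough NumPy sketch, assuming a generic `policy` callable; the weights, norm choice, and noise scale here are illustrative, not the paper's exact formulation:

```python
import numpy as np

def caps_penalty(policy, s_t, s_next, lam_t=1.0, lam_s=1.0, noise=0.05, rng=None):
    """CAPS-style smoothness regularization, added to the RL objective.
    Temporal term: penalize action change between consecutive states.
    Spatial term: penalize action change under a small state perturbation."""
    rng = rng or np.random.default_rng(0)
    s_tilde = s_t + rng.normal(0.0, noise, size=s_t.shape)  # nearby state
    temporal = np.linalg.norm(policy(s_t) - policy(s_next))
    spatial = np.linalg.norm(policy(s_t) - policy(s_tilde))
    return lam_t * temporal + lam_s * spatial
```

A perfectly constant policy incurs zero penalty; a policy whose output jumps with the state pays proportionally, which is what pushes the learned state-to-action mapping toward smoothness.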
arXiv Detail & Related papers (2020-12-11T21:35:24Z)
- Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion [95.1825179206694]
We present a framework that synthesizes robust controllers for a quadruped robot.
A high-level controller learns to choose from a set of primitives in response to changes in the environment.
A low-level controller uses an established control method to robustly execute the primitives.
arXiv Detail & Related papers (2020-09-21T16:49:26Z)
- Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem [3.131740922192114]
We focus on the interpretability of DRL control methods.
In particular, we view linear fixed-structure controllers as shallow neural networks embedded in the actor-critic framework.
arXiv Detail & Related papers (2020-05-10T01:05:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the information presented and is not responsible for any consequences of its use.