Learning a Single Near-hover Position Controller for Vastly Different
Quadcopters
- URL: http://arxiv.org/abs/2209.09232v2
- Date: Tue, 2 May 2023 21:48:51 GMT
- Title: Learning a Single Near-hover Position Controller for Vastly Different
Quadcopters
- Authors: Dingqi Zhang, Antonio Loquercio, Xiangyu Wu, Ashish Kumar, Jitendra
Malik, Mark W. Mueller
- Abstract summary: This paper proposes an adaptive near-hover position controller for quadcopters.
It can be deployed to quadcopters of very different mass, size and motor constants.
It also shows rapid adaptation to unknown disturbances during runtime.
- Score: 56.37274861303324
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper proposes an adaptive near-hover position controller for
quadcopters, which can be deployed to quadcopters of very different mass, size
and motor constants, and also shows rapid adaptation to unknown disturbances
during runtime. The core algorithmic idea is to learn a single policy that can
adapt online at test time not only to the disturbances applied to the drone,
but also to the robot dynamics and hardware in the same framework. We achieve
this by training a neural network to estimate a latent representation of the
robot and environment parameters, which is used to condition the behaviour of
the controller, also represented as a neural network. We train both networks
exclusively in simulation with the goal of flying the quadcopters to goal
positions and avoiding crashes to the ground. We directly deploy the same
controller trained in the simulation without any modifications on two
quadcopters in the real world with differences in mass, size, motors, and
propellers with mass differing by 4.5 times. In addition, we show rapid
adaptation to sudden and large disturbances up to one-third of the mass of the
quadcopters. We perform an extensive evaluation in both simulation and the
physical world, where we outperform a state-of-the-art learning-based adaptive
controller and a traditional PID controller specifically tuned to each platform
individually. Video results can be found at https://youtu.be/U-c-LbTfvoA.
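The two-network structure described in the abstract (one network estimates a latent representation of the robot and environment, a second network uses it to condition the controller) can be illustrated with a minimal sketch. This is not the authors' implementation: the dimensions, the untrained random weights, the use of a recent state-action history as the estimator's input, and conditioning by simple concatenation are all assumptions made for illustration, and the names `estimate_latent` and `act` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_params(sizes):
    """Random weights for a small fully connected network (illustrative only)."""
    return [(rng.normal(0.0, 0.1, (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp(params, x):
    """Forward pass: tanh on hidden layers, linear output layer."""
    for i, (w, b) in enumerate(params):
        x = x @ w + b
        if i < len(params) - 1:
            x = np.tanh(x)
    return x

# Hypothetical dimensions, not taken from the paper.
STATE_DIM, ACTION_DIM, LATENT_DIM, HISTORY_LEN = 12, 4, 8, 20

# Estimator network (assumed input: recent state-action history) -> latent z.
adapt_params = mlp_params([HISTORY_LEN * (STATE_DIM + ACTION_DIM), 64, LATENT_DIM])
# Controller network: (current state, z) -> motor commands.
policy_params = mlp_params([STATE_DIM + LATENT_DIM, 64, ACTION_DIM])

def estimate_latent(history):
    """history has shape (HISTORY_LEN, STATE_DIM + ACTION_DIM)."""
    return mlp(adapt_params, history.reshape(-1))

def act(state, z):
    """Condition the controller on z by concatenating it with the state."""
    return mlp(policy_params, np.concatenate([state, z]))

# A few control steps with a placeholder state (no simulator involved).
history = np.zeros((HISTORY_LEN, STATE_DIM + ACTION_DIM))
state = rng.normal(size=STATE_DIM)
for _ in range(3):
    z = estimate_latent(history)           # re-estimated online at every step
    action = act(state, z)                 # same controller weights for any vehicle
    history = np.roll(history, -1, axis=0) # drop the oldest entry
    history[-1] = np.concatenate([state, action])
```

In this sketch the controller weights never change between vehicles; only `z` does, which is how a single policy could in principle serve quadcopters with different mass, size and motors.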
Related papers
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z)
- BiRoDiff: Diffusion policies for bipedal robot locomotion on unseen terrains [0.9480364746270075]
Locomotion on unknown terrains is essential for bipedal robots to handle novel real-world challenges.
We introduce a lightweight framework that learns a single walking controller that yields locomotion on multiple terrains.
arXiv Detail & Related papers (2024-07-07T16:03:33Z)
- From Flies to Robots: Inverted Landing in Small Quadcopters with Dynamic Perching [15.57055572401334]
Inverted landing is a routine behavior among a number of animal fliers.
We develop a control policy general to arbitrary ceiling-approach conditions.
We successfully achieved a range of robust inverted-landing behaviors in small quadcopters.
arXiv Detail & Related papers (2024-02-29T21:09:08Z)
- Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control [106.32794844077534]
This paper presents a study on using deep reinforcement learning to create dynamic locomotion controllers for bipedal robots.
We develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing.
This work pushes the limits of agility for bipedal robots through extensive real-world experiments.
arXiv Detail & Related papers (2024-01-30T10:48:43Z)
- Improving Wind Resistance Performance of Cascaded PID Controlled Quadcopters using Residual Reinforcement Learning [0.0]
Wind resistance control is essential for quadcopters to maintain their position and avoid deviating from the target position.
We propose a residual reinforcement learning-based approach to building a wind resistance controller for a quadcopter.
Our controller reduces the position deviation by approximately 50% compared to the quadcopter controlled with the conventional cascaded PID controller.
arXiv Detail & Related papers (2023-08-03T09:29:19Z)
- GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots [87.32145104894754]
We introduce a framework for training generalized locomotion (GenLoco) controllers for quadrupedal robots.
Our framework synthesizes general-purpose locomotion controllers that can be deployed on a large variety of quadrupedal robots.
We show that our models acquire more general control strategies that can be directly transferred to novel simulated and real-world robots.
arXiv Detail & Related papers (2022-09-12T15:14:32Z)
- Adapting Rapid Motor Adaptation for Bipedal Robots [73.5914982741483]
We leverage recent advances in rapid adaptation for locomotion control, and extend them to work on bipedal robots.
A-RMA adapts the base policy for the imperfect extrinsics estimator by finetuning it using model-free RL.
We demonstrate that A-RMA outperforms a number of RL-based baseline controllers and model-based controllers in simulation.
arXiv Detail & Related papers (2022-05-30T17:59:09Z)
- Learning multiple gaits of quadruped robot using hierarchical reinforcement learning [9.60618440185329]
We propose a hierarchical controller for a quadruped robot that can generate multiple gaits while tracking a velocity command.
Experimental results show (1) the existence of an optimal gait for a specific velocity range and (2) the efficiency of our hierarchical controller compared to a controller composed of a single policy.
arXiv Detail & Related papers (2021-12-09T07:45:25Z)
- Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion [95.1825179206694]
We present a framework that synthesizes robust controllers for a quadruped robot.
A high-level controller learns to choose from a set of primitives in response to changes in the environment.
A low-level controller utilizes an established control method to robustly execute the primitives.
arXiv Detail & Related papers (2020-09-21T16:49:26Z)
- Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors [1.0057838324294686]
We present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities.
The control policy of this robot is learned via policy transfer from the learned controller of the quadcopter.
The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation.
arXiv Detail & Related papers (2020-07-15T16:17:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.