End-to-End Reinforcement Learning for Torque Based Variable Height
Hopping
- URL: http://arxiv.org/abs/2307.16676v2
- Date: Mon, 18 Dec 2023 19:02:34 GMT
- Title: End-to-End Reinforcement Learning for Torque Based Variable Height
Hopping
- Authors: Raghav Soni, Daniel Harnack, Hauke Isermann, Sotaro Fushimi, Shivesh
Kumar, Frank Kirchner
- Abstract summary: Legged locomotion is arguably the most suited and versatile mode to deal with natural or unstructured terrains.
In this paper, we present a end-to-end RL based torque controller that learns to implicitly detect the relevant jump phases.
We also extend a method for simulation to reality transfer of the learned controller to contact rich dynamic tasks, resulting in successful deployment on the robot.
- Score: 5.34772724436823
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Legged locomotion is arguably the most suited and versatile mode to deal with
natural or unstructured terrains. Intensive research into dynamic walking and
running controllers has recently yielded great advances, both in the optimal
control and reinforcement learning (RL) literature. Hopping is a challenging
dynamic task involving a flight phase and has the potential to increase the
traversability of legged robots. Model based control for hopping typically
relies on accurate detection of different jump phases, such as lift-off or
touch down, and using different controllers for each phase. In this paper, we
present a end-to-end RL based torque controller that learns to implicitly
detect the relevant jump phases, removing the need to provide manual heuristics
for state detection. We also extend a method for simulation to reality transfer
of the learned controller to contact rich dynamic tasks, resulting in
successful deployment on the robot after training without parameter tuning.
Related papers
- Agile and versatile bipedal robot tracking control through reinforcement learning [12.831810518025309]
This paper proposes a versatile controller for bipedal robots.
It achieves ankle and body trajectory tracking across a wide range of gaits using a single small-scale neural network.
Highly flexible gait control can be achieved by combining minimal control units with high-level policy.
arXiv Detail & Related papers (2024-04-12T05:25:03Z) - Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control [106.32794844077534]
This paper presents a study on using deep reinforcement learning to create dynamic locomotion controllers for bipedal robots.
We develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing.
This work pushes the limits of agility for bipedal robots through extensive real-world experiments.
arXiv Detail & Related papers (2024-01-30T10:48:43Z) - Learning and Adapting Agile Locomotion Skills by Transferring Experience [71.8926510772552]
We propose a framework for training complex robotic skills by transferring experience from existing controllers to jumpstart learning new tasks.
We show that our method enables learning complex agile jumping behaviors, navigating to goal locations while walking on hind legs, and adapting to new environments.
arXiv Detail & Related papers (2023-04-19T17:37:54Z) - Skip Training for Multi-Agent Reinforcement Learning Controller for
Industrial Wave Energy Converters [94.84709449845352]
Recent Wave Energy Converters (WEC) are equipped with multiple legs and generators to maximize energy generation.
Traditional controllers have shown limitations to capture complex wave patterns and the controllers must efficiently maximize the energy capture.
This paper introduces a Multi-Agent Reinforcement Learning controller (MARL), which outperforms the traditionally used spring damper controller.
arXiv Detail & Related papers (2022-09-13T00:20:31Z) - VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait
Representation [78.92147339883137]
We show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait.
We demonstrate that specific properties of the drive signal map directly to gait parameters such as cadence, footstep height and full stance duration.
The use of a generative model facilitates the detection and mitigation of disturbances to provide a versatile and robust planning framework.
arXiv Detail & Related papers (2022-05-02T19:49:53Z) - Learning Torque Control for Quadrupedal Locomotion [22.415419916292187]
This paper introduces a learning torque control framework for quadrupedal locomotion.
It trains an RL policy that directly predicts joint torques at a high frequency, thus circumventing the use of PD controllers.
To our knowledge, this is the first attempt of learning torque control for quadrupedal locomotion with an end-to-end single neural network.
arXiv Detail & Related papers (2022-03-10T07:09:05Z) - Reinforcement Learning for Robust Parameterized Locomotion Control of
Bipedal Robots [121.42930679076574]
We present a model-free reinforcement learning framework for training robust locomotion policies in simulation.
domain randomization is used to encourage the policies to learn behaviors that are robust across variations in system dynamics.
We demonstrate this on versatile walking behaviors such as tracking a target walking velocity, walking height, and turning yaw.
arXiv Detail & Related papers (2021-03-26T07:14:01Z) - ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for
Mobile Manipulation [99.2543521972137]
ReLMoGen is a framework that combines a learned policy to predict subgoals and a motion generator to plan and execute the motion needed to reach these subgoals.
Our method is benchmarked on a diverse set of seven robotics tasks in photo-realistic simulation environments.
ReLMoGen shows outstanding transferability between different motion generators at test time, indicating a great potential to transfer to real robots.
arXiv Detail & Related papers (2020-08-18T08:05:15Z) - Learning to Fly via Deep Model-Based Reinforcement Learning [37.37420200406336]
We learn a thrust-attitude controller for a quadrotor through model-based reinforcement learning.
We show that "learning to fly" can be achieved with less than 30 minutes of experience with a single drone.
arXiv Detail & Related papers (2020-03-19T15:55:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.