Efficient Learning of Control Policies for Robust Quadruped Bounding
using Pretrained Neural Networks
- URL: http://arxiv.org/abs/2011.00446v3
- Date: Sun, 29 Oct 2023 14:18:20 GMT
- Title: Efficient Learning of Control Policies for Robust Quadruped Bounding
using Pretrained Neural Networks
- Authors: Zhicheng Wang, Anqiao Li, Yixiao Zheng, Anhuan Xie, Zhibin Li, Jun Wu,
Qiuguo Zhu
- Abstract summary: Bounding is one of the important gaits in quadrupedal locomotion for negotiating obstacles.
The authors proposed an effective approach that can learn robust bounding gaits more efficiently.
The authors approach shows efficient computing and good locomotion results by the Jueying Mini quadrupedal robot bounding over uneven terrain.
- Score: 15.09037992110481
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Bounding is one of the important gaits in quadrupedal locomotion for
negotiating obstacles. The authors proposed an effective approach that can
learn robust bounding gaits more efficiently despite its large variation in
dynamic body movements. The authors first pretrained the neural network (NN)
based on data from a robot operated by conventional model based controllers,
and then further optimised the pretrained NN via deep reinforcement learning
(DRL). In particular, the authors designed a reward function considering
contact points and phases to enforce the gait symmetry and periodicity, which
improved the bounding performance. The NN based feedback controller was learned
in the simulation and directly deployed on the real quadruped robot Jueying
Mini successfully. A variety of environments are presented both indoors and
outdoors with the authors approach. The authors approach shows efficient
computing and good locomotion results by the Jueying Mini quadrupedal robot
bounding over uneven terrain.
Related papers
- Offline Adaptation of Quadruped Locomotion using Diffusion Models [59.882275766745295]
We present a diffusion-based approach to quadrupedal locomotion that simultaneously addresses the limitations of learning and interpolating between multiple skills.
We show that these capabilities are compatible with a multi-skill policy and can be applied with little modification and minimal compute overhead.
We verify the validity of our approach with hardware experiments on the ANYmal quadruped platform.
arXiv Detail & Related papers (2024-11-13T18:12:15Z) - Distributed Robust Learning based Formation Control of Mobile Robots based on Bioinspired Neural Dynamics [14.149584412213269]
We first introduce a distributed estimator using a variable structure and cascaded design technique, eliminating the need for derivative information to improve the real time performance.
Then, a kinematic tracking control method is developed utilizing a bioinspired neural dynamic-based approach aimed at providing smooth control inputs and effectively resolving the speed jump issue.
To address the challenges for robots operating with completely unknown dynamics and disturbances, a learning-based robust dynamic controller is developed.
arXiv Detail & Related papers (2024-03-23T04:36:12Z) - Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for
Autonomous Real-World Reinforcement Learning [58.3994826169858]
We introduce RoboFuME, a reset-free fine-tuning system for robotic reinforcement learning.
Our insights are to utilize offline reinforcement learning techniques to ensure efficient online fine-tuning of a pre-trained policy.
Our method can incorporate data from an existing robot dataset and improve on a target task within as little as 3 hours of autonomous real-world experience.
arXiv Detail & Related papers (2023-10-23T17:50:08Z) - Value function estimation using conditional diffusion models for control [62.27184818047923]
We propose a simple algorithm called Diffused Value Function (DVF)
It learns a joint multi-step model of the environment-robot interaction dynamics using a diffusion model.
We show how DVF can be used to efficiently capture the state visitation measure for multiple controllers.
arXiv Detail & Related papers (2023-06-09T18:40:55Z) - Learning Bipedal Walking for Humanoids with Current Feedback [5.429166905724048]
We present an approach for overcoming the sim2real gap issue for humanoid robots arising from inaccurate torque-tracking at the actuator level.
Our approach successfully trains a unified, end-to-end policy in simulation that can be deployed on a real HRP-5P humanoid robot to achieve bipedal locomotion.
arXiv Detail & Related papers (2023-03-07T08:16:46Z) - Learning to Exploit Elastic Actuators for Quadruped Locomotion [7.9585932082270014]
Spring-based actuators in legged locomotion provide energy-efficiency and improved performance, but increase the difficulty of controller design.
We propose to learn model-free controllers directly on the real robot.
We evaluate the proposed approach on the DLR elastic quadruped bert.
arXiv Detail & Related papers (2022-09-15T09:43:17Z) - Complex Locomotion Skill Learning via Differentiable Physics [30.868690308658174]
Differentiable physics enables efficient-based optimizations of neural network (NN) controllers.
We present a practical learning framework that outputs unified NN controllers capable of tasks with significantly improved complexity and diversity.
arXiv Detail & Related papers (2022-06-06T04:01:12Z) - Gradient-Based Trajectory Optimization With Learned Dynamics [80.41791191022139]
We use machine learning techniques to learn a differentiable dynamics model of the system from data.
We show that a neural network can model highly nonlinear behaviors accurately for large time horizons.
In our hardware experiments, we demonstrate that our learned model can represent complex dynamics for both the Spot and Radio-controlled (RC) car.
arXiv Detail & Related papers (2022-04-09T22:07:34Z) - Model Predictive Control for Fluid Human-to-Robot Handovers [50.72520769938633]
Planning motions that take human comfort into account is not a part of the human-robot handover process.
We propose to generate smooth motions via an efficient model-predictive control framework.
We conduct human-to-robot handover experiments on a diverse set of objects with several users.
arXiv Detail & Related papers (2022-03-31T23:08:20Z) - Reinforcement Learning with Evolutionary Trajectory Generator: A General
Approach for Quadrupedal Locomotion [29.853927354893656]
We propose a novel RL-based approach that contains an evolutionary foot trajectory generator.
The generator continually optimize the shape of the output trajectory for the given task, providing diversified motion priors to guide the policy learning.
We deploy the controller learned in the simulation on a 12-DoF quadrupedal robot, and it can successfully traverse challenging scenarios with efficient gaits.
arXiv Detail & Related papers (2021-09-14T02:51:50Z) - First Steps: Latent-Space Control with Semantic Constraints for
Quadruped Locomotion [73.37945453998134]
Traditional approaches to quadruped control employ simplified, hand-derived models.
This significantly reduces the capability of the robot since its effective kinematic range is curtailed.
In this work, these challenges are addressed by framing quadruped control as optimisation in a structured latent space.
A deep generative model captures a statistical representation of feasible joint configurations, whilst complex dynamic and terminal constraints are expressed via high-level, semantic indicators.
We validate the feasibility of locomotion trajectories optimised using our approach both in simulation and on a real-worldmal quadruped.
arXiv Detail & Related papers (2020-07-03T07:04:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.