Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy
Approach
- URL: http://arxiv.org/abs/2010.16342v2
- Date: Tue, 10 Nov 2020 12:29:13 GMT
- Title: Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy
Approach
- Authors: Kartik Paigwar, Lokesh Krishna, Sashank Tirumala, Naman Khetan, Aditya
Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur,
Shishir Kolathaya
- Abstract summary: We use a linear policy for realizing end-foot trajectories in the quadruped robot, Stoch $2$.
In particular, the parameters of the end-foot trajectories are shaped via a linear feedback policy that takes the torso orientation and the terrain slope as inputs.
The resulting walking is robust to terrain slope variations and external pushes.
- Score: 3.752600874088677
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, with a view toward fast deployment of locomotion gaits in
low-cost hardware, we use a linear policy for realizing end-foot trajectories
in the quadruped robot, Stoch $2$. In particular, the parameters of the
end-foot trajectories are shaped via a linear feedback policy that takes the
torso orientation and the terrain slope as inputs. The corresponding desired
joint angles are obtained via an inverse kinematics solver and tracked via a
PID control law. Augmented Random Search, a model-free and a gradient-free
learning algorithm is used to train this linear policy. Simulation results show
that the resulting walking is robust to terrain slope variations and external
pushes. This methodology is not only computationally light-weight but also uses
minimal sensing and actuation capabilities in the robot, thereby justifying the
approach.
Related papers
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - Predictive Linear Online Tracking for Unknown Targets [5.047136039782827]
We study the problem of online tracking in linear control systems, where the objective is to follow a moving target.
We propose a new algorithm, called predictive linear online tracking (PLOT)
We implement PLOT on a real quadrotor and provide open-source software, thus, showcasing one of the first successful applications of online control methods on real hardware.
arXiv Detail & Related papers (2024-02-15T15:59:59Z) - DATT: Deep Adaptive Trajectory Tracking for Quadrotor Control [62.24301794794304]
Deep Adaptive Trajectory Tracking (DATT) is a learning-based approach that can precisely track arbitrary, potentially infeasible trajectories in the presence of large disturbances in the real world.
DATT significantly outperforms competitive adaptive nonlinear and model predictive controllers for both feasible smooth and infeasible trajectories in unsteady wind fields.
It can efficiently run online with an inference time less than 3.2 ms, less than 1/4 of the adaptive nonlinear model predictive control baseline.
arXiv Detail & Related papers (2023-10-13T12:22:31Z) - DDPEN: Trajectory Optimisation With Sub Goal Generation Model [70.36888514074022]
In this paper, we produce a novel Differential Dynamic Programming with Escape Network (DDPEN)
We propose to utilize a deep model that takes as an input map of the environment in the form of a costmap together with the desired position.
The model produces possible future directions that will lead to the goal, avoiding local minima which is possible to run in real time conditions.
arXiv Detail & Related papers (2023-01-18T11:02:06Z) - Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value
Functions [65.84090965167535]
We present Neural Motion Fields, a novel object representation which encodes both object point clouds and the relative task trajectories as an implicit value function parameterized by a neural network.
This object-centric representation models a continuous distribution over the SE(3) space and allows us to perform grasping reactively by leveraging sampling-based MPC to optimize this value function.
arXiv Detail & Related papers (2022-06-29T18:47:05Z) - VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait
Representation [78.92147339883137]
We show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait.
We demonstrate that specific properties of the drive signal map directly to gait parameters such as cadence, footstep height and full stance duration.
The use of a generative model facilitates the detection and mitigation of disturbances to provide a versatile and robust planning framework.
arXiv Detail & Related papers (2022-05-02T19:49:53Z) - Learning Linear Policies for Robust Bipedal Locomotion on Terrains with
Varying Slopes [5.737287537823072]
We learn this policy via a model-free and a gradient-free learning algorithm, Augmented Random Search (ARS), in the two robot platforms Rabbit and Digit.
We demonstrate additional behaviors like walking backwards, stepping-in-place, and recovery from external pushes of up to 120 N.
The end result is a robust and a fast feedback control law for bipedal walking on terrains with varying slopes.
arXiv Detail & Related papers (2021-04-04T18:50:58Z) - Autonomous Drone Racing with Deep Reinforcement Learning [39.757652701917166]
In many robotic tasks, such as drone racing, the goal is to travel through a set of waypoints as fast as possible.
A key challenge is planning the minimum-time trajectory, which is typically solved by assuming perfect knowledge of the waypoints to pass in advance.
In this work, a new approach to minimum-time trajectory generation for quadrotors is presented.
arXiv Detail & Related papers (2021-03-15T18:05:49Z) - RLOC: Terrain-Aware Legged Locomotion using Reinforcement Learning and
Optimal Control [6.669503016190925]
We present a unified model-based and data-driven approach for quadrupedal planning and control.
We map sensory information and desired base velocity commands into footstep plans using a reinforcement learning policy.
We train and evaluate our framework on a complex quadrupedal system, ANYmal B, and demonstrate transferability to a larger and heavier robot, ANYmal C, without requiring retraining.
arXiv Detail & Related papers (2020-12-05T18:30:23Z) - Cycloidal Trajectory Realization on Staircase based on Neural Network
Temporal Quantized Lagrange Dynamics (NNTQLD) with Ant Colony Optimization
for a 9-Link Bipedal Robot [1.7205106391379026]
In this paper, a novel optimal technique for joint angles trajectory tracking control with energy optimization is proposed.
For the task of climbing stairs by a 9-link biped model, a cycloid trajectory for swing phase is proposed in such a way that the cycloid variables depend on the staircase dimensions.
arXiv Detail & Related papers (2020-12-02T05:50:42Z) - Path Planning Followed by Kinodynamic Smoothing for Multirotor Aerial
Vehicles (MAVs) [61.94975011711275]
We propose a geometrically based motion planning technique textquotedblleft RRT*textquotedblright; for this purpose.
In the proposed technique, we modified original RRT* introducing an adaptive search space and a steering function.
We have tested the proposed technique in various simulated environments.
arXiv Detail & Related papers (2020-08-29T09:55:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.