Learning Time-optimized Path Tracking with or without Sensory Feedback
- URL: http://arxiv.org/abs/2203.01968v1
- Date: Thu, 3 Mar 2022 19:13:31 GMT
- Title: Learning Time-optimized Path Tracking with or without Sensory Feedback
- Authors: Jonas C. Kiemel, Torsten Kr\"oger
- Abstract summary: We present a learning-based approach that allows a robot to quickly follow a reference path defined in joint space.
The robot is controlled by a neural network that is trained via reinforcement learning using data generated by a physics simulator.
- Score: 5.254093731341154
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present a learning-based approach that allows a robot to
quickly follow a reference path defined in joint space without exceeding limits
on the position, velocity, acceleration and jerk of each robot joint. Contrary
to offline methods for time-optimal path parameterization, the reference path
can be changed during motion execution. In addition, our approach can utilize
sensory feedback, for instance, to follow a reference path with a bipedal robot
without losing balance. With our method, the robot is controlled by a neural
network that is trained via reinforcement learning using data generated by a
physics simulator. From a mathematical perspective, the problem of tracking a
reference path in a time-optimized manner is formalized as a Markov decision
process. Each state includes a fixed number of waypoints specifying the next
part of the reference path. The action space is designed in such a way that all
resulting motions comply with the specified kinematic joint limits. The reward
function finally reflects the trade-off between the execution time, the
deviation from the desired reference path and optional additional objectives
like balancing. We evaluate our approach with and without additional objectives
and show that time-optimized path tracking can be successfully learned for both
industrial and humanoid robots. In addition, we demonstrate that networks
trained in simulation can be successfully transferred to a real Kuka robot.
Related papers
- Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot [15.005962159112002]
Back-stepping Experience Replay (BER) is compatible with arbitrary off-policy reinforcement learning algorithms.
We present an application of BER in a model-free RL approach for the locomotion and navigation of a soft snake robot.
arXiv Detail & Related papers (2024-01-21T02:17:16Z) - Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory
Diffusion [83.88829943619656]
We introduce a method for generating realistic pedestrian trajectories and full-body animations that can be controlled to meet user-defined goals.
Our guided diffusion model allows users to constrain trajectories through target waypoints, speed, and specified social groups.
We propose utilizing the value function learned during RL training of the animation controller to guide diffusion to produce trajectories better suited for particular scenarios.
arXiv Detail & Related papers (2023-04-04T15:46:42Z) - Advanced Skills by Learning Locomotion and Local Navigation End-to-End [10.872193480485596]
In this work, we propose to solve the complete problem by training an end-to-end policy with deep reinforcement learning.
We demonstrate the successful deployment of policies on a real quadrupedal robot.
arXiv Detail & Related papers (2022-09-26T16:35:00Z) - Hierarchical Path-planning from Speech Instructions with Spatial Concept-based Topometric Semantic Mapping [7.332652485849632]
This study aims to realize a hierarchical spatial representation using a topometric semantic map and path planning with speech instructions, including waypoints.
We conducted experiments in home environments using the Toyota Human Support Robot on the SIGVerse simulator and in a lab-office environment with the real robot, Albert.
Navigation experiments using speech instructions with a waypoint demonstrated a performance improvement of SpCoTMHP over the baseline hierarchical path planning method with path costs.
arXiv Detail & Related papers (2022-03-21T09:15:25Z) - SABER: Data-Driven Motion Planner for Autonomously Navigating
Heterogeneous Robots [112.2491765424719]
We present an end-to-end online motion planning framework that uses a data-driven approach to navigate a heterogeneous robot team towards a global goal.
We use model predictive control (SMPC) to calculate control inputs that satisfy robot dynamics, and consider uncertainty during obstacle avoidance with chance constraints.
recurrent neural networks are used to provide a quick estimate of future state uncertainty considered in the SMPC finite-time horizon solution.
A Deep Q-learning agent is employed to serve as a high-level path planner, providing the SMPC with target positions that move the robots towards a desired global goal.
arXiv Detail & Related papers (2021-08-03T02:56:21Z) - OctoPath: An OcTree Based Self-Supervised Learning Approach to Local
Trajectory Planning for Mobile Robots [0.0]
We introduce OctoPath, which is an encoder-decoder deep neural network, trained in a self-supervised manner to predict the local optimal trajectory for the ego-vehicle.
During training, OctoPath minimizes the error between the predicted and the manually driven trajectories in a given training dataset.
We evaluate the predictions of OctoPath in different driving scenarios, both indoor and outdoor, while benchmarking our system against a baseline hybrid A-Star algorithm.
arXiv Detail & Related papers (2021-06-02T07:10:54Z) - Domain Adaptive Robotic Gesture Recognition with Unsupervised
Kinematic-Visual Data Alignment [60.31418655784291]
We propose a novel unsupervised domain adaptation framework which can simultaneously transfer multi-modality knowledge, i.e., both kinematic and visual data, from simulator to real robot.
It remedies the domain gap with enhanced transferable features by using temporal cues in videos, and inherent correlations in multi-modal towards recognizing gesture.
Results show that our approach recovers the performance with great improvement gains, up to 12.91% in ACC and 20.16% in F1score without using any annotations in real robot.
arXiv Detail & Related papers (2021-03-06T09:10:03Z) - Learning Collision-free and Torque-limited Robot Trajectories based on
Alternative Safe Behaviors [2.28438857884398]
A neural network is periodically invoked to predict future motions for industrial robots.
Compliance with kinematic joint limits is ensured by the design of the action space.
Experiments with a real robot demonstrate that safe trajectories can be generated in real-time.
arXiv Detail & Related papers (2021-03-05T16:50:57Z) - Online Body Schema Adaptation through Cost-Sensitive Active Learning [63.84207660737483]
The work was implemented in a simulation environment, using the 7DoF arm of the iCub robot simulator.
A cost-sensitive active learning approach is used to select optimal joint configurations.
The results show cost-sensitive active learning has similar accuracy to the standard active learning approach, while reducing in about half the executed movement.
arXiv Detail & Related papers (2021-01-26T16:01:02Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - Thinking While Moving: Deep Reinforcement Learning with Concurrent
Control [122.49572467292293]
We study reinforcement learning in settings where sampling an action from the policy must be done concurrently with the time evolution of the controlled system.
Much like a person or an animal, the robot must think and move at the same time, deciding on its next action before the previous one has completed.
arXiv Detail & Related papers (2020-04-13T17:49:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.