Imitation Learning for Autonomous Trajectory Learning of Robot Arms in
Space
- URL: http://arxiv.org/abs/2008.04007v1
- Date: Mon, 10 Aug 2020 10:18:04 GMT
- Title: Imitation Learning for Autonomous Trajectory Learning of Robot Arms in
Space
- Authors: RB Ashith Shyam, Zhou Hao, Umberto Montanaro, Gerhard Neumann
- Abstract summary: Concept of programming by demonstration or imitation learning is used for trajectory planning of manipulators mounted on small spacecraft.
For greater autonomy in future space missions and minimal human intervention through ground control, a robot arm having 7-Degrees of Freedom (DoF) is envisaged for carrying out multiple tasks like debris removal, on-orbit servicing and assembly.
- Score: 13.64392246529041
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work adds on to the on-going efforts to provide more autonomy to space
robots. Here the concept of programming by demonstration or imitation learning
is used for trajectory planning of manipulators mounted on small spacecraft.
For greater autonomy in future space missions and minimal human intervention
through ground control, a robot arm having 7-Degrees of Freedom (DoF) is
envisaged for carrying out multiple tasks like debris removal, on-orbit
servicing and assembly. Since actual hardware implementation of microgravity
environment is extremely expensive, the demonstration data for trajectory
learning is generated using a model predictive controller (MPC) in a physics
based simulator. The data is then encoded compactly by Probabilistic Movement
Primitives (ProMPs). This offline trajectory learning allows faster
reproductions and also avoids any computationally expensive optimizations after
deployment in a space environment. It is shown that the probabilistic
distribution can be used to generate trajectories to previously unseen
situations by conditioning the distribution. The motion of the robot (or
manipulator) arm induces reaction forces on the spacecraft hub and hence its
attitude changes prompting the Attitude Determination and Control System (ADCS)
to take large corrective action that drains energy out of the system. By having
a robot arm with redundant DoF helps in finding several possible trajectories
from the same start to the same target. This allows the ProMP trajectory
generator to sample out the trajectory which is obstacle free as well as having
minimal attitudinal disturbances thereby reducing the load on ADCS.
Related papers
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory
Diffusion [83.88829943619656]
We introduce a method for generating realistic pedestrian trajectories and full-body animations that can be controlled to meet user-defined goals.
Our guided diffusion model allows users to constrain trajectories through target waypoints, speed, and specified social groups.
We propose utilizing the value function learned during RL training of the animation controller to guide diffusion to produce trajectories better suited for particular scenarios.
arXiv Detail & Related papers (2023-04-04T15:46:42Z) - Modular Safety-Critical Control of Legged Robots [0.0]
Safety concerns during the operation of legged robots must be addressed to enable their widespread use.
This study presents a modular safety filter to improve the safety of a legged robot, i.e., reduce the chance of a fall.
arXiv Detail & Related papers (2023-03-04T11:36:21Z) - Obstacle Avoidance for Robotic Manipulator in Joint Space via Improved
Proximal Policy Optimization [6.067589886362815]
In this paper, we train a deep neural network via an improved Proximal Policy Optimization (PPO) algorithm to map from task space to joint space for a 6-DoF manipulator.
Since training such a task in real-robot is time-consuming and strenuous, we develop a simulation environment to train the model.
Experimental results showed that using our method, the robot was capable of tracking a single target or reaching multiple targets in unstructured environments.
arXiv Detail & Related papers (2022-10-03T10:21:57Z) - Autonomous Aerial Robot for High-Speed Search and Intercept Applications [86.72321289033562]
A fully-autonomous aerial robot for high-speed object grasping has been proposed.
As an additional sub-task, our system is able to autonomously pierce balloons located in poles close to the surface.
Our approach has been validated in a challenging international competition and has shown outstanding results.
arXiv Detail & Related papers (2021-12-10T11:49:51Z) - Nonprehensile Riemannian Motion Predictive Control [57.295751294224765]
We introduce a novel Real-to-Sim reward analysis technique to reliably imagine and predict the outcome of taking possible actions for a real robotic platform.
We produce a closed-loop controller to reactively push objects in a continuous action space.
We observe that RMPC is robust in cluttered as well as occluded environments and outperforms the baselines.
arXiv Detail & Related papers (2021-11-15T18:50:04Z) - OSCAR: Data-Driven Operational Space Control for Adaptive and Robust
Robot Manipulation [50.59541802645156]
Operational Space Control (OSC) has been used as an effective task-space controller for manipulation.
We propose OSC for Adaptation and Robustness (OSCAR), a data-driven variant of OSC that compensates for modeling errors.
We evaluate our method on a variety of simulated manipulation problems, and find substantial improvements over an array of controller baselines.
arXiv Detail & Related papers (2021-10-02T01:21:38Z) - SABER: Data-Driven Motion Planner for Autonomously Navigating
Heterogeneous Robots [112.2491765424719]
We present an end-to-end online motion planning framework that uses a data-driven approach to navigate a heterogeneous robot team towards a global goal.
We use model predictive control (SMPC) to calculate control inputs that satisfy robot dynamics, and consider uncertainty during obstacle avoidance with chance constraints.
recurrent neural networks are used to provide a quick estimate of future state uncertainty considered in the SMPC finite-time horizon solution.
A Deep Q-learning agent is employed to serve as a high-level path planner, providing the SMPC with target positions that move the robots towards a desired global goal.
arXiv Detail & Related papers (2021-08-03T02:56:21Z) - DikpolaSat Mission: Improvement of Space Flight Performance and Optimal
Control Using Trained Deep Neural Network -- Trajectory Controller for Space
Objects Collision Avoidance [0.0]
This paper shows how the controller demonstration is carried out by having the spacecraft follow a desired path.
The obstacle avoidance algorithm is built into the control features to respond spontaneously using inputs from the neural network.
Multiple algorithms for optimizing flight controls and fuel consumption can be implemented using knowledge of flight dynamics in trajectory.
arXiv Detail & Related papers (2021-05-30T23:35:13Z) - Learning Constrained Distributions of Robot Configurations with
Generative Adversarial Network [15.962033896896385]
In high dimensional robotic system, the manifold of the valid configuration space often has a complex shape.
We propose a generative adversarial network approach to learn the distribution of valid robot configurations under such constraints.
arXiv Detail & Related papers (2020-11-11T11:43:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.