Learning to Get Up
- URL: http://arxiv.org/abs/2205.00307v1
- Date: Sat, 30 Apr 2022 17:12:30 GMT
- Title: Learning to Get Up
- Authors: Tianxin Tao, Matthew Wilson, Ruiyu Gou, Michiel van de Panne
- Abstract summary: Getting up from a fallen state is a basic human skill.
Existing methods for learning this skill generate highly dynamic and erratic get-up motions.
We present a staged approach using reinforcement learning, without recourse to motion capture data.
- Score: 5.887969742827488
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Getting up from an arbitrary fallen state is a basic human skill. Existing
methods for learning this skill often generate highly dynamic and erratic
get-up motions, which do not resemble human get-up strategies, or are based on
tracking recorded human get-up motions. In this paper, we present a staged
approach using reinforcement learning, without recourse to motion capture data.
The method first takes advantage of a strong character model, which facilitates
the discovery of solution modes. A second stage then learns to adapt the
control policy to work with progressively weaker versions of the character.
Finally, a third stage learns control policies that can reproduce the weaker
get-up motions at much slower speeds. We show that across multiple runs, the
method can discover a diverse variety of get-up strategies, and execute them at
a variety of speeds. The results usually produce policies that use a final
stand-up strategy that is common to the recovery motions seen from all initial
states. However, we also find policies for which different strategies are seen
for prone and supine initial fallen states. The learned get-up control
strategies often have significant static stability, i.e., they can be paused at
a variety of points during the get-up motion. We further test our method on
novel constrained scenarios, such as having a leg and an arm in a cast.
Related papers
- ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos [81.99559944822752]
We propose ViViDex to improve vision-based policy learning from human videos.
It first uses reinforcement learning with trajectory guided rewards to train state-based policies for each video.
We then rollout successful episodes from state-based policies and train a unified visual policy without using any privileged information.
arXiv Detail & Related papers (2024-04-24T07:58:28Z) - Strategy Extraction in Single-Agent Games [0.19336815376402716]
We propose an approach to knowledge transfer using behavioural strategies as a form of transferable knowledge influenced by the human cognitive ability to develop strategies.
We show that our method can identify plausible strategies in three environments: Pacman, Bank Heist and a dungeon-crawling video game.
arXiv Detail & Related papers (2023-05-22T01:28:59Z) - Robust Imitation of a Few Demonstrations with a Backwards Model [3.8530020696501794]
Behavior cloning of expert demonstrations can speed up learning optimal policies in a more sample-efficient way than reinforcement learning.
We tackle this issue by extending the region of attraction around the demonstrations so that the agent can learn how to get back onto the demonstrated trajectories if it veers off-course.
With optimal or near-optimal demonstrations, the learned policy will be both optimal and robust to deviations, with a wider region of attraction.
arXiv Detail & Related papers (2022-10-17T18:02:19Z) - Discovering Diverse Athletic Jumping Strategies [8.231687569030898]
We present a framework that enables the discovery of diverse and natural-looking motion strategies for athletic skills such as the high jump.
The combination of physics simulation and deep reinforcement learning provides a suitable starting point for automatic control policy training.
arXiv Detail & Related papers (2021-05-02T01:37:16Z) - Augmenting Policy Learning with Routines Discovered from a Demonstration [86.9307760606403]
We propose routine-augmented policy learning (RAPL)
RAPL discovers routines composed of primitive actions from a single demonstration.
We show that RAPL improves the state-of-the-art imitation learning method SQIL and reinforcement learning method A2C.
arXiv Detail & Related papers (2020-12-23T03:15:21Z) - Learning Object Manipulation Skills via Approximate State Estimation
from Real Videos [47.958512470724926]
Humans are adept at learning new tasks by watching a few instructional videos.
On the other hand, robots that learn new actions either require a lot of effort through trial and error, or use expert demonstrations that are challenging to obtain.
In this paper, we explore a method that facilitates learning object manipulation skills directly from videos.
arXiv Detail & Related papers (2020-11-13T08:53:47Z) - Learning Dexterous Grasping with Object-Centric Visual Affordances [86.49357517864937]
Dexterous robotic hands are appealing for their agility and human-like morphology.
We introduce an approach for learning dexterous grasping.
Our key idea is to embed an object-centric visual affordance model within a deep reinforcement learning loop.
arXiv Detail & Related papers (2020-09-03T04:00:40Z) - State-Only Imitation Learning for Dexterous Manipulation [63.03621861920732]
In this paper, we explore state-only imitation learning.
We train an inverse dynamics model and use it to predict actions for state-only demonstrations.
Our method performs on par with state-action approaches and considerably outperforms RL alone.
arXiv Detail & Related papers (2020-04-07T17:57:20Z) - Human Motion Transfer from Poses in the Wild [61.6016458288803]
We tackle the problem of human motion transfer, where we synthesize novel motion video for a target person that imitates the movement from a reference video.
It is a video-to-video translation task in which the estimated poses are used to bridge two domains.
We introduce a novel pose-to-video translation framework for generating high-quality videos that are temporally coherent even for in-the-wild pose sequences unseen during training.
arXiv Detail & Related papers (2020-04-07T05:59:53Z) - Towards Learning to Imitate from a Single Video Demonstration [11.15358253586118]
We develop a reinforcement learning agent that can learn to imitate given video observation.
We use a Siamese recurrent neural network architecture to learn rewards in space and time between motion clips.
We demonstrate our approach on simulated humanoid, dog, and raptor agents in 2D and a quadruped and a humanoid in 3D.
arXiv Detail & Related papers (2019-01-22T06:46:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.