Discovering Diverse Athletic Jumping Strategies
- URL: http://arxiv.org/abs/2105.00371v1
- Date: Sun, 2 May 2021 01:37:16 GMT
- Title: Discovering Diverse Athletic Jumping Strategies
- Authors: Zhiqi Yin, Zeshi Yang, Michiel van de Panne, KangKang Yin
- Abstract summary: We present a framework that enables the discovery of diverse and natural-looking motion strategies for athletic skills such as the high jump.
The combination of physics simulation and deep reinforcement learning provides a suitable starting point for automatic control policy training.
- Score: 8.231687569030898
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a framework that enables the discovery of diverse and
natural-looking motion strategies for athletic skills such as the high jump.
The strategies are realized as control policies for physics-based characters.
Given a task objective and an initial character configuration, the combination
of physics simulation and deep reinforcement learning (DRL) provides a suitable
starting point for automatic control policy training. To facilitate the
learning of realistic human motions, we propose a Pose Variational Autoencoder
(P-VAE) to constrain the actions to a subspace of natural poses. In contrast to
motion imitation methods, a rich variety of novel strategies can naturally
emerge by exploring initial character states through a sample-efficient
Bayesian diversity search (BDS) algorithm. A second stage of optimization that
encourages novel policies can further enrich the unique strategies discovered.
Our method allows for the discovery of diverse and novel strategies for
athletic jumping motions such as high jumps and obstacle jumps with no motion
examples and less reward engineering than prior work.
Related papers
- Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching [77.28042137892943]
We present Perceptive Humanoid Parkour (PHP), a modular framework that enables humanoid robots to autonomously perform long-horizon, vision-based parkour.<n>We train motion-tracking reinforcement learning expert policies for these composed motions, and distill them into a single depth-based, multi-skill student policy.<n>We validate our framework with extensive real-world experiments on a Unitree G1 humanoid robot.
arXiv Detail & Related papers (2026-02-17T18:59:11Z) - Strategy and Skill Learning for Physics-based Table Tennis Animation [8.51262627906337]
We present a strategy and skill learning approach for physics-based table tennis animation.
Our method addresses the issue of mode collapse, where the characters do not fully utilize the motor skills they need to perform to execute complex tasks.
arXiv Detail & Related papers (2024-07-23T06:31:13Z) - Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning [1.9253333342733674]
We investigate whether reinforcement learning can provide insights into biological systems when trained to perform chemotaxis.
We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners' training fails.
We find that RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the environment.
arXiv Detail & Related papers (2024-04-02T14:42:52Z) - AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents [58.807802111818994]
We propose AnySkill, a novel hierarchical method that learns physically plausible interactions following open-vocabulary instructions.
Our approach begins by developing a set of atomic actions via a low-level controller trained via imitation learning.
An important feature of our method is the use of image-based rewards for the high-level policy, which allows the agent to learn interactions with objects without manual reward engineering.
arXiv Detail & Related papers (2024-03-19T15:41:39Z) - Adaptive Tracking of a Single-Rigid-Body Character in Various
Environments [2.048226951354646]
We propose a deep reinforcement learning method based on the simulation of a single-rigid-body character.
Using the centroidal dynamics model (CDM) to express the full-body character as a single rigid body (SRB) and training a policy to track a reference motion, we can obtain a policy capable of adapting to various unobserved environmental changes.
We demonstrate that our policy, efficiently trained within 30 minutes on an ultraportable laptop, has the ability to cope with environments that have not been experienced during learning.
arXiv Detail & Related papers (2023-08-14T22:58:54Z) - Robust and Versatile Bipedal Jumping Control through Reinforcement
Learning [141.56016556936865]
This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled bipedal robot to perform robust and versatile dynamic jumps in the real world.
We present a reinforcement learning framework for training a robot to accomplish a large variety of jumping tasks, such as jumping to different locations and directions.
We develop a new policy structure that encodes the robot's long-term input/output (I/O) history while also providing direct access to a short-term I/O history.
arXiv Detail & Related papers (2023-02-19T01:06:09Z) - Learning to Get Up [5.887969742827488]
Getting up from a fallen state is a basic human skill.
Existing methods for learning this skill generate highly dynamic and erratic get-up motions.
We present a staged approach using reinforcement learning, without recourse to motion capture data.
arXiv Detail & Related papers (2022-04-30T17:12:30Z) - Adversarial Motion Priors Make Good Substitutes for Complex Reward
Functions [124.11520774395748]
Reinforcement learning practitioners often utilize complex reward functions that encourage physically plausible behaviors.
We propose substituting complex reward functions with "style rewards" learned from a dataset of motion capture demonstrations.
A learned style reward can be combined with an arbitrary task reward to train policies that perform tasks using naturalistic strategies.
arXiv Detail & Related papers (2022-03-28T21:17:36Z) - Learning Task-Agnostic Action Spaces for Movement Optimization [18.37812596641983]
We propose a novel method for exploring the dynamics of physically based animated characters.
We parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets.
arXiv Detail & Related papers (2020-09-22T06:18:56Z) - Reinforcement Learning with Fast Stabilization in Linear Dynamical
Systems [91.43582419264763]
We study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems.
We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment.
We show that the proposed algorithm attains $tildemathcalO(sqrtT)$ regret after $T$ time steps of agent-environment interaction.
arXiv Detail & Related papers (2020-07-23T23:06:40Z) - TENet: Triple Excitation Network for Video Salient Object Detection [57.72696926903698]
We propose a simple yet effective approach, named Triple Excitation Network, to reinforce the training of video salient object detection (VSOD)
These excitation mechanisms are designed following the spirit of curriculum learning and aim to reduce learning at the beginning of training.
Our semi-curriculum learning design enables the first online strategy for VSOD, which allows exciting and boosting saliency responses during testing without re-training.
arXiv Detail & Related papers (2020-07-20T08:45:41Z) - Evolutionary Stochastic Policy Distillation [139.54121001226451]
We propose a new method called Evolutionary Policy Distillation (ESPD) to solve GCRS tasks.
ESPD enables a target policy to learn from a series of its variants through the technique of policy distillation (PD)
The experiments based on the MuJoCo control suite show the high learning efficiency of the proposed method.
arXiv Detail & Related papers (2020-04-27T16:19:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.