Related papers: Discovering Diverse Athletic Jumping Strategies

Discovering Diverse Athletic Jumping Strategies

URL: http://arxiv.org/abs/2105.00371v1
Date: Sun, 2 May 2021 01:37:16 GMT
Title: Discovering Diverse Athletic Jumping Strategies
Authors: Zhiqi Yin, Zeshi Yang, Michiel van de Panne, KangKang Yin
Abstract summary: We present a framework that enables the discovery of diverse and natural-looking motion strategies for athletic skills such as the high jump. The combination of physics simulation and deep reinforcement learning provides a suitable starting point for automatic control policy training.
Score: 8.231687569030898
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present a framework that enables the discovery of diverse and natural-looking motion strategies for athletic skills such as the high jump. The strategies are realized as control policies for physics-based characters. Given a task objective and an initial character configuration, the combination of physics simulation and deep reinforcement learning (DRL) provides a suitable starting point for automatic control policy training. To facilitate the learning of realistic human motions, we propose a Pose Variational Autoencoder (P-VAE) to constrain the actions to a subspace of natural poses. In contrast to motion imitation methods, a rich variety of novel strategies can naturally emerge by exploring initial character states through a sample-efficient Bayesian diversity search (BDS) algorithm. A second stage of optimization that encourages novel policies can further enrich the unique strategies discovered. Our method allows for the discovery of diverse and novel strategies for athletic jumping motions such as high jumps and obstacle jumps with no motion examples and less reward engineering than prior work.

Related papers

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching [77.28042137892943]
We present Perceptive Humanoid Parkour (PHP), a modular framework that enables humanoid robots to autonomously perform long-horizon, vision-based parkour.<n>We train motion-tracking reinforcement learning expert policies for these composed motions, and distill them into a single depth-based, multi-skill student policy.<n>We validate our framework with extensive real-world experiments on a Unitree G1 humanoid robot.
arXiv Detail & Related papers (2026-02-17T18:59:11Z)
Strategy and Skill Learning for Physics-based Table Tennis Animation [8.51262627906337]
We present a strategy and skill learning approach for physics-based table tennis animation. Our method addresses the issue of mode collapse, where the characters do not fully utilize the motor skills they need to perform to execute complex tasks.
arXiv Detail & Related papers (2024-07-23T06:31:13Z)
Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning [1.9253333342733674]
We investigate whether reinforcement learning can provide insights into biological systems when trained to perform chemotaxis. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners' training fails. We find that RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the environment.
arXiv Detail & Related papers (2024-04-02T14:42:52Z)
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents [58.807802111818994]
We propose AnySkill, a novel hierarchical method that learns physically plausible interactions following open-vocabulary instructions. Our approach begins by developing a set of atomic actions via a low-level controller trained via imitation learning. An important feature of our method is the use of image-based rewards for the high-level policy, which allows the agent to learn interactions with objects without manual reward engineering.
arXiv Detail & Related papers (2024-03-19T15:41:39Z)
Adaptive Tracking of a Single-Rigid-Body Character in Various Environments [2.048226951354646]
We propose a deep reinforcement learning method based on the simulation of a single-rigid-body character. Using the centroidal dynamics model (CDM) to express the full-body character as a single rigid body (SRB) and training a policy to track a reference motion, we can obtain a policy capable of adapting to various unobserved environmental changes. We demonstrate that our policy, efficiently trained within 30 minutes on an ultraportable laptop, has the ability to cope with environments that have not been experienced during learning.
arXiv Detail & Related papers (2023-08-14T22:58:54Z)
Robust and Versatile Bipedal Jumping Control through Reinforcement Learning [141.56016556936865]
This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled bipedal robot to perform robust and versatile dynamic jumps in the real world. We present a reinforcement learning framework for training a robot to accomplish a large variety of jumping tasks, such as jumping to different locations and directions. We develop a new policy structure that encodes the robot's long-term input/output (I/O) history while also providing direct access to a short-term I/O history.
arXiv Detail & Related papers (2023-02-19T01:06:09Z)
Learning to Get Up [5.887969742827488]
Getting up from a fallen state is a basic human skill. Existing methods for learning this skill generate highly dynamic and erratic get-up motions. We present a staged approach using reinforcement learning, without recourse to motion capture data.
arXiv Detail & Related papers (2022-04-30T17:12:30Z)
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions [124.11520774395748]
Reinforcement learning practitioners often utilize complex reward functions that encourage physically plausible behaviors. We propose substituting complex reward functions with "style rewards" learned from a dataset of motion capture demonstrations. A learned style reward can be combined with an arbitrary task reward to train policies that perform tasks using naturalistic strategies.
arXiv Detail & Related papers (2022-03-28T21:17:36Z)
Learning Task-Agnostic Action Spaces for Movement Optimization [18.37812596641983]
We propose a novel method for exploring the dynamics of physically based animated characters. We parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets.
arXiv Detail & Related papers (2020-09-22T06:18:56Z)
Reinforcement Learning with Fast Stabilization in Linear Dynamical Systems [91.43582419264763]
We study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems. We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment. We show that the proposed algorithm attains $tildemathcalO(sqrtT)$ regret after $T$ time steps of agent-environment interaction.
arXiv Detail & Related papers (2020-07-23T23:06:40Z)
TENet: Triple Excitation Network for Video Salient Object Detection [57.72696926903698]
We propose a simple yet effective approach, named Triple Excitation Network, to reinforce the training of video salient object detection (VSOD) These excitation mechanisms are designed following the spirit of curriculum learning and aim to reduce learning at the beginning of training. Our semi-curriculum learning design enables the first online strategy for VSOD, which allows exciting and boosting saliency responses during testing without re-training.
arXiv Detail & Related papers (2020-07-20T08:45:41Z)
Evolutionary Stochastic Policy Distillation [139.54121001226451]
We propose a new method called Evolutionary Policy Distillation (ESPD) to solve GCRS tasks. ESPD enables a target policy to learn from a series of its variants through the technique of policy distillation (PD) The experiments based on the MuJoCo control suite show the high learning efficiency of the proposed method.
arXiv Detail & Related papers (2020-04-27T16:19:25Z)

This list is automatically generated from the titles and abstracts of the papers in this site.