DROPO: Sim-to-Real Transfer with Offline Domain Randomization
- URL: http://arxiv.org/abs/2201.08434v1
- Date: Thu, 20 Jan 2022 20:03:35 GMT
- Title: DROPO: Sim-to-Real Transfer with Offline Domain Randomization
- Authors: Gabriele Tiboni and Karol Arndt and Ville Kyrki
- Abstract summary: We introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer.
We demonstrate that DROPO is capable of recovering dynamic parameter distributions in simulation and finding a distribution capable of compensating for an unmodelled phenomenon.
- Score: 12.778412161239466
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, domain randomization has gained a lot of traction as a
method for sim-to-real transfer of reinforcement learning policies in robotic
manipulation; however, finding optimal randomization distributions can be
difficult. In this paper, we introduce DROPO, a novel method for estimating
domain randomization distributions for safe sim-to-real transfer. Unlike prior
work, DROPO only requires a limited, precollected offline dataset of
trajectories, and explicitly models parameter uncertainty to match real data.
We demonstrate that DROPO is capable of recovering dynamic parameter
distributions in simulation and finding a distribution capable of compensating
for an unmodelled phenomenon. We also evaluate the method in two zero-shot
sim-to-real transfer scenarios, showing successful domain transfer and improved
performance over prior methods.
Related papers
- Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Enhancing Polynomial Chaos Expansion Based Surrogate Modeling using a
Novel Probabilistic Transfer Learning Strategy [2.980666177064344]
In black-box simulations, non-intrusive PCE allows the construction of surrogates using a set of simulation response evaluations.
We propose to leverage transfer learning whereby knowledge gained through similar PCE surrogate construction tasks is transferred to a new surrogate-construction task.
arXiv Detail & Related papers (2023-12-07T19:16:42Z) - Domain Randomization via Entropy Maximization [32.76511125425768]
We propose a novel approach to address sim-to-real transfer, which automatically shapes dynamics distributions during training in simulation without requiring real-world data.
We introduce DOmain RAndomization via Entropy MaximizatiON (DORAEMON), a constrained optimization problem that directly maximizes the entropy of the training distribution.
We empirically validate the consistent benefits of DORAEMON in obtaining highly adaptive and generalizable policies.
arXiv Detail & Related papers (2023-11-03T12:54:05Z) - Robust Visual Sim-to-Real Transfer for Robotic Manipulation [79.66851068682779]
Learning visuomotor policies in simulation is much safer and cheaper than in the real world.
However, due to discrepancies between the simulated and real data, simulator-trained policies often fail when transferred to real robots.
One common approach to bridge the visual sim-to-real domain gap is domain randomization (DR)
arXiv Detail & Related papers (2023-07-28T05:47:24Z) - Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion [88.45326906116165]
We present a new framework to formulate the trajectory prediction task as a reverse process of motion indeterminacy diffusion (MID)
We encode the history behavior information and the social interactions as a state embedding and devise a Transformer-based diffusion model to capture the temporal dependencies of trajectories.
Experiments on the human trajectory prediction benchmarks including the Stanford Drone and ETH/UCY datasets demonstrate the superiority of our method.
arXiv Detail & Related papers (2022-03-25T16:59:08Z) - Comparing Probability Distributions with Conditional Transport [63.11403041984197]
We propose conditional transport (CT) as a new divergence and approximate it with the amortized CT (ACT) cost.
ACT amortizes the computation of its conditional transport plans and comes with unbiased sample gradients that are straightforward to compute.
On a wide variety of benchmark datasets generative modeling, substituting the default statistical distance of an existing generative adversarial network with ACT is shown to consistently improve the performance.
arXiv Detail & Related papers (2020-12-28T05:14:22Z) - Model-based Policy Optimization with Unsupervised Model Adaptation [37.09948645461043]
We investigate how to bridge the gap between real and simulated data due to inaccurate model estimation for better policy optimization.
We propose a novel model-based reinforcement learning framework AMPO, which introduces unsupervised model adaptation.
Our approach achieves state-of-the-art performance in terms of sample efficiency on a range of continuous control benchmark tasks.
arXiv Detail & Related papers (2020-10-19T14:19:42Z) - Learning Invariant Representations and Risks for Semi-supervised Domain
Adaptation [109.73983088432364]
We propose the first method that aims to simultaneously learn invariant representations and risks under the setting of semi-supervised domain adaptation (Semi-DA)
We introduce the LIRR algorithm for jointly textbfLearning textbfInvariant textbfRepresentations and textbfRisks.
arXiv Detail & Related papers (2020-10-09T15:42:35Z) - Predicting Sim-to-Real Transfer with Probabilistic Dynamics Models [3.7692466417039814]
We propose a method to predict the sim-to-real transfer performance of RL policies.
A probabilistic dynamics model is trained alongside the policy and evaluated on a fixed set of real-world trajectories.
arXiv Detail & Related papers (2020-09-27T15:06:54Z) - Data-efficient Domain Randomization with Bayesian Optimization [34.854609756970305]
When learning policies for robot control, the required real-world data is typically prohibitively expensive to acquire.
BayRn is a black-box sim-to-real algorithm that solves tasks efficiently by adapting the domain parameter distribution.
Our results show that BayRn is able to perform sim-to-real transfer, while significantly reducing the required prior knowledge.
arXiv Detail & Related papers (2020-03-05T07:48:31Z) - Few-shot Domain Adaptation by Causal Mechanism Transfer [107.08605582020866]
We study few-shot supervised domain adaptation (DA) for regression problems, where only a few labeled target domain data and many labeled source domain data are available.
Many of the current DA methods base their transfer assumptions on either parametrized distribution shift or apparent distribution similarities.
We propose mechanism transfer, a meta-distributional scenario in which a data generating mechanism is invariant among domains.
arXiv Detail & Related papers (2020-02-10T02:16:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.