Learning Vision-based Reactive Policies for Obstacle Avoidance
- URL: http://arxiv.org/abs/2010.16298v1
- Date: Fri, 30 Oct 2020 14:50:32 GMT
- Title: Learning Vision-based Reactive Policies for Obstacle Avoidance
- Authors: Elie Aljalbout and Ji Chen and Konstantin Ritt and Maximilian Ulmer
and Sami Haddadin
- Abstract summary: Vision-based obstacle avoidance for robotic manipulators poses challenges for both perception and motion generation.
We propose a framework to connect perception and motion by identifying the relationship between the visual input and the corresponding motion representation.
We show the ability of the proposed method to efficiently learn stable obstacle avoidance strategies at a high success rate.
- Score: 15.135439507187801
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we address the problem of vision-based obstacle avoidance for
robotic manipulators. This topic poses challenges for both perception and
motion generation. While most work in the field aims at improving one of those
aspects, we provide a unified framework for approaching this problem. The main
goal of this framework is to connect perception and motion by identifying the
relationship between the visual input and the corresponding motion
representation. To this end, we propose a method for learning reactive obstacle
avoidance policies. We evaluate our method on goal-reaching tasks for single
and multiple obstacles scenarios. We show the ability of the proposed method to
efficiently learn stable obstacle avoidance strategies at a high success rate,
while maintaining closed-loop responsiveness required for critical applications
like human-robot interaction.
Related papers
- Learning to Assist Humans without Inferring Rewards [65.28156318196397]
We build upon prior work that studies assistance through the lens of empowerment.
An assistive agent aims to maximize the influence of the human's actions.
We prove that these representations estimate a similar notion of empowerment to that studied by prior work.
arXiv Detail & Related papers (2024-11-04T21:31:04Z) - Physically Plausible Full-Body Hand-Object Interaction Synthesis [32.83908152822006]
We propose a physics-based method for synthesizing dexterous hand-object interactions in a full-body setting.
Existing methods often focus on isolated segments of the interaction process and rely on data-driven techniques that may result in artifacts.
arXiv Detail & Related papers (2023-09-14T17:55:18Z) - Learning to Generate All Feasible Actions [4.333208181196761]
We introduce action mapping, a novel approach that divides the learning process into two steps: first learn feasibility and subsequently, the objective.
This paper focuses on the feasibility part by learning to generate all feasible actions through self-supervised querying of the feasibility model.
We demonstrate the agent's proficiency in generating actions across disconnected feasible action sets.
arXiv Detail & Related papers (2023-01-26T23:15:51Z) - Contact-Aware Retargeting of Skinned Motion [49.71236739408685]
This paper introduces a motion estimation method that preserves self-contacts and prevents interpenetration.
The method identifies self-contacts and ground contacts in the input motion, and optimize the motion to apply to the output skeleton.
In experiments, our results quantitatively outperform previous methods and we conduct a user study where our retargeted motions are rated as higher-quality than those produced by recent works.
arXiv Detail & Related papers (2021-09-15T17:05:02Z) - Seeing Differently, Acting Similarly: Imitation Learning with
Heterogeneous Observations [126.78199124026398]
In many real-world imitation learning tasks, the demonstrator and the learner have to act in different but full observation spaces.
In this work, we model the above learning problem as Heterogeneous Observations Learning (HOIL)
We propose the Importance Weighting with REjection (IWRE) algorithm based on the techniques of importance-weighting, learning with rejection, and active querying to solve the key challenge of occupancy measure matching.
arXiv Detail & Related papers (2021-06-17T05:44:04Z) - Decentralized Swarm Collision Avoidance for Quadrotors via End-to-End
Reinforcement Learning [28.592704336574158]
We draw biological inspiration from flocks of starlings and apply the insight to end-to-end learned decentralized collision avoidance.
We propose a new, scalable observation model following a biomimetic topological interaction rule.
Our learned policies are tested in simulation and subsequently transferred to real-world drones to validate their real-world applicability.
arXiv Detail & Related papers (2021-04-30T11:19:03Z) - Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement
Learning [49.04274612323564]
Obstacle avoidance is a fundamental and challenging problem for autonomous navigation of mobile robots.
In this paper, we consider the problem of obstacle avoidance in simple 3D environments where the robot has to solely rely on a single monocular camera.
We tackle the obstacle avoidance problem as a data-driven end-to-end deep learning approach.
arXiv Detail & Related papers (2021-03-08T13:05:46Z) - Behavior Priors for Efficient Reinforcement Learning [97.81587970962232]
We consider how information and architectural constraints can be combined with ideas from the probabilistic modeling literature to learn behavior priors.
We discuss how such latent variable formulations connect to related work on hierarchical reinforcement learning (HRL) and mutual information and curiosity based objectives.
We demonstrate the effectiveness of our framework by applying it to a range of simulated continuous control domains.
arXiv Detail & Related papers (2020-10-27T13:17:18Z) - Learning Obstacle Representations for Neural Motion Planning [70.80176920087136]
We address sensor-based motion planning from a learning perspective.
Motivated by recent advances in visual recognition, we argue the importance of learning appropriate representations for motion planning.
We propose a new obstacle representation based on the PointNet architecture and train it jointly with policies for obstacle avoidance.
arXiv Detail & Related papers (2020-08-25T17:12:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.