Reward Shaping with Subgoals for Social Navigation
- URL: http://arxiv.org/abs/2104.06410v1
- Date: Tue, 13 Apr 2021 13:52:58 GMT
- Title: Reward Shaping with Subgoals for Social Navigation
- Authors: Takato Okudo and Seiji Yamada
- Abstract summary: Social navigation has been gaining attentions with the growth in machine intelligence.
reinforcement learning can select an action in the prediction phase at a low computational cost.
We propose a reward shaping method with subgoals to accelerate learning.
- Score: 7.6146285961466
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Social navigation has been gaining attentions with the growth in machine
intelligence. Since reinforcement learning can select an action in the
prediction phase at a low computational cost, it has been formulated in a
social navigation tasks. However, reinforcement learning takes an enormous
number of iterations until acquiring a behavior policy in the learning phase.
This negatively affects the learning of robot behaviors in the real world. In
particular, social navigation includes humans who are unpredictable moving
obstacles in an environment. We proposed a reward shaping method with subgoals
to accelerate learning. The main part is an aggregation method that use
subgoals to shape a reinforcement learning algorithm. We performed a learning
experiment with a social navigation task in which a robot avoided collisions
and then reached its goal. The experimental results show that our method
improved the learning efficiency from a base algorithm in the task.
Related papers
- SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation [58.14969377419633]
We propose spire, a system that decomposes tasks into smaller learning subproblems and second combines imitation and reinforcement learning to maximize their strengths.
We find that spire outperforms prior approaches that integrate imitation learning, reinforcement learning, and planning by 35% to 50% in average task performance.
arXiv Detail & Related papers (2024-10-23T17:42:07Z) - Offline Imitation Learning Through Graph Search and Retrieval [57.57306578140857]
Imitation learning is a powerful machine learning algorithm for a robot to acquire manipulation skills.
We propose GSR, a simple yet effective algorithm that learns from suboptimal demonstrations through Graph Search and Retrieval.
GSR can achieve a 10% to 30% higher success rate and over 30% higher proficiency compared to baselines.
arXiv Detail & Related papers (2024-07-22T06:12:21Z) - Online Context Learning for Socially-compliant Navigation [49.609656402450746]
This letter introduces an online context learning method that aims to empower robots to adapt to new social environments online.
Experiments using a community-wide simulator show that our method outperforms the state-of-the-art ones.
arXiv Detail & Related papers (2024-06-17T12:59:13Z) - A Study on Learning Social Robot Navigation with Multimodal Perception [6.052803245103173]
We present a study on learning social robot navigation with multimodal perception using a large-scale real-world dataset.
We compare unimodal and multimodal learning approaches against a set of classical navigation approaches in different social scenarios.
The results show that multimodal learning has a clear advantage over unimodal learning in both dataset and human studies.
arXiv Detail & Related papers (2023-09-22T01:47:47Z) - SACSoN: Scalable Autonomous Control for Social Navigation [62.59274275261392]
We develop methods for training policies for socially unobtrusive navigation.
By minimizing this counterfactual perturbation, we can induce robots to behave in ways that do not alter the natural behavior of humans in the shared space.
We collect a large dataset where an indoor mobile robot interacts with human bystanders.
arXiv Detail & Related papers (2023-06-02T19:07:52Z) - Human-to-Robot Imitation in the Wild [50.49660984318492]
We propose an efficient one-shot robot learning algorithm, centered around learning from a third-person perspective.
We show one-shot generalization and success in real-world settings, including 20 different manipulation tasks in the wild.
arXiv Detail & Related papers (2022-07-19T17:59:59Z) - Relative velocity-based reward functions for crowd navigation of robots [7.671375709255977]
How to navigate in crowd environments with socially acceptable standards remains a key problem to be solved for the development of mobile robots.
Recent work has shown the effectiveness of deep reinforcement learning in addressing crowd navigation, but the learning becomes progressively less effective as the speed of pedestrians increases.
To improve the effectiveness of deep reinforcement learning, we redesigned the reward function by introducing the penalty term of relative speed in the reward function.
arXiv Detail & Related papers (2021-12-28T03:49:01Z) - Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement
Learning [7.6146285961466]
We extend potential-based reward shaping and propose a subgoal-based reward shaping.
Our method makes it easier for human trainers to share their knowledge of subgoals.
arXiv Detail & Related papers (2021-04-13T14:28:48Z) - Hierarchical Affordance Discovery using Intrinsic Motivation [69.9674326582747]
We propose an algorithm using intrinsic motivation to guide the learning of affordances for a mobile robot.
This algorithm is capable to autonomously discover, learn and adapt interrelated affordances without pre-programmed actions.
Once learned, these affordances may be used by the algorithm to plan sequences of actions in order to perform tasks of various difficulties.
arXiv Detail & Related papers (2020-09-23T07:18:21Z) - Analysis of Social Robotic Navigation approaches: CNN Encoder and
Incremental Learning as an alternative to Deep Reinforcement Learning [1.244705780038575]
Having humans in the learning loop is incompatible with state-of-the-art machine learning algorithms.
In this work, we discuss this problem and possible solutions by analysing a previous study on adaptive convolutional encoders for a social navigation task.
arXiv Detail & Related papers (2020-08-18T14:54:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.