Heuristic-free Optimization of Force-Controlled Robot Search Strategies
in Stochastic Environments
- URL: http://arxiv.org/abs/2207.07524v1
- Date: Fri, 15 Jul 2022 15:16:08 GMT
- Title: Heuristic-free Optimization of Force-Controlled Robot Search Strategies
in Stochastic Environments
- Authors: Benjamin Alt, Darko Katic, Rainer J\"akel and Michael Beetz
- Abstract summary: Even relatively simple peg-in-hole tasks are typically subject to variations, requiring search motions to find relevant features such as holes.
This paper introduces an automatic, data-driven and conditioning-free approach to optimize search strategies.
We evaluate our approach on two different industrial robots in the context of spiral and probe search for THT electronics assembly.
- Score: 13.622757453459748
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In both industrial and service domains, a central benefit of the use of
robots is their ability to quickly and reliably execute repetitive tasks.
However, even relatively simple peg-in-hole tasks are typically subject to
stochastic variations, requiring search motions to find relevant features such
as holes. While search improves robustness, it comes at the cost of increased
runtime: More exhaustive search will maximize the probability of successfully
executing a given task, but will significantly delay any downstream tasks. This
trade-off is typically resolved by human experts according to simple
heuristics, which are rarely optimal. This paper introduces an automatic,
data-driven and heuristic-free approach to optimize robot search strategies. By
training a neural model of the search strategy on a large set of simulated
stochastic environments, conditioning it on few real-world examples and
inverting the model, we can infer search strategies which adapt to the
time-variant characteristics of the underlying probability distributions, while
requiring very few real-world measurements. We evaluate our approach on two
different industrial robots in the context of spiral and probe search for THT
electronics assembly.
Related papers
- Fully Automated Correlated Time Series Forecasting in Minutes [31.198713853170375]
We propose a fully automated and highly efficient correlated time series forecasting framework.
It includes a data-driven, iterative strategy to automatically prune a large search space to obtain a high-quality search space for a new forecasting task.
Experiments on seven benchmark datasets offer evidence that the framework is capable of state-of-the-art accuracy and is much more efficient than existing methods.
arXiv Detail & Related papers (2024-11-06T09:02:13Z) - Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning [61.294110816231886]
We introduce a sparse, reusable, and flexible policy, Sparse Diffusion Policy (SDP)
SDP selectively activates experts and skills, enabling efficient and task-specific learning without retraining the entire model.
Demos and codes can be found in https://forrest-110.io/sparse_diffusion_policy/.
arXiv Detail & Related papers (2024-07-01T17:59:56Z) - Comparing Active Learning Performance Driven by Gaussian Processes or
Bayesian Neural Networks for Constrained Trajectory Exploration [0.0]
Currently, humans drive robots to meet scientific objectives, but depending on the robot's location, the exchange of information and driving commands may cause undue delays in mission fulfillment.
An autonomous robot encoded with a scientific objective and an exploration strategy incurs no communication delays and can fulfill missions more quickly.
Active learning algorithms offer this capability of intelligent exploration, but the underlying model structure varies the performance of the active learning algorithm in accurately forming an understanding of the environment.
arXiv Detail & Related papers (2023-09-28T02:45:14Z) - Maximize to Explore: One Objective Function Fusing Estimation, Planning,
and Exploration [87.53543137162488]
We propose an easy-to-implement online reinforcement learning (online RL) framework called textttMEX.
textttMEX integrates estimation and planning components while balancing exploration exploitation automatically.
It can outperform baselines by a stable margin in various MuJoCo environments with sparse rewards.
arXiv Detail & Related papers (2023-05-29T17:25:26Z) - GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent
Active Search [5.861092453610268]
Generalized Uncertainty-aware Thompson Sampling (GUTS) algorithm is suitable for deployment on heterogeneous multi-robot systems for active search in large unstructured environments.
We conduct field tests using our multi-robot system in an unstructured environment with a search area of 75,000 sq. m.
arXiv Detail & Related papers (2023-04-04T18:58:16Z) - Domain Randomization for Robust, Affordable and Effective Closed-loop
Control of Soft Robots [10.977130974626668]
Soft robots are gaining popularity thanks to their intrinsic safety to contacts and adaptability.
We show how Domain Randomization (DR) can solve this problem by enhancing RL policies for soft robots.
We introduce a novel algorithmic extension to previous adaptive domain randomization methods for the automatic inference of dynamics parameters for deformable objects.
arXiv Detail & Related papers (2023-03-07T18:50:00Z) - TransPath: Learning Heuristics For Grid-Based Pathfinding via
Transformers [64.88759709443819]
We suggest learning the instance-dependent proxies that are supposed to notably increase the efficiency of the search.
The first proxy we suggest to learn is the correction factor, i.e. the ratio between the instance independent cost-to-go estimate and the perfect one.
The second proxy is the path probability, which indicates how likely the grid cell is lying on the shortest path.
arXiv Detail & Related papers (2022-12-22T14:26:11Z) - SABER: Data-Driven Motion Planner for Autonomously Navigating
Heterogeneous Robots [112.2491765424719]
We present an end-to-end online motion planning framework that uses a data-driven approach to navigate a heterogeneous robot team towards a global goal.
We use model predictive control (SMPC) to calculate control inputs that satisfy robot dynamics, and consider uncertainty during obstacle avoidance with chance constraints.
recurrent neural networks are used to provide a quick estimate of future state uncertainty considered in the SMPC finite-time horizon solution.
A Deep Q-learning agent is employed to serve as a high-level path planner, providing the SMPC with target positions that move the robots towards a desired global goal.
arXiv Detail & Related papers (2021-08-03T02:56:21Z) - Robotic Brain Storm Optimization: A Multi-target Collaborative Searching
Paradigm for Swarm Robotics [24.38312890501329]
This paper proposes a BSO-based collaborative searching framework for swarm robotics called Robotic BSO.
The proposed method can simulate the BSO's guided search characteristics and has an excellent prospect for multi-target searching problems for swarm robotics.
arXiv Detail & Related papers (2021-05-27T13:05:48Z) - AutoOD: Automated Outlier Detection via Curiosity-guided Search and
Self-imitation Learning [72.99415402575886]
Outlier detection is an important data mining task with numerous practical applications.
We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model.
Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
arXiv Detail & Related papers (2020-06-19T18:57:51Z) - Guided Uncertainty-Aware Policy Optimization: Combining Learning and
Model-Based Strategies for Sample-Efficient Policy Learning [75.56839075060819]
Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state.
reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle.
In this work, we combine the strengths of model-based methods with the flexibility of learning-based methods to obtain a general method that is able to overcome inaccuracies in the robotics perception/actuation pipeline.
arXiv Detail & Related papers (2020-05-21T19:47:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.