Related papers: Heuristic-free Optimization of Force-Controlled Robot Search Strategies in Stochastic Environments

Heuristic-free Optimization of Force-Controlled Robot Search Strategies in Stochastic Environments

URL: http://arxiv.org/abs/2207.07524v1
Date: Fri, 15 Jul 2022 15:16:08 GMT
Title: Heuristic-free Optimization of Force-Controlled Robot Search Strategies in Stochastic Environments
Authors: Benjamin Alt, Darko Katic, Rainer J\"akel and Michael Beetz
Abstract summary: Even relatively simple peg-in-hole tasks are typically subject to variations, requiring search motions to find relevant features such as holes. This paper introduces an automatic, data-driven and conditioning-free approach to optimize search strategies. We evaluate our approach on two different industrial robots in the context of spiral and probe search for THT electronics assembly.
Score: 13.622757453459748
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In both industrial and service domains, a central benefit of the use of robots is their ability to quickly and reliably execute repetitive tasks. However, even relatively simple peg-in-hole tasks are typically subject to stochastic variations, requiring search motions to find relevant features such as holes. While search improves robustness, it comes at the cost of increased runtime: More exhaustive search will maximize the probability of successfully executing a given task, but will significantly delay any downstream tasks. This trade-off is typically resolved by human experts according to simple heuristics, which are rarely optimal. This paper introduces an automatic, data-driven and heuristic-free approach to optimize robot search strategies. By training a neural model of the search strategy on a large set of simulated stochastic environments, conditioning it on few real-world examples and inverting the model, we can infer search strategies which adapt to the time-variant characteristics of the underlying probability distributions, while requiring very few real-world measurements. We evaluate our approach on two different industrial robots in the context of spiral and probe search for THT electronics assembly.

Related papers

A Real-time Anomaly Detection Method for Robots based on a Flexible and Sparse Latent Space [2.0186752447895993]
Deep learning-based models in robotics face challenges due to limited training data and highly noisy signal features. We present Sparse Masked Autoregressive Flow-based Adversarial AutoEncoders model to address these problems. Our model performs inferences within 1 millisecond, ensuring real-time anomaly detection.
arXiv Detail & Related papers (2025-04-15T13:17:14Z)
Fully Automated Correlated Time Series Forecasting in Minutes [31.198713853170375]
We propose a fully automated and highly efficient correlated time series forecasting framework. It includes a data-driven, iterative strategy to automatically prune a large search space to obtain a high-quality search space for a new forecasting task. Experiments on seven benchmark datasets offer evidence that the framework is capable of state-of-the-art accuracy and is much more efficient than existing methods.
arXiv Detail & Related papers (2024-11-06T09:02:13Z)
Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning [61.294110816231886]
We introduce a sparse, reusable, and flexible policy, Sparse Diffusion Policy (SDP) SDP selectively activates experts and skills, enabling efficient and task-specific learning without retraining the entire model. Demos and codes can be found in https://forrest-110.io/sparse_diffusion_policy/.
arXiv Detail & Related papers (2024-07-01T17:59:56Z)
Comparing Active Learning Performance Driven by Gaussian Processes or Bayesian Neural Networks for Constrained Trajectory Exploration [0.0]
Currently, humans drive robots to meet scientific objectives, but depending on the robot's location, the exchange of information and driving commands may cause undue delays in mission fulfillment. An autonomous robot encoded with a scientific objective and an exploration strategy incurs no communication delays and can fulfill missions more quickly. Active learning algorithms offer this capability of intelligent exploration, but the underlying model structure varies the performance of the active learning algorithm in accurately forming an understanding of the environment.
arXiv Detail & Related papers (2023-09-28T02:45:14Z)
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration [87.53543137162488]
We propose an easy-to-implement online reinforcement learning (online RL) framework called textttMEX. textttMEX integrates estimation and planning components while balancing exploration exploitation automatically. It can outperform baselines by a stable margin in various MuJoCo environments with sparse rewards.
arXiv Detail & Related papers (2023-05-29T17:25:26Z)
GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search [5.861092453610268]
Generalized Uncertainty-aware Thompson Sampling (GUTS) algorithm is suitable for deployment on heterogeneous multi-robot systems for active search in large unstructured environments. We conduct field tests using our multi-robot system in an unstructured environment with a search area of 75,000 sq. m.
arXiv Detail & Related papers (2023-04-04T18:58:16Z)
Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots [10.977130974626668]
Soft robots are gaining popularity thanks to their intrinsic safety to contacts and adaptability. We show how Domain Randomization (DR) can solve this problem by enhancing RL policies for soft robots. We introduce a novel algorithmic extension to previous adaptive domain randomization methods for the automatic inference of dynamics parameters for deformable objects.
arXiv Detail & Related papers (2023-03-07T18:50:00Z)
TransPath: Learning Heuristics For Grid-Based Pathfinding via Transformers [64.88759709443819]
We suggest learning the instance-dependent proxies that are supposed to notably increase the efficiency of the search. The first proxy we suggest to learn is the correction factor, i.e. the ratio between the instance independent cost-to-go estimate and the perfect one. The second proxy is the path probability, which indicates how likely the grid cell is lying on the shortest path.
arXiv Detail & Related papers (2022-12-22T14:26:11Z)
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots [112.2491765424719]
We present an end-to-end online motion planning framework that uses a data-driven approach to navigate a heterogeneous robot team towards a global goal. We use model predictive control (SMPC) to calculate control inputs that satisfy robot dynamics, and consider uncertainty during obstacle avoidance with chance constraints. recurrent neural networks are used to provide a quick estimate of future state uncertainty considered in the SMPC finite-time horizon solution. A Deep Q-learning agent is employed to serve as a high-level path planner, providing the SMPC with target positions that move the robots towards a desired global goal.
arXiv Detail & Related papers (2021-08-03T02:56:21Z)
Robotic Brain Storm Optimization: A Multi-target Collaborative Searching Paradigm for Swarm Robotics [24.38312890501329]
This paper proposes a BSO-based collaborative searching framework for swarm robotics called Robotic BSO. The proposed method can simulate the BSO's guided search characteristics and has an excellent prospect for multi-target searching problems for swarm robotics.
arXiv Detail & Related papers (2021-05-27T13:05:48Z)
AutoOD: Automated Outlier Detection via Curiosity-guided Search and Self-imitation Learning [72.99415402575886]
Outlier detection is an important data mining task with numerous practical applications. We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model. Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
arXiv Detail & Related papers (2020-06-19T18:57:51Z)
Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning [75.56839075060819]
Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state. reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle. In this work, we combine the strengths of model-based methods with the flexibility of learning-based methods to obtain a general method that is able to overcome inaccuracies in the robotics perception/actuation pipeline.
arXiv Detail & Related papers (2020-05-21T19:47:05Z)

This list is automatically generated from the titles and abstracts of the papers in this site.