Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement
Learning
- URL: http://arxiv.org/abs/2201.05599v1
- Date: Fri, 14 Jan 2022 18:42:18 GMT
- Title: Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement
Learning
- Authors: Michael R. Behrens and Warren C. Ruder
- Abstract summary: Deep reinforcement learning is a promising method of autonomously developing robust controllers for creating smart microrobots.
Here, we report the development of a smart helical magnetic hydrogel microrobot that used the soft actor critic reinforcement learning algorithm to autonomously derive a control policy.
The reinforcement learning agent learned successful control policies with fewer than 100,000 training steps, demonstrating sample efficiency for fast learning.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Swimming microrobots are increasingly developed with complex materials and
dynamic shapes and are expected to operate in complex environments in which the
system dynamics are difficult to model and positional control of the microrobot
is not straightforward to achieve. Deep reinforcement learning is a promising
method of autonomously developing robust controllers for creating smart
microrobots, which can adapt their behavior to operate in uncharacterized
environments without the need to model the system dynamics. Here, we report the
development of a smart helical magnetic hydrogel microrobot that used the soft
actor critic reinforcement learning algorithm to autonomously derive a control
policy which allowed the microrobot to swim through an uncharacterized
biomimetic fluidic environment under control of a time varying magnetic field
generated from a three-axis array of electromagnets. The reinforcement learning
agent learned successful control policies with fewer than 100,000 training
steps, demonstrating sample efficiency for fast learning. We also demonstrate
that we can fine tune the control policies learned by the reinforcement
learning agent by fitting mathematical functions to the learned policy's action
distribution via regression. Deep reinforcement learning applied to microrobot
control is likely to significantly expand the capabilities of the next
generation of microrobots.
Related papers
- Action Flow Matching for Continual Robot Learning [57.698553219660376]
Continual learning in robotics seeks systems that can constantly adapt to changing environments and tasks.
We introduce a generative framework leveraging flow matching for online robot dynamics model alignment.
We find that by transforming the actions themselves rather than exploring with a misaligned model, the robot collects informative data more efficiently.
arXiv Detail & Related papers (2025-04-25T16:26:15Z) - On the Vulnerability of LLM/VLM-Controlled Robotics [54.57914943017522]
We highlight vulnerabilities in robotic systems integrating large language models (LLMs) and vision-language models (VLMs) due to input modality sensitivities.
Our results show that simple input perturbations reduce task execution success rates by 22.2% and 14.6% in two representative LLM/VLM-controlled robotic systems.
arXiv Detail & Related papers (2024-02-15T22:01:45Z) - RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation [68.70755196744533]
RoboGen is a generative robotic agent that automatically learns diverse robotic skills at scale via generative simulation.
Our work attempts to extract the extensive and versatile knowledge embedded in large-scale models and transfer them to the field of robotics.
arXiv Detail & Related papers (2023-11-02T17:59:21Z) - Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for
Autonomous Real-World Reinforcement Learning [58.3994826169858]
We introduce RoboFuME, a reset-free fine-tuning system for robotic reinforcement learning.
Our insights are to utilize offline reinforcement learning techniques to ensure efficient online fine-tuning of a pre-trained policy.
Our method can incorporate data from an existing robot dataset and improve on a target task within as little as 3 hours of autonomous real-world experience.
arXiv Detail & Related papers (2023-10-23T17:50:08Z) - Robot Fleet Learning via Policy Merging [58.5086287737653]
We propose FLEET-MERGE to efficiently merge policies in the fleet setting.
We show that FLEET-MERGE consolidates the behavior of policies trained on 50 tasks in the Meta-World environment.
We introduce a novel robotic tool-use benchmark, FLEET-TOOLS, for fleet policy learning in compositional and contact-rich robot manipulation tasks.
arXiv Detail & Related papers (2023-10-02T17:23:51Z) - Navigation of micro-robot swarms for targeted delivery using
reinforcement learning [0.0]
We use the Reinforcement Learning (RL) algorithms Proximal Policy Optimization (PPO) and Robust Policy Optimization (RPO) to navigate a swarm of 4, 9 and 16 microswimmers.
We look at both PPO and RPO performances with limited state information scenarios and also test their robustness for random target location and size.
arXiv Detail & Related papers (2023-06-30T12:17:39Z) - Bridging Active Exploration and Uncertainty-Aware Deployment Using
Probabilistic Ensemble Neural Network Dynamics [11.946807588018595]
This paper presents a unified model-based reinforcement learning framework that bridges active exploration and uncertainty-aware deployment.
The two opposing tasks of exploration and deployment are optimized through state-of-the-art sampling-based MPC.
We conduct experiments on both autonomous vehicles and wheeled robots, showing promising results for both exploration and deployment.
arXiv Detail & Related papers (2023-05-20T17:20:12Z) - Programmable Control of Ultrasound Swarmbots through Reinforcement
Learning [0.0]
Acoustically driven microrobot navigation based on microbubbles is a promising approach for targeted drug delivery.
We use reinforcement learning control strategies to learn the microrobot dynamics and manipulate them through acoustic forces.
The result demonstrated for the first time autonomous acoustic navigation of microbubbles in a microfluidic environment.
arXiv Detail & Related papers (2022-09-30T11:46:12Z) - Active Predicting Coding: Brain-Inspired Reinforcement Learning for
Sparse Reward Robotic Control Problems [79.07468367923619]
We propose a backpropagation-free approach to robotic control through the neuro-cognitive computational framework of neural generative coding (NGC)
We design an agent built completely from powerful predictive coding/processing circuits that facilitate dynamic, online learning from sparse rewards.
We show that our proposed ActPC agent performs well in the face of sparse (extrinsic) reward signals and is competitive with or outperforms several powerful backprop-based RL approaches.
arXiv Detail & Related papers (2022-09-19T16:49:32Z) - Chemotaxis of sea urchin sperm cells through deep reinforcement learning [0.0]
In this work, we investigate how a model of sea urchin sperm cell can self-learn chemotactic motion in a chemoattractant concentration field.
We employ an artificial neural network to act as a decision-making agent and facilitate the sperm cell to discover efficient maneuver strategies.
Our results provide insights to the chemotactic process of sea urchin sperm cells and also prepare guidance for the intelligent maneuver of microrobots.
arXiv Detail & Related papers (2022-08-02T06:04:32Z) - Autonomous object harvesting using synchronized optoelectronic
microrobots [10.860767733334306]
Optoelectronic tweezer-driven microrobots (OETdMs) are a versatile micromanipulation technology.
We describe an approach to automated targeting and path planning to enable open-loop control of multiple microrobots.
arXiv Detail & Related papers (2021-03-08T17:24:15Z) - Populations of Spiking Neurons for Reservoir Computing: Closed Loop
Control of a Compliant Quadruped [64.64924554743982]
We present a framework for implementing central pattern generators with spiking neural networks to obtain closed loop robot control.
We demonstrate the learning of predefined gait patterns, speed control and gait transition on a simulated model of a compliant quadrupedal robot.
arXiv Detail & Related papers (2020-04-09T14:32:49Z) - Enhanced Adversarial Strategically-Timed Attacks against Deep
Reinforcement Learning [91.13113161754022]
We introduce timing-based adversarial strategies against a DRL-based navigation system by jamming in physical noise patterns on the selected time frames.
Our experimental results show that the adversarial timing attacks can lead to a significant performance drop.
arXiv Detail & Related papers (2020-02-20T21:39:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.