Enhancing AUV Autonomy With Model Predictive Path Integral Control
- URL: http://arxiv.org/abs/2308.05547v1
- Date: Thu, 10 Aug 2023 12:55:57 GMT
- Title: Enhancing AUV Autonomy With Model Predictive Path Integral Control
- Authors: Pierre Nicolay, Yvan Petillot, Mykhaylo Marfeychuk, Sen Wang, Ignacio
Carlucho
- Abstract summary: We investigate the feasibility of Model Predictive Path Integral Control (MPPI) for the control of an AUV.
We utilise a non-linear model of the AUV to propagate the samples of the MPPI, which allow us to compute the control action in real time.
- Score: 9.800697959791544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autonomous underwater vehicles (AUVs) play a crucial role in surveying marine
environments, carrying out underwater inspection tasks, and ocean exploration.
However, in order to ensure that the AUV is able to carry out its mission
successfully, a control system capable of adapting to changing environmental
conditions is required. Furthermore, to ensure the robotic platform's safe
operation, the onboard controller should be able to operate under certain
constraints. In this work, we investigate the feasibility of Model Predictive
Path Integral Control (MPPI) for the control of an AUV. We utilise a non-linear
model of the AUV to propagate the samples of the MPPI, which allow us to
compute the control action in real time. We provide a detailed evaluation of
the effect of the main hyperparameters on the performance of the MPPI
controller. Furthermore, we compared the performance of the proposed method
with a classical PID and Cascade PID approach, demonstrating the superiority of
our proposed controller. Finally, we present results where environmental
constraints are added and show how MPPI can handle them by simply incorporating
those constraints in the cost function.
Related papers
- A comparison of RL-based and PID controllers for 6-DOF swimming robots:
hybrid underwater object tracking [8.362739554991073]
We present an exploration and assessment of employing a centralized deep Q-network (DQN) controller as a substitute for PID controllers.
Our primary focus centers on illustrating this transition with the specific case of underwater object tracking.
Our experiments, conducted within a Unity-based simulator, validate the effectiveness of a centralized RL agent over separated PID controllers.
arXiv Detail & Related papers (2024-01-29T23:14:15Z) - Sim-to-Real Transfer of Adaptive Control Parameters for AUV
Stabilization under Current Disturbance [1.099532646524593]
This paper presents a novel approach, merging the Maximum Entropy Deep Reinforcement Learning framework with a classic model-based control architecture, to formulate an adaptive controller.
Within this framework, we introduce a Sim-to-Real transfer strategy comprising the following components: a bio-inspired experience replay mechanism, an enhanced domain randomisation technique, and an evaluation protocol executed on a physical platform.
Our experimental assessments demonstrate that this method effectively learns proficient policies from suboptimal simulated models of the AUV, resulting in control performance 3 times higher when transferred to a real-world vehicle.
arXiv Detail & Related papers (2023-10-17T08:46:56Z) - Tuning Legged Locomotion Controllers via Safe Bayesian Optimization [47.87675010450171]
This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms.
We leverage a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system.
arXiv Detail & Related papers (2023-06-12T13:10:14Z) - In-Distribution Barrier Functions: Self-Supervised Policy Filters that
Avoid Out-of-Distribution States [84.24300005271185]
We propose a control filter that wraps any reference policy and effectively encourages the system to stay in-distribution with respect to offline-collected safe demonstrations.
Our method is effective for two different visuomotor control tasks in simulation environments, including both top-down and egocentric view settings.
arXiv Detail & Related papers (2023-01-27T22:28:19Z) - Designing a Robust Low-Level Agnostic Controller for a Quadrotor with
Actor-Critic Reinforcement Learning [0.38073142980732994]
We introduce domain randomization during the training phase of a low-level waypoint guidance controller based on Soft Actor-Critic.
We show that, by introducing a certain degree of uncertainty in quadrotor dynamics during training, we can obtain a controller that is capable to perform the proposed task using a larger variation of quadrotor parameters.
arXiv Detail & Related papers (2022-10-06T14:58:19Z) - Policy Search for Model Predictive Control with Application to Agile
Drone Flight [56.24908013905407]
We propose a policy-search-for-model-predictive-control framework for MPC.
Specifically, we formulate the MPC as a parameterized controller, where the hard-to-optimize decision variables are represented as high-level policies.
Experiments show that our controller achieves robust and real-time control performance in both simulation and the real world.
arXiv Detail & Related papers (2021-12-07T17:39:24Z) - Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations [50.37808220291108]
This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations.
We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety.
We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior.
arXiv Detail & Related papers (2021-11-18T23:21:00Z) - Data-driven controllers and the need for perception systems in
underwater manipulation [4.060731229044571]
The modeling of UVMSs is a complicated and costly process due to the highly nonlinear dynamics.
This is aggravated in tasks where the manipulation of objects is necessary.
We introduce a control strategy for UVMSs working with unknown payloads.
arXiv Detail & Related papers (2021-09-21T17:25:10Z) - Pointwise Feasibility of Gaussian Process-based Safety-Critical Control
under Model Uncertainty [77.18483084440182]
Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) are popular tools for enforcing safety and stability of a controlled system, respectively.
We present a Gaussian Process (GP)-based approach to tackle the problem of model uncertainty in safety-critical controllers that use CBFs and CLFs.
arXiv Detail & Related papers (2021-06-13T23:08:49Z) - Control Design of Autonomous Drone Using Deep Learning Based Image
Understanding Techniques [1.0953917735844645]
This paper presents a new framework to use images as the inputs for the controller to have autonomous flight, considering the noisy indoor environment and uncertainties.
A new Proportional-Integral-Derivative-Accelerated (PIDA) control with a derivative filter is proposed to improve drone/quadcopter flight stability within a noisy environment.
arXiv Detail & Related papers (2020-04-27T15:50:04Z) - Goal-Conditioned End-to-End Visuomotor Control for Versatile Skill
Primitives [89.34229413345541]
We propose a conditioning scheme which avoids pitfalls by learning the controller and its conditioning in an end-to-end manner.
Our model predicts complex action sequences based directly on a dynamic image representation of the robot motion.
We report significant improvements in task success over representative MPC and IL baselines.
arXiv Detail & Related papers (2020-03-19T15:04:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.