Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems
- URL: http://arxiv.org/abs/2110.05125v1
- Date: Mon, 11 Oct 2021 10:04:08 GMT
- Title: Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems
- Authors: Christos K. Verginis, Zhe Xu, Ufuk Topcu
- Abstract summary: We develop a learning-based algorithm for the distributed formation control of networked multi-agent systems.
The proposed algorithm integrates neural network-based learning with adaptive control in a two-step procedure.
We provide formal theoretical guarantees on the achievement of the formation task.
- Score: 29.22096249070293
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We develop a learning-based algorithm for the distributed formation control
of networked multi-agent systems governed by unknown, nonlinear dynamics. Most
existing algorithms either assume certain parametric forms for the unknown
dynamic terms or resort to unnecessarily large control inputs in order to
provide theoretical guarantees. The proposed algorithm avoids these drawbacks
by integrating neural network-based learning with adaptive control in a
two-step procedure. In the first step of the algorithm, each agent learns a
controller, represented as a neural network, using training data that
correspond to a collection of formation tasks and agent parameters. These
parameters and tasks are derived by varying the nominal agent parameters and
the formation specifications of the task in hand, respectively. In the second
step of the algorithm, each agent incorporates the trained neural network into
an online and adaptive control policy in such a way that the behavior of the
multi-agent closed-loop system satisfies a user-defined formation task. Both
the learning phase and the adaptive control policy are distributed, in the
sense that each agent computes its own actions using only local information
from its neighboring agents. The proposed algorithm does not use any a priori
information on the agents' unknown dynamic terms or any approximation schemes.
We provide formal theoretical guarantees on the achievement of the formation
task.
Related papers
- Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks [94.2860766709971]
We address the challenge of sampling and remote estimation for autoregressive Markovian processes in a wireless network with statistically-identical agents.
Our goal is to minimize time-average estimation error and/or age of information with decentralized scalable sampling and transmission policies.
arXiv Detail & Related papers (2024-04-04T06:24:11Z) - Joint Learning of Network Topology and Opinion Dynamics Based on Bandit
Algorithms [1.6912877206492036]
We study joint learning of network topology and a mixed opinion dynamics.
We propose a learning algorithm based on multi-armed bandit algorithms to address the problem.
arXiv Detail & Related papers (2023-06-25T21:53:13Z) - Safe Multi-agent Learning via Trapping Regions [89.24858306636816]
We apply the concept of trapping regions, known from qualitative theory of dynamical systems, to create safety sets in the joint strategy space for decentralized learning.
We propose a binary partitioning algorithm for verification that candidate sets form trapping regions in systems with known learning dynamics, and a sampling algorithm for scenarios where learning dynamics are not known.
arXiv Detail & Related papers (2023-02-27T14:47:52Z) - Scalable, Decentralized Multi-Agent Reinforcement Learning Methods
Inspired by Stigmergy and Ant Colonies [0.0]
We investigate a novel approach to decentralized multi-agent learning and planning.
In particular, this method is inspired by the cohesion, coordination, and behavior of ant colonies.
The approach combines single-agent RL and an ant-colony-inspired decentralized, stigmergic algorithm for multi-agent path planning and environment modification.
arXiv Detail & Related papers (2021-05-08T01:04:51Z) - Identification of Nonlinear Dynamic Systems Using Type-2 Fuzzy Neural
Networks -- A Novel Learning Algorithm and a Comparative Study [12.77304082363491]
A sliding mode theory-based learning algorithm has been proposed to tune both the premise and consequent parts of type-2 fuzzy neural networks.
The stability of the proposed learning algorithm has been proved by using an appropriate Lyapunov function.
Several comparisons have been realized and shown that the proposed algorithm has faster convergence speed than the existing methods.
arXiv Detail & Related papers (2021-04-04T23:44:59Z) - Detection of Insider Attacks in Distributed Projected Subgradient
Algorithms [11.096339082411882]
We show that a general neural network is particularly suitable for detecting and localizing malicious agents.
We propose to adopt one of the state-of-art approaches in federated learning, i.e., a collaborative peer-to-peer machine learning protocol.
In our simulations, a least-squared problem is considered to verify the feasibility and effectiveness of AI-based methods.
arXiv Detail & Related papers (2021-01-18T08:01:06Z) - Coded Stochastic ADMM for Decentralized Consensus Optimization with Edge
Computing [113.52575069030192]
Big data, including applications with high security requirements, are often collected and stored on multiple heterogeneous devices, such as mobile devices, drones and vehicles.
Due to the limitations of communication costs and security requirements, it is of paramount importance to extract information in a decentralized manner instead of aggregating data to a fusion center.
We consider the problem of learning model parameters in a multi-agent system with data locally processed via distributed edge nodes.
A class of mini-batch alternating direction method of multipliers (ADMM) algorithms is explored to develop the distributed learning model.
arXiv Detail & Related papers (2020-10-02T10:41:59Z) - Adaptive Serverless Learning [114.36410688552579]
We propose a novel adaptive decentralized training approach, which can compute the learning rate from data dynamically.
Our theoretical results reveal that the proposed algorithm can achieve linear speedup with respect to the number of workers.
To reduce the communication-efficient overhead, we further propose a communication-efficient adaptive decentralized training approach.
arXiv Detail & Related papers (2020-08-24T13:23:02Z) - Learning High-Level Policies for Model Predictive Control [54.00297896763184]
Model Predictive Control (MPC) provides robust solutions to robot control tasks.
We propose a self-supervised learning algorithm for learning a neural network high-level policy.
We show that our approach can handle situations that are difficult for standard MPC.
arXiv Detail & Related papers (2020-07-20T17:12:34Z) - Distributed Voltage Regulation of Active Distribution System Based on
Enhanced Multi-agent Deep Reinforcement Learning [9.7314654861242]
This paper proposes a data-driven distributed voltage control approach based on the spectrum clustering and the enhanced multi-agent deep reinforcement learning (MADRL) algorithm.
The proposed method can significantly reduce the requirements of communications and knowledge of system parameters.
It also effectively deals with uncertainties and can provide online coordinated control based on the latest local information.
arXiv Detail & Related papers (2020-05-31T15:48:27Z) - Decentralized MCTS via Learned Teammate Models [89.24858306636816]
We present a trainable online decentralized planning algorithm based on decentralized Monte Carlo Tree Search.
We show that deep learning and convolutional neural networks can be employed to produce accurate policy approximators.
arXiv Detail & Related papers (2020-03-19T13:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.