Model-aided Federated Reinforcement Learning for Multi-UAV Trajectory
Planning in IoT Networks
- URL: http://arxiv.org/abs/2306.02029v2
- Date: Sat, 7 Oct 2023 07:50:35 GMT
- Title: Model-aided Federated Reinforcement Learning for Multi-UAV Trajectory
Planning in IoT Networks
- Authors: Jichao Chen, Omid Esrafilian, Harald Bayerlein, David Gesbert, and
Marco Caccamo
- Abstract summary: We propose a novel model-aided federated MARL algorithm to coordinate multiple UAVs on a data harvesting mission with only limited knowledge about the environment.
A performance comparison with standard MARL algorithms demonstrates that our proposed model-aided FedQMIX algorithm reduces the need for real-world training experiences by around three magnitudes.
- Score: 17.770665737751372
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deploying teams of unmanned aerial vehicles (UAVs) to harvest data from
distributed Internet of Things (IoT) devices requires efficient trajectory
planning and coordination algorithms. Multi-agent reinforcement learning (MARL)
has emerged as a solution, but requires extensive and costly real-world
training data. To tackle this challenge, we propose a novel model-aided
federated MARL algorithm to coordinate multiple UAVs on a data harvesting
mission with only limited knowledge about the environment. The proposed
algorithm alternates between building an environment simulation model from
real-world measurements, specifically learning the radio channel
characteristics and estimating unknown IoT device positions, and federated QMIX
training in the simulated environment. Each UAV agent trains a local QMIX model
in its simulated environment and continuously consolidates it through federated
learning with other agents, accelerating the learning process. A performance
comparison with standard MARL algorithms demonstrates that our proposed
model-aided FedQMIX algorithm reduces the need for real-world training
experiences by around three magnitudes while attaining similar data collection
performance.
Related papers
- Federated Learning for Misbehaviour Detection with Variational Autoencoders and Gaussian Mixture Models [0.2999888908665658]
Federated Learning (FL) has become an attractive approach to collaboratively train Machine Learning (ML) models.
This work proposes a novel unsupervised FL approach for the identification of potential misbehavior in vehicular environments.
We leverage the computing capabilities of public cloud services for model aggregation purposes.
arXiv Detail & Related papers (2024-05-16T08:49:50Z) - Asynchronous Multi-Model Dynamic Federated Learning over Wireless
Networks: Theory, Modeling, and Optimization [20.741776617129208]
Federated learning (FL) has emerged as a key technique for distributed machine learning (ML)
We first formulate rectangular scheduling steps and functions to capture the impact of system parameters on learning performance.
Our analysis sheds light on the joint impact of device training variables and asynchronous scheduling decisions.
arXiv Detail & Related papers (2023-05-22T21:39:38Z) - Vertical Federated Learning over Cloud-RAN: Convergence Analysis and
System Optimization [82.12796238714589]
We propose a novel cloud radio access network (Cloud-RAN) based vertical FL system to enable fast and accurate model aggregation.
We characterize the convergence behavior of the vertical FL algorithm considering both uplink and downlink transmissions.
We establish a system optimization framework by joint transceiver and fronthaul quantization design, for which successive convex approximation and alternate convex search based system optimization algorithms are developed.
arXiv Detail & Related papers (2023-05-04T09:26:03Z) - Scheduling and Aggregation Design for Asynchronous Federated Learning
over Wireless Networks [56.91063444859008]
Federated Learning (FL) is a collaborative machine learning framework that combines on-device training and server-based aggregation.
We propose an asynchronous FL design with periodic aggregation to tackle the straggler issue in FL systems.
We show that an age-aware'' aggregation weighting design can significantly improve the learning performance in an asynchronous FL setting.
arXiv Detail & Related papers (2022-12-14T17:33:01Z) - Parallel Successive Learning for Dynamic Distributed Model Training over
Heterogeneous Wireless Networks [50.68446003616802]
Federated learning (FedL) has emerged as a popular technique for distributing model training over a set of wireless devices.
We develop parallel successive learning (PSL), which expands the FedL architecture along three dimensions.
Our analysis sheds light on the notion of cold vs. warmed up models, and model inertia in distributed machine learning.
arXiv Detail & Related papers (2022-02-07T05:11:01Z) - Robust Semi-supervised Federated Learning for Images Automatic
Recognition in Internet of Drones [57.468730437381076]
We present a Semi-supervised Federated Learning (SSFL) framework for privacy-preserving UAV image recognition.
There are significant differences in the number, features, and distribution of local data collected by UAVs using different camera modules.
We propose an aggregation rule based on the frequency of the client's participation in training, namely the FedFreq aggregation rule.
arXiv Detail & Related papers (2022-01-03T16:49:33Z) - UAV-assisted Online Machine Learning over Multi-Tiered Networks: A
Hierarchical Nested Personalized Federated Learning Approach [25.936914508952086]
We consider distributed machine learning (ML) through unmanned aerial vehicles (UAVs) for geo-distributed device clusters.
We propose five new technologies/techniques: (i) stratified UAV swarms with leader, worker, and coordinator UAVs, (ii) hierarchical nested personalized federated learning (HN-PFL), and (iii) cooperative UAV resource pooling for distributed ML using the UAVs' local computational capabilities.
arXiv Detail & Related papers (2021-06-29T21:40:28Z) - Edge Federated Learning Via Unit-Modulus Over-The-Air Computation
(Extended Version) [64.76619508293966]
This paper proposes a unit-modulus over-the-air computation (UM-AirComp) framework to facilitate efficient edge federated learning.
It uploads simultaneously local model parameters and updates global model parameters via analog beamforming.
We demonstrate the implementation of UM-AirComp in a vehicle-to-everything autonomous driving simulation platform.
arXiv Detail & Related papers (2021-01-28T15:10:22Z) - Fast-Convergent Federated Learning [82.32029953209542]
Federated learning is a promising solution for distributing machine learning tasks through modern networks of mobile devices.
We propose a fast-convergent federated learning algorithm, called FOLB, which performs intelligent sampling of devices in each round of model training.
arXiv Detail & Related papers (2020-07-26T14:37:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.