Top-k Multi-Armed Bandit Learning for Content Dissemination in Swarms of Micro-UAVs
- URL: http://arxiv.org/abs/2404.10845v1
- Date: Tue, 16 Apr 2024 18:47:07 GMT
- Title: Top-k Multi-Armed Bandit Learning for Content Dissemination in Swarms of Micro-UAVs
- Authors: Amit Kumar Bhuyan, Hrishikesh Dutta, Subir Biswas
- Abstract summary: In communication-deprived disaster scenarios, this paper introduces a Micro-Unmanned Aerial Vehicle (UAV)-enhanced content management system.
In the absence of cellular infrastructure, this system deploys a hybrid network of stationary and mobile UAVs to offer vital content access to isolated communities.
The primary goal is to devise an adaptive content dissemination system that dynamically learns caching policies to maximize content accessibility.
- Score: 2.3076690318595676
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In communication-deprived disaster scenarios, this paper introduces a Micro-Unmanned Aerial Vehicle (UAV)-enhanced content management system. In the absence of cellular infrastructure, this system deploys a hybrid network of stationary and mobile UAVs to offer vital content access to isolated communities. Static anchor UAVs equipped with both vertical and lateral links cater to local users, while agile micro-ferrying UAVs, equipped with lateral links and greater mobility, reach users in various communities. The primary goal is to devise an adaptive content dissemination system that dynamically learns caching policies to maximize content accessibility. The paper proposes a decentralized Top-k Multi-Armed Bandit (Top-k MAB) learning approach for UAV caching decisions, accommodating geotemporal disparities in content popularity and diverse content demands. The proposed mechanism involves a Selective Caching Algorithm that reduces redundant copies of content by leveraging the information shared between the UAVs. It is demonstrated that Top-k MAB learning, together with the Selective Caching Algorithm, can improve system performance while making the learning process adaptive. The paper presents functional verification and performance evaluation of the proposed caching framework across a wide range of network sizes, micro-ferrying UAV swarms, and heterogeneous popularity distributions.
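The Top-k bandit idea in the abstract can be illustrated with a minimal sketch. It treats each content item as an arm and lets one UAV choose k items to cache per round. The abstract does not specify the exploration strategy or reward model, so the epsilon-greedy rule, the Bernoulli "was the cached item requested" reward, and all names (`TopKBandit`, `select_cache`, `simulate`) are assumptions made for illustration, not the authors' algorithm. The Selective Caching Algorithm and inter-UAV sharing are not modeled.

```python
import random


class TopKBandit:
    """Generic epsilon-greedy Top-k bandit: one arm per content item (illustrative)."""

    def __init__(self, n_contents, k, epsilon=0.1, seed=None):
        if not 0 < k <= n_contents:
            raise ValueError("k must be in 1..n_contents")
        self.n = n_contents
        self.k = k
        self.epsilon = epsilon
        self.counts = [0] * n_contents    # times each content was cached
        self.values = [0.0] * n_contents  # running mean reward per content
        self.rng = random.Random(seed)

    def select_cache(self):
        """Return k distinct content ids to cache this round."""
        ranked = sorted(range(self.n), key=lambda c: self.values[c], reverse=True)
        cache, rest = ranked[:self.k], ranked[self.k:]
        # Exploration: each slot is swapped with a random uncached item
        # with probability epsilon; swapping keeps the ids distinct.
        for slot in range(self.k):
            if rest and self.rng.random() < self.epsilon:
                j = self.rng.randrange(len(rest))
                cache[slot], rest[j] = rest[j], cache[slot]
        return cache

    def update(self, content, reward):
        """Incremental mean update for one cached content."""
        self.counts[content] += 1
        self.values[content] += (reward - self.values[content]) / self.counts[content]


def simulate(popularity, k, rounds=5000, epsilon=0.1, seed=0):
    """Assumed reward model: a cached item earns 1 with its request probability."""
    bandit = TopKBandit(len(popularity), k, epsilon, seed)
    rng = random.Random(seed + 1)
    for _ in range(rounds):
        for c in bandit.select_cache():
            reward = 1.0 if rng.random() < popularity[c] else 0.0
            bandit.update(c, reward)
    return bandit
```

Calling `simulate([0.9, 0.8, 0.1, 0.05, 0.02], k=2)` returns a bandit whose `values` list holds the learned request-rate estimates. A decentralized deployment would run one such learner per UAV, each seeing only its own local demand.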
Related papers
- Aerial Reliable Collaborative Communications for Terrestrial Mobile Users via Evolutionary Multi-Objective Deep Reinforcement Learning [59.660724802286865]
Unmanned aerial vehicles (UAVs) have emerged as potential aerial base stations (BSs) to improve terrestrial communications.
This work employs collaborative beamforming through a UAV-enabled virtual antenna array to improve transmission performance from the UAV to terrestrial mobile users.
arXiv Detail & Related papers (2025-02-09T09:15:47Z)
- Towards Federated Multi-Armed Bandit Learning for Content Dissemination using Swarm of UAVs [2.3076690318595676]
The proposed architecture leverages a hybrid network of stationary anchor UAVs and mobile Micro-UAVs for ubiquitous content dissemination.
The focus is on developing a content dissemination system that dynamically learns optimal caching policies to maximize content availability.
A Selective Caching Algorithm is also introduced to reduce redundant content replication by incorporating inter-UAV information sharing.
arXiv Detail & Related papers (2025-01-15T20:55:13Z)
- Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks [60.085771314013044]
The low-altitude economy holds significant potential for development in areas such as communication and sensing.
We propose a Clustering-based Multi-agent Deep Deterministic Policy Gradient (CMADDPG) algorithm to address the multi-UAV cooperative task scheduling challenges in SAGIN.
arXiv Detail & Related papers (2024-12-14T06:17:33Z)
- Blockchain-enabled Clustered and Scalable Federated Learning (BCS-FL) Framework in UAV Networks [8.278150104847183]
This paper presents the Blockchain-enabled Clustered and Scalable Federated Learning (BCS-FL) framework for UAV networks.
It improves the decentralization, coordination, scalability, and efficiency of FL in large-scale UAV networks.
arXiv Detail & Related papers (2024-02-07T12:26:56Z)
- Multi-Armed Bandit Learning for Content Provisioning in Network of UAVs [2.3076690318595676]
This paper proposes an unmanned aerial vehicle (UAV) aided content management system in communication-challenged disaster scenarios.
Without cellular infrastructure in such scenarios, a community of stranded users can be provided access to situation-critical content using a hybrid network of static and traveling UAVs.
A set of relatively static anchor UAVs can download content from central servers and provide content access to their local users.
A set of ferrying UAVs with wider mobility can provision content to users by shuffling them across different anchor UAVs while visiting different communities of users.
arXiv Detail & Related papers (2023-12-18T15:24:01Z)
- Deep Reinforcement Learning for Combined Coverage and Resource Allocation in UAV-aided RAN-slicing [1.7214664783818676]
This work presents a UAV-assisted 5G network, where the aerial base stations (UAV-BS) are empowered with network slicing capabilities.
A first application of multi-agent and multi-decision deep reinforcement learning for UAV-BS in a network slicing context is introduced.
The performance of the presented strategy has been tested and compared to benchmarks, highlighting a higher percentage of satisfied users (at least 27% more) in a variety of scenarios.
arXiv Detail & Related papers (2022-11-15T06:50:00Z)
- Towards efficient feature sharing in MIMO architectures [102.40140369542755]
Multi-input multi-output architectures propose to train multiple subnetworks within one base network and then average the subnetwork predictions to benefit from ensembling for free.
Despite some relative success, these architectures are wasteful in their use of parameters.
We highlight in this paper that the learned subnetworks fail to share even generic features, which limits their applicability on smaller mobile and AR/VR devices.
arXiv Detail & Related papers (2022-05-20T12:33:34Z)
- 5G Network on Wings: A Deep Reinforcement Learning Approach to the UAV-based Integrated Access and Backhaul [11.197456628712846]
Unmanned aerial vehicle (UAV) based aerial networks offer a promising alternative for fast, flexible, and reliable wireless communications.
In this paper, we study how to control multiple UAV-BSs in both static and dynamic environments.
A deep reinforcement learning algorithm is developed to jointly optimize the three-dimensional placement of these multiple UAV-BSs.
arXiv Detail & Related papers (2022-02-04T07:45:06Z)
- Robust Semi-supervised Federated Learning for Images Automatic Recognition in Internet of Drones [57.468730437381076]
We present a Semi-supervised Federated Learning (SSFL) framework for privacy-preserving UAV image recognition.
There are significant differences in the number, features, and distribution of local data collected by UAVs using different camera modules.
We propose an aggregation rule based on the frequency of the client's participation in training, namely the FedFreq aggregation rule.
arXiv Detail & Related papers (2022-01-03T16:49:33Z)
- Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching [91.50631418179331]
A privacy-preserving distributed deep policy gradient (P2D3PG) is proposed to maximize the cache hit rates of devices in the MEC networks.
We convert the distributed optimizations into model-free Markov decision process problems and then introduce a privacy-preserving federated learning method for popularity prediction.
arXiv Detail & Related papers (2021-10-20T02:48:27Z)
- Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA Networks [87.6031308969681]
This article investigates cache-enabling unmanned aerial vehicle (UAV) cellular networks with massive access capability supported by non-orthogonal multiple access (NOMA).
We formulate the long-term caching placement and resource allocation optimization problem for content delivery delay minimization as a Markov decision process (MDP).
We propose a Q-learning based caching placement and resource allocation algorithm, where the UAV learns and selects actions with a soft $\varepsilon$-greedy strategy to search for the optimal match between actions and states.
arXiv Detail & Related papers (2020-08-12T08:33:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.