Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge
Caching
- URL: http://arxiv.org/abs/2110.10349v1
- Date: Wed, 20 Oct 2021 02:48:27 GMT
- Title: Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge
Caching
- Authors: Shengheng Liu, Chong Zheng, Yongming Huang, Tony Q. S. Quek
- Abstract summary: A privacy-preserving distributed deep policy gradient (P2D3PG) is proposed to maximize the cache hit rates of devices in the MEC networks.
We convert the distributed optimizations into model-free Markov decision process problems and then introduce a privacy-preserving federated learning method for popularity prediction.
- Score: 91.50631418179331
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Mobile edge computing (MEC) is a prominent computing paradigm which expands
the application fields of wireless communication. Due to the limitation of the
capacities of user equipments and MEC servers, edge caching (EC) optimization
is crucial to the effective utilization of the caching resources in MEC-enabled
wireless networks. However, the dynamics and complexities of content
popularities over space and time as well as the privacy preservation of users
pose significant challenges to EC optimization. In this paper, a
privacy-preserving distributed deep deterministic policy gradient (P2D3PG)
algorithm is proposed to maximize the cache hit rates of devices in the MEC
networks. Specifically, we consider the fact that content popularities are
dynamic, complicated and unobservable, and formulate the maximization of cache
hit rates on devices as distributed problems under the constraints of privacy
preservation. In particular, we convert the distributed optimizations into
distributed model-free Markov decision process problems and then introduce a
privacy-preserving federated learning method for popularity prediction.
Subsequently, a P2D3PG algorithm is developed based on distributed
reinforcement learning to solve the distributed problems. Simulation results
demonstrate the superiority of the proposed approach in improving EC hit rate
over the baseline methods while preserving user privacy.
Related papers
- Linear-Time User-Level DP-SCO via Robust Statistics [55.350093142673316]
User-level differentially private convex optimization (DP-SCO) has garnered significant attention due to the importance of safeguarding user privacy in machine learning applications.
Current methods, such as those based on differentially private gradient descent (DP-SGD), often struggle with high noise accumulation and suboptimal utility.
We introduce a novel linear-time algorithm that leverages robust statistics, specifically the median and trimmed mean, to overcome these challenges.
arXiv Detail & Related papers (2025-02-13T02:05:45Z) - Graph Federated Learning Based Proactive Content Caching in Edge Computing [5.492113449220096]
This paper proposes a Graph Federated Learning-based Proactive Content Caching scheme that enhances caching efficiency while preserving user privacy.
The proposed approach integrates federated learning and graph neural networks, enabling users to locally train Light Graph Convolutional Networks (LightGCN) to capture user-item relationships and predict content popularity.
arXiv Detail & Related papers (2025-02-07T08:48:06Z) - Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding [3.9287497907611875]
Mobile edge computing (MEC) has empowered mobile devices (MDs) in supporting artificial intelligence (AI) applications.
Despite the great promise of device-edge cooperative AI inference, data privacy becomes an increasing concern.
We develop a privacy-aware multi-device cooperative edge inference system for classification tasks.
arXiv Detail & Related papers (2024-12-30T16:37:17Z) - Digital Twin-Assisted Data-Driven Optimization for Reliable Edge Caching in Wireless Networks [60.54852710216738]
We introduce a novel digital twin-assisted optimization framework, called D-REC, to ensure reliable caching in nextG wireless networks.
By incorporating reliability modules into a constrained decision process, D-REC can adaptively adjust actions, rewards, and states to comply with advantageous constraints.
arXiv Detail & Related papers (2024-06-29T02:40:28Z) - Differentially Private Deep Q-Learning for Pattern Privacy Preservation
in MEC Offloading [76.0572817182483]
attackers may eavesdrop on the offloading decisions to infer the edge server's (ES's) queue information and users' usage patterns.
We propose an offloading strategy which jointly minimizes the latency, ES's energy consumption, and task dropping rate, while preserving pattern privacy (PP)
We develop a Differential Privacy Deep Q-learning based Offloading (DP-DQO) algorithm to solve this problem while addressing the PP issue by injecting noise into the generated offloading decisions.
arXiv Detail & Related papers (2023-02-09T12:50:18Z) - Unsupervised Recurrent Federated Learning for Edge Popularity Prediction
in Privacy-Preserving Mobile Edge Computing Networks [31.871608633577047]
We propose an unsupervised and privacy-preserving popularity prediction framework for MEC-enabled IIoT.
The proposed framework can enhance the prediction accuracy in terms of a reduced root-mean-squared error by up to $60.5%-68.7%$.
arXiv Detail & Related papers (2022-07-02T06:50:54Z) - Decentralized Stochastic Optimization with Inherent Privacy Protection [103.62463469366557]
Decentralized optimization is the basic building block of modern collaborative machine learning, distributed estimation and control, and large-scale sensing.
Since involved data, privacy protection has become an increasingly pressing need in the implementation of decentralized optimization algorithms.
arXiv Detail & Related papers (2022-05-08T14:38:23Z) - Local Differential Privacy for Bayesian Optimization [12.05395706770007]
We consider a black-box optimization in the nonparametric Gaussian process setting with local differential privacy (LDP) guarantee.
Specifically, the rewards from each user are further corrupted to protect privacy and the learner only has access to the corrupted rewards to minimize the regret.
We present three almost optimal algorithms based on the GP-UCB framework and Laplace DP mechanism.
arXiv Detail & Related papers (2020-10-13T21:50:09Z) - A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration
Framework [56.57225686288006]
Weight pruning of deep neural networks (DNNs) has been proposed to satisfy the limited storage and computing capability of mobile edge devices.
Previous pruning methods mainly focus on reducing the model size and/or improving performance without considering the privacy of user data.
We propose a privacy-preserving-oriented pruning and mobile acceleration framework that does not require the private training dataset.
arXiv Detail & Related papers (2020-03-13T23:52:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.