Artificial Intelligence Assisted Collaborative Edge Caching in Small
Cell Networks
- URL: http://arxiv.org/abs/2005.07941v2
- Date: Wed, 16 Sep 2020 01:38:02 GMT
- Title: Artificial Intelligence Assisted Collaborative Edge Caching in Small
Cell Networks
- Authors: Md Ferdous Pervej, Le Thanh Tan, Rose Qingyang Hu
- Abstract summary: This paper considers heterogeneous content preference of the users with heterogeneous caching models at the edge nodes.
We propose a modified particle swarm optimization (M-PSO) algorithm that efficiently solves the complex constraint problem in a reasonable time.
- Score: 19.605382256630538
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Edge caching is a new paradigm that has been exploited over the past several
years to reduce the load for the core network and to enhance the content
delivery performance. Many existing caching solutions only consider homogeneous
caching placement due to the immense complexity associated with the
heterogeneous caching models. Unlike these legacy modeling paradigms, this
paper considers heterogeneous content preference of the users with
heterogeneous caching models at the edge nodes. Besides, aiming to maximize the
cache hit ratio (CHR) in a two-tier heterogeneous network, we let the edge
nodes collaborate. However, due to complex combinatorial decision variables,
the formulated problem is hard to solve in the polynomial time. Moreover, there
does not even exist a ready-to-use tool or software to solve the problem. We
propose a modified particle swarm optimization (M-PSO) algorithm that
efficiently solves the complex constraint problem in a reasonable time. Using
numerical analysis and simulation, we validate that the proposed algorithm
significantly enhances the CHR performance when comparing to that of the
existing baseline caching schemes.
Related papers
- TrimCaching: Parameter-sharing AI Model Caching in Wireless Edge Networks [36.39118138582416]
Next-generation mobile networks are expected to facilitate fast AI model downloading to end users.
By caching models on edge servers, mobile networks can deliver models to end users with low latency.
We develop a novel model placement scheme, called parameter-sharing model caching (TrimCaching)
arXiv Detail & Related papers (2024-05-07T04:08:49Z) - Smuche: Scalar-Multiplicative Caching in Homomorphic Encryption [1.3824176915623292]
Homomorphic encryption (HE) is used in machine learning systems in untrusted environments.
We introduce a novel textitconstant-time caching technique that is independent of any parameters.
Smuche stands for Scalar-multiplicative Caching of Homomorphic Encryption.
arXiv Detail & Related papers (2023-12-26T23:11:25Z) - Learning Spatio-Temporal Aggregations for Large-Scale Capacity Expansion
Problems [0.0]
Capacity Expansion Problems (CEPs) are expensive to solve due to large network sizes, heterogeneous node characteristics, and a large number of operational periods.
Here, we propose a novel graph convolutional autoencoder approach for identifytemporal aggregation of a generic CEP with heterogeneous nodes.
We show that our approach provides upper bounds that are 33% (resp. 10%) lower those than obtained from benchmark spatial (resp. temporal) aggregation approaches.
arXiv Detail & Related papers (2023-03-16T00:00:52Z) - Decision Diagram-Based Branch-and-Bound with Caching for Dominance and
Suboptimality Detection [9.175779296469194]
This paper presents new ingredients to speed up the search by exploiting the structure of dynamic programming models.
The key idea is to prevent the repeated expansion of nodes corresponding to the same dynamic programming states by querying expansion thresholds cached throughout the search.
Experiments show that the pruning brought by this caching mechanism allows significantly reducing the number of nodes expanded by the algorithm.
arXiv Detail & Related papers (2022-11-22T10:18:33Z) - Symmetric Tensor Networks for Generative Modeling and Constrained
Combinatorial Optimization [72.41480594026815]
Constrained optimization problems abound in industry, from portfolio optimization to logistics.
One of the major roadblocks in solving these problems is the presence of non-trivial hard constraints which limit the valid search space.
In this work, we encode arbitrary integer-valued equality constraints of the form Ax=b, directly into U(1) symmetric networks (TNs) and leverage their applicability as quantum-inspired generative models.
arXiv Detail & Related papers (2022-11-16T18:59:54Z) - Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization [70.4342220499858]
We introduce novel online algorithms that can exploit smoothness and replace the dependence on $T$ in dynamic regret with problem-dependent quantities.
Our results are adaptive to the intrinsic difficulty of the problem, since the bounds are tighter than existing results for easy problems and safeguard the same rate in the worst case.
arXiv Detail & Related papers (2021-12-29T02:42:59Z) - Accelerating Deep Learning Classification with Error-controlled
Approximate-key Caching [72.50506500576746]
We propose a novel caching paradigm, that we named approximate-key caching.
While approximate cache hits alleviate DL inference workload and increase the system throughput, they however introduce an approximation error.
We analytically model our caching system performance for classic LRU and ideal caches, we perform a trace-driven evaluation of the expected performance, and we compare the benefits of our proposed approach with the state-of-the-art similarity caching.
arXiv Detail & Related papers (2021-12-13T13:49:11Z) - Learning from Images: Proactive Caching with Parallel Convolutional
Neural Networks [94.85780721466816]
A novel framework for proactive caching is proposed in this paper.
It combines model-based optimization with data-driven techniques by transforming an optimization problem into a grayscale image.
Numerical results show that the proposed scheme can reduce 71.6% computation time with only 0.8% additional performance cost.
arXiv Detail & Related papers (2021-08-15T21:32:47Z) - Rethinking Differentiable Search for Mixed-Precision Neural Networks [83.55785779504868]
Low-precision networks with weights and activations quantized to low bit-width are widely used to accelerate inference on edge devices.
Current solutions are uniform, using identical bit-width for all filters.
This fails to account for the different sensitivities of different filters and is suboptimal.
Mixed-precision networks address this problem, by tuning the bit-width to individual filter requirements.
arXiv Detail & Related papers (2020-04-13T07:02:23Z) - GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal
Black Box Constraint Satisfaction [69.94831587339539]
We present a modified Cross-Entropy Method (CEM) that uses a masked auto-regressive neural network for modeling uniform distributions over the solution space.
Our algorithm is able to express complicated solution spaces, thus allowing it to track a variety of different solution regions.
arXiv Detail & Related papers (2020-02-17T20:21:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.