Low Complexity Approaches for End-to-End Latency Prediction
- URL: http://arxiv.org/abs/2302.00004v1
- Date: Tue, 31 Jan 2023 10:31:41 GMT
- Title: Low Complexity Approaches for End-to-End Latency Prediction
- Authors: Pierre Larrenie (LIGM), Jean-Fran\c{c}ois Bercher (LIGM), Olivier
Venard (ESYCOM), Iyad Lahsen-Cherif (INPT)
- Abstract summary: We focus on end-to-end latency prediction, for which we illustrate our approaches and results on a public dataset from the recent international challenge on GNN.
We propose several low complexity, locally implementable approaches, achieving significantly lower wall time both for training and inference, with marginally worse prediction accuracy compared to state-of-the-art global GNN solutions.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Software Defined Networks have opened the door to statistical and AI-based
techniques to improve efficiency of networking. Especially to ensure a certain
Quality of Service (QoS) for specific applications by routing packets with
awareness on content nature (VoIP, video, files, etc.) and its needs (latency,
bandwidth, etc.) to use efficiently resources of a network. Predicting various
Key Performance Indicators (KPIs) at any level may handle such problems while
preserving network bandwidth. The question addressed in this work is the design
of efficient and low-cost algorithms for KPI prediction, implementable at the
local level. We focus on end-to-end latency prediction, for which we illustrate
our approaches and results on a public dataset from the recent international
challenge on GNN [1]. We propose several low complexity, locally implementable
approaches, achieving significantly lower wall time both for training and
inference, with marginally worse prediction accuracy compared to
state-of-the-art global GNN solutions.
Related papers
- Decentralized Optimization in Time-Varying Networks with Arbitrary Delays [22.40154714677385]
We consider a decentralized optimization problem for networks affected by communication delays.
Examples of such networks include collaborative machine learning, sensor networks, and multi-agent systems.
To mimic communication delays, we add virtual non-computing nodes to the network, resulting in directed graphs.
arXiv Detail & Related papers (2024-05-29T20:51:38Z) - Optimized Generation of Entanglement by Real-Time Ordering of Swapping Operations [4.152766500389854]
Long-distance quantum communication in quantum networks faces significant challenges due to the constraints imposed by the no-cloning theorem.
Most existing quantum communication protocols rely on the a priori distribution of entanglement pairs (EPs)
We present a greedy algorithm that iteratively determines the best route and/or entanglement-swapping operation to perform at each stage based on the current network.
arXiv Detail & Related papers (2024-05-13T06:43:11Z) - Low Complexity Adaptive Machine Learning Approaches for End-to-End
Latency Prediction [0.0]
This work is the design of efficient, low-cost adaptive algorithms for estimation, monitoring and prediction.
We focus on end-to-end latency prediction, for which we illustrate our approaches and results on data obtained from a public generator provided after the recent international challenge on GNN.
arXiv Detail & Related papers (2023-01-31T10:29:11Z) - Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via
Deep Reinforcement Learning [10.223526707269537]
Collaboration among industrial Internet of Things (IoT) devices and edge networks is essential to support computation-intensive deep neural network (DNN) inference services.
In this paper, we investigate the collaborative inference problem in industrial IoT networks.
arXiv Detail & Related papers (2022-12-31T05:53:17Z) - Robustification of Online Graph Exploration Methods [59.50307752165016]
We study a learning-augmented variant of the classical, notoriously hard online graph exploration problem.
We propose an algorithm that naturally integrates predictions into the well-known Nearest Neighbor (NN) algorithm.
arXiv Detail & Related papers (2021-12-10T10:02:31Z) - Adaptive Inference through Early-Exit Networks: Design, Challenges and
Directions [80.78077900288868]
We decompose the design methodology of early-exit networks to its key components and survey the recent advances in each one of them.
We position early-exiting against other efficient inference solutions and provide our insights on the current challenges and most promising future directions for research in the field.
arXiv Detail & Related papers (2021-06-09T12:33:02Z) - Multi-Exit Semantic Segmentation Networks [78.44441236864057]
We propose a framework for converting state-of-the-art segmentation models to MESS networks.
specially trained CNNs that employ parametrised early exits along their depth to save during inference on easier samples.
We co-optimise the number, placement and architecture of the attached segmentation heads, along with the exit policy, to adapt to the device capabilities and application-specific requirements.
arXiv Detail & Related papers (2021-06-07T11:37:03Z) - Better than the Best: Gradient-based Improper Reinforcement Learning for
Network Scheduling [60.48359567964899]
We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay.
We use a policy gradient based reinforcement learning algorithm that produces a scheduler that performs better than the available atomic policies.
arXiv Detail & Related papers (2021-05-01T10:18:34Z) - MS-RANAS: Multi-Scale Resource-Aware Neural Architecture Search [94.80212602202518]
We propose Multi-Scale Resource-Aware Neural Architecture Search (MS-RANAS)
We employ a one-shot architecture search approach in order to obtain a reduced search cost.
We achieve state-of-the-art results in terms of accuracy-speed trade-off.
arXiv Detail & Related papers (2020-09-29T11:56:01Z) - AOWS: Adaptive and optimal network width search with latency constraints [30.39613826468697]
We introduce a novel efficient one-shot NAS approach to optimally search for channel numbers.
Experiments on ImageNet classification show that our approach can find networks fitting the resource constraints on different target platforms.
arXiv Detail & Related papers (2020-05-21T06:46:16Z) - Resolution Adaptive Networks for Efficient Inference [53.04907454606711]
We propose a novel Resolution Adaptive Network (RANet), which is inspired by the intuition that low-resolution representations are sufficient for classifying "easy" inputs.
In RANet, the input images are first routed to a lightweight sub-network that efficiently extracts low-resolution representations.
High-resolution paths in the network maintain the capability to recognize the "hard" samples.
arXiv Detail & Related papers (2020-03-16T16:54:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.