An Intelligent Resource Reservation for Crowdsourced Live Video
Streaming Applications in Geo-Distributed Cloud Environment
- URL: http://arxiv.org/abs/2106.02420v1
- Date: Fri, 4 Jun 2021 11:45:09 GMT
- Title: An Intelligent Resource Reservation for Crowdsourced Live Video
Streaming Applications in Geo-Distributed Cloud Environment
- Authors: Emna Baccour, Fatima Haouari, Aiman Erbad, Amr Mohamed, Kashif Bilal,
Mohsen Guizani, Mounir Hamdi
- Abstract summary: We introduce a machine-learning-based predictive resource allocation framework for geo-distributed cloud sites.
First, we present an offline optimization that decides the required resources in distributed regions near the viewers.
Second, we use machine learning to build forecasting models that proactively predict the resources to be reserved at each cloud site ahead of time.
- Score: 45.61165288624505
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Crowdsourced live video streaming (livecast) services such as Facebook Live,
YouNow, Douyu and Twitch have recently been gaining momentum. Allocating the
limited resources in a cost-effective manner while maximizing the Quality of
Service (QoS) through real-time delivery and the provision of the appropriate
representations for all viewers is a challenging problem. In this paper, we
introduce a machine-learning-based predictive resource allocation framework for
geo-distributed cloud sites, considering the delay and quality constraints to
guarantee the maximum QoS for viewers and the minimum cost for content
providers. First, we present an offline optimization that decides the required
transcoding resources in distributed regions near the viewers with a trade-off
between the QoS and the overall cost. Second, we use machine learning to build
forecasting models that proactively predict the approximate transcoding
resources to be reserved at each cloud site ahead of time. Finally, we develop
a Greedy Nearest and Cheapest algorithm (GNCA) to perform the resource
allocation of real-time broadcasted videos on the rented resources. Extensive
simulations show that GNCA outperforms state-of-the-art resource
allocation approaches for crowdsourced live streaming, achieving a gain of more
than 20% in terms of system cost while serving the viewers with relatively
lower latency.
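The abstract names a Greedy Nearest and Cheapest algorithm but does not spell out its steps. As a rough illustration only, the following Python sketch shows one way a greedy "nearest first, cheapest on ties" assignment of transcoding demand to rented cloud sites could look. It is not the authors' GNCA: the `Site` fields, the delay bound, the tie-breaking rule, and all numbers are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Site:
    # All fields are hypothetical; the abstract does not define a site model.
    name: str
    delay_ms: float       # assumed delay between the viewer region and the site
    cost_per_unit: float  # assumed rental price per transcoding unit
    free_units: int       # reserved capacity that is still unused


def greedy_nearest_cheapest(demand_units, sites, max_delay_ms):
    """Assign demand to sites within the delay bound.

    Sites are visited nearest first; equal delays are broken by lower cost.
    Returns (plan, unserved), where plan maps site name to assigned units.
    """
    eligible = [s for s in sites
                if s.delay_ms <= max_delay_ms and s.free_units > 0]
    eligible.sort(key=lambda s: (s.delay_ms, s.cost_per_unit))

    plan = {}
    remaining = demand_units
    for site in eligible:
        if remaining == 0:
            break
        take = min(remaining, site.free_units)
        site.free_units -= take  # consume reserved capacity at this site
        plan[site.name] = take
        remaining -= take
    return plan, remaining


if __name__ == "__main__":
    sites = [
        Site("eu-west", delay_ms=20, cost_per_unit=0.5, free_units=2),
        Site("eu-central", delay_ms=20, cost_per_unit=0.3, free_units=1),
        Site("us-east", delay_ms=90, cost_per_unit=0.2, free_units=10),
    ]
    plan, unserved = greedy_nearest_cheapest(4, sites, max_delay_ms=50)
    print(plan, unserved)
```

In this toy setup the distant site is excluded by the delay bound even though it is the cheapest, which reflects the QoS-versus-cost trade-off the abstract describes; any demand that cannot be placed is reported as unserved rather than silently dropped.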
Related papers
- Topology-aware Preemptive Scheduling for Co-located LLM Workloads [7.240168647854797]
We develop a fine-grained topology-aware method for scheduling of hybrid workloads.
This method significantly increases the efficiency of preemption and improves overall scheduled performance for LLM workloads by 55%.
arXiv Detail & Related papers (2024-11-18T13:26:09Z) - Double Deep Q-Learning-based Path Selection and Service Placement for
Latency-Sensitive Beyond 5G Applications [11.864695986880347]
This paper studies the joint problem of communication and computing resource allocation, dubbed CCRA, to minimize total cost.
We formulate the problem as a non-linear programming model and propose two approaches, dubbed B&B-CCRA and WF-CCRA, based on the Branch & Bound and Water-Filling algorithms.
Numerical simulations show that B&B-CCRA optimally solves the problem, whereas WF-CCRA delivers near-optimal solutions in a substantially shorter time.
arXiv Detail & Related papers (2023-09-18T22:17:23Z) - Adaptive Resource Allocation for Virtualized Base Stations in O-RAN with
Online Learning [60.17407932691429]
Open Radio Access Network systems, with their virtualized base stations (vBSs), offer operators the benefits of increased flexibility, reduced costs, vendor diversity, and interoperability.
We propose an online learning algorithm that balances the effective throughput and vBS energy consumption, even under unforeseeable and "challenging" environments.
We prove the proposed solutions achieve sub-linear regret, providing zero average optimality gap even in challenging environments.
arXiv Detail & Related papers (2023-09-04T17:30:21Z) - RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud
Environments [7.825552412435501]
We propose a novel framework for fast, fully-online resource allocation policy learning in dynamic operating environments.
We show that our framework can learn stable resource allocation policies in minutes, compared with hours for the prior state of the art.
arXiv Detail & Related papers (2023-04-10T18:04:39Z) - A Bandit Approach to Online Pricing for Heterogeneous Edge Resource
Allocation [8.089950414444115]
Two novel online pricing mechanisms are proposed for heterogeneous edge resource allocation.
The mechanisms operate in real-time and do not require prior knowledge of demand distribution.
The proposed posted pricing schemes allow users to select and pay for their preferred resources, with the platform dynamically adjusting resource prices based on observed historical data.
arXiv Detail & Related papers (2023-02-14T10:21:14Z) - Coordinated Online Learning for Multi-Agent Systems with Coupled
Constraints and Perturbed Utility Observations [91.02019381927236]
We introduce a novel method to steer the agents toward a stable population state, fulfilling the given resource constraints.
The proposed method is a decentralized resource pricing method based on the resource loads resulting from the augmentation of the game's Lagrangian.
arXiv Detail & Related papers (2020-10-21T10:11:17Z) - A Predictive Autoscaler for Elastic Batch Jobs [8.354712625979776]
Large batch jobs such as Deep Learning, HPC and Spark require far more computational resources and incur higher costs than conventional online services.
We propose a predictive autoscaler to provide an elastic interface for the customers and overprovision instances.
arXiv Detail & Related papers (2020-10-10T17:35:55Z) - Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep
Learning [61.29990368322931]
Pollux improves scheduling performance in deep learning (DL) clusters by adaptively co-optimizing inter-dependent factors.
Pollux reduces average job completion times by 37-50% relative to state-of-the-art DL schedulers.
arXiv Detail & Related papers (2020-08-27T16:56:48Z) - A Non-Stationary Bandit-Learning Approach to Energy-Efficient
Femto-Caching with Rateless-Coded Transmission [98.47527781626161]
We study a resource allocation problem for joint caching and transmission in small cell networks.
We then formulate the problem as selecting a file from the cache together with a transmission power level for every broadcast round.
In contrast to the state-of-the-art research, the proposed approach is especially suitable for networks with time-variant statistical properties.
arXiv Detail & Related papers (2020-04-13T09:07:17Z) - Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video
Streaming over HTTP [89.30855958779425]
Dynamic Adaptive Streaming over HTTP (DASH) has proven to be an emerging and promising multimedia streaming technique.
We propose a novel algorithm to optimally allocate the limited export bandwidth of the server among multiple users to maximize their Quality of Experience (QoE) with fairness guaranteed.
arXiv Detail & Related papers (2019-12-27T01:19:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.