Real-world Video Adaptation with Reinforcement Learning
- URL: http://arxiv.org/abs/2008.12858v1
- Date: Fri, 28 Aug 2020 21:44:24 GMT
- Title: Real-world Video Adaptation with Reinforcement Learning
- Authors: Hongzi Mao, Shannon Chen, Drew Dimmery, Shaun Singh, Drew Blaisdell,
Yuandong Tian, Mohammad Alizadeh, Eytan Bakshy
- Abstract summary: Client-side video players employ adaptive (ABR) algorithms to optimize user quality of experience (QoE)
We evaluate recently proposed RL-based ABR methods in Facebook's web-based video streaming platform.
In a week-long worldwide deployment with more than 30 million video streaming sessions, our RL approach outperforms the existing human-engineered ABR algorithms.
- Score: 38.26695924173461
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Client-side video players employ adaptive bitrate (ABR) algorithms to
optimize user quality of experience (QoE). We evaluate recently proposed
RL-based ABR methods in Facebook's web-based video streaming platform.
Real-world ABR contains several challenges that requires customized designs
beyond off-the-shelf RL algorithms -- we implement a scalable neural network
architecture that supports videos with arbitrary bitrate encodings; we design a
training method to cope with the variance resulting from the stochasticity in
network conditions; and we leverage constrained Bayesian optimization for
reward shaping in order to optimize the conflicting QoE objectives. In a
week-long worldwide deployment with more than 30 million video streaming
sessions, our RL approach outperforms the existing human-engineered ABR
algorithms.
Related papers
- RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression [68.31184784672227]
In modern applications such as autonomous driving, an overwhelming majority of videos serve as input for AI systems performing tasks.
It is therefore useful to optimize the encoder for a downstream task instead of for image quality.
Here, we address this challenge by controlling the Quantization Parameters (QPs) at the macro-block level to optimize the downstream task.
arXiv Detail & Related papers (2025-01-21T15:36:08Z) - Reinforcement Learning -based Adaptation and Scheduling Methods for
Multi-source DASH [1.1971219484941955]
Dynamic adaptive streaming over HTTP (DASH) has been widely used in video streaming recently.
In multi-source streaming, video chunks may arrive out of order due to different conditions of the network paths.
This paper proposes two algorithms for streaming from multiple sources: RL-based adaptation with greedy scheduling (RLAGS) and RL-based adaptation and scheduling (RLAS)
arXiv Detail & Related papers (2023-07-25T06:47:12Z) - An Adaptive Device-Edge Co-Inference Framework Based on Soft
Actor-Critic [72.35307086274912]
High-dimension parameter model and large-scale mathematical calculation restrict execution efficiency, especially for Internet of Things (IoT) devices.
We propose a new Deep Reinforcement Learning (DRL)-Soft Actor Critic for discrete (SAC-d), which generates the emphexit point, emphexit point, and emphcompressing bits by soft policy iterations.
Based on the latency and accuracy aware reward design, such an computation can well adapt to the complex environment like dynamic wireless channel and arbitrary processing, and is capable of supporting the 5G URL
arXiv Detail & Related papers (2022-01-09T09:31:50Z) - Cross Layer Optimization and Distributed Reinforcement Learning for Wireless 360° Video Streaming [54.60967639512643]
We propose a cross layer optimization approach that maximizes the available rate to each user and efficiently uses it to maximize users' QoE.
We show that the problem can be decoupled into two interrelated subproblems.
An actor-critic deep reinforcement learning (DRL) is proposed to leverage the parallel training of multiple independent agents and solve the application layer subproblem.
arXiv Detail & Related papers (2020-11-12T12:59:10Z) - NANCY: Neural Adaptive Network Coding methodologY for video distribution
over wireless networks [1.636104578028594]
NANCY is a system that generates adaptive bit rates (ABR) for video and adaptive network coding rates (ANCR)
NANCY trains a neural network model with rewards formulated as quality of experience (QoE) metrics.
Our results show that NANCY provides 29.91% and 60.34% higher average QoE than Pensieve and robustMPC, respectively.
arXiv Detail & Related papers (2020-08-21T15:55:32Z) - Meta-Reinforcement Learning for Trajectory Design in Wireless UAV
Networks [151.65541208130995]
A drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable.
In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests.
A meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments.
arXiv Detail & Related papers (2020-05-25T20:43:59Z) - AOWS: Adaptive and optimal network width search with latency constraints [30.39613826468697]
We introduce a novel efficient one-shot NAS approach to optimally search for channel numbers.
Experiments on ImageNet classification show that our approach can find networks fitting the resource constraints on different target platforms.
arXiv Detail & Related papers (2020-05-21T06:46:16Z) - Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video
Streaming over HTTP [89.30855958779425]
Dynamic Adaptive Streaming over HTTP (DASH) has demonstrated to be an emerging and promising multimedia streaming technique.
We propose a novel algorithm to optimally allocate the limited export bandwidth of the server to multi-users to maximize their Quality of Experience (QoE) with fairness guaranteed.
arXiv Detail & Related papers (2019-12-27T01:19:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.