Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication
- URL: http://arxiv.org/abs/2510.12265v1
- Date: Tue, 14 Oct 2025 08:18:30 GMT
- Title: Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication
- Authors: Sami Khairy, Gabriel Mittag, Vishak Gopal, Ross Cutler,
- Abstract summary: Bandwidth estimation for real-time communications remains an open challenge.<n>We propose a deployed, human-in-the-loop, data-driven framework for bandwidth estimation to address these challenges.
- Score: 16.82306116067726
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The quality of experience (QoE) delivered by video conferencing systems is significantly influenced by accurately estimating the time-varying available bandwidth between the sender and receiver. Bandwidth estimation for real-time communications remains an open challenge due to rapidly evolving network architectures, increasingly complex protocol stacks, and the difficulty of defining QoE metrics that reliably improve user experience. In this work, we propose a deployed, human-in-the-loop, data-driven framework for bandwidth estimation to address these challenges. Our approach begins with training objective QoE reward models derived from subjective user evaluations to measure audio and video quality in real-time video conferencing systems. Subsequently, we collect roughly $1$M network traces with objective QoE rewards from real-world Microsoft Teams calls to curate a bandwidth estimation training dataset. We then introduce a novel distributional offline reinforcement learning (RL) algorithm to train a neural-network-based bandwidth estimator aimed at improving QoE for users. Our real-world A/B test demonstrates that the proposed approach reduces the subjective poor call ratio by $11.41\%$ compared to the baseline bandwidth estimator. Furthermore, the proposed offline RL algorithm is benchmarked on D4RL tasks to demonstrate its generalization beyond bandwidth estimation.
Related papers
- Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models [59.061552498630874]
We introduce the LIVE-Viasat Real-World Satellite QoE Database.
This database consists of 179 videos recorded from real-world streaming services affected by various authentic distortion patterns.
We demonstrate the usefulness of this unique new resource by evaluating the efficacy of QoE-prediction models on it.
We also created a new model that maps the network parameters to predicted human perception scores, which can be used by ISPs to optimize the video streaming quality of their networks.
arXiv Detail & Related papers (2024-10-17T18:22:50Z) - Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks [9.642593500545997]
We study a multi-task decision-making problem for 360 video processing in a wireless multi-user virtual reality (VR) system.
This comes at the expense of increased data volume and required bandwidth.
We propose a constrained quality of experience (QoE) problem in which the rebuffering time and quality variation between video frames are rely by user and video requirements.
arXiv Detail & Related papers (2024-07-03T18:09:25Z) - Resource-Aware Hierarchical Federated Learning for Video Caching in
Wireless Networks [29.137803674759848]
A privacy-preserving method is desirable to learn how users' demands change over time.
This paper proposes a novel resource-aware hierarchical federated learning (RawHFL) solution to predict users' future content requests.
Our simulation results show that the proposed solution significantly outperforms the considered baselines in terms of prediction accuracy and total energy expenditure.
arXiv Detail & Related papers (2023-11-12T18:23:17Z) - Offline to Online Learning for Real-Time Bandwidth Estimation [18.33604214120801]
Real-time video applications require accurate estimation to maintain user experience across varying network conditions.<n>We present Merlin, an imitation learning-based solution that replaces the manual parameter tuning of bandwidth-based methods with data-driven updates.
arXiv Detail & Related papers (2023-09-23T21:39:51Z) - Low Complexity Adaptive Machine Learning Approaches for End-to-End
Latency Prediction [0.0]
This work is the design of efficient, low-cost adaptive algorithms for estimation, monitoring and prediction.
We focus on end-to-end latency prediction, for which we illustrate our approaches and results on data obtained from a public generator provided after the recent international challenge on GNN.
arXiv Detail & Related papers (2023-01-31T10:29:11Z) - Neighbourhood Representative Sampling for Efficient End-to-end Video
Quality Assessment [60.57703721744873]
The increased resolution of real-world videos presents a dilemma between efficiency and accuracy for deep Video Quality Assessment (VQA)
In this work, we propose a unified scheme, spatial-temporal grid mini-cube sampling (St-GMS) to get a novel type of sample, named fragments.
With fragments and FANet, the proposed efficient end-to-end FAST-VQA and FasterVQA achieve significantly better performance than existing approaches on all VQA benchmarks.
arXiv Detail & Related papers (2022-10-11T11:38:07Z) - Low-Latency Federated Learning over Wireless Channels with Differential
Privacy [142.5983499872664]
In federated learning (FL), model training is distributed over clients and local models are aggregated by a central server.
In this paper, we aim to minimize FL training delay over wireless channels, constrained by overall training performance as well as each client's differential privacy (DP) requirement.
arXiv Detail & Related papers (2021-06-20T13:51:18Z) - A Deep Value-network Based Approach for Multi-Driver Order Dispatching [55.36656442934531]
We propose a deep reinforcement learning based solution for order dispatching.
We conduct large scale online A/B tests on DiDi's ride-dispatching platform.
Results show that CVNet consistently outperforms other recently proposed dispatching methods.
arXiv Detail & Related papers (2021-06-08T16:27:04Z) - Feeling of Presence Maximization: mmWave-Enabled Virtual Reality Meets
Deep Reinforcement Learning [76.46530937296066]
This paper investigates the problem of providing ultra-reliable and energy-efficient virtual reality (VR) experiences for wireless mobile users.
To ensure reliable ultra-high-definition (UHD) video frame delivery to mobile users, a coordinated multipoint (CoMP) transmission technique and millimeter wave (mmWave) communications are exploited.
arXiv Detail & Related papers (2021-06-03T08:35:10Z) - Real-world Video Adaptation with Reinforcement Learning [38.26695924173461]
Client-side video players employ adaptive (ABR) algorithms to optimize user quality of experience (QoE)
We evaluate recently proposed RL-based ABR methods in Facebook's web-based video streaming platform.
In a week-long worldwide deployment with more than 30 million video streaming sessions, our RL approach outperforms the existing human-engineered ABR algorithms.
arXiv Detail & Related papers (2020-08-28T21:44:24Z) - Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video
Streaming over HTTP [89.30855958779425]
Dynamic Adaptive Streaming over HTTP (DASH) has demonstrated to be an emerging and promising multimedia streaming technique.
We propose a novel algorithm to optimally allocate the limited export bandwidth of the server to multi-users to maximize their Quality of Experience (QoE) with fairness guaranteed.
arXiv Detail & Related papers (2019-12-27T01:19:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.