Automating Staged Rollout with Reinforcement Learning
- URL: http://arxiv.org/abs/2204.02189v1
- Date: Fri, 1 Apr 2022 21:22:39 GMT
- Title: Automating Staged Rollout with Reinforcement Learning
- Authors: Shadow Pritchard, Vidhyashree Nagaraju, Lance Fiondella
- Abstract summary: Staged rollout is a strategy of incrementally releasing software updates to portions of the user population in order to accelerate defect discovery without incurring catastrophic outcomes such as system wide outages.
This paper demonstrates the potential to automate staged rollout with multi-objective reinforcement learning in order to dynamically balance stakeholder needs such as time to deliver new features and downtime incurred by failures due to latent defects.
- Score: 1.3750624267664155
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Staged rollout is a strategy of incrementally releasing software updates to
portions of the user population in order to accelerate defect discovery without
incurring catastrophic outcomes such as system wide outages. Some past studies
have examined how to quantify and automate staged rollout, but stop short of
simultaneously considering multiple product or process metrics explicitly. This
paper demonstrates the potential to automate staged rollout with
multi-objective reinforcement learning in order to dynamically balance
stakeholder needs such as time to deliver new features and downtime incurred by
failures due to latent defects.
Related papers
- CORT: Class-Oriented Real-time Tracking for Embedded Systems [46.3107850275261]
This work proposes a new approach to multi-class object tracking.
It allows achieving smaller and more predictable execution times, without penalizing the tracking performance.
The proposed solution was evaluated in complex urban scenarios with several objects of different types.
arXiv Detail & Related papers (2024-07-20T09:12:17Z) - Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving [10.926767319124547]
We present Apparate, a system that automatically applies and manages early exits in machine learning models.
To cope with the time-varying overhead and accuracy challenges that EEs bring, Apparate repurposes exits to provide continual feedback.
Apparate lowers median response latencies by 40.5--91.5% and 10.0--24.2% for diverse CV and NLP classification workloads.
arXiv Detail & Related papers (2023-12-08T21:49:09Z) - Self-Supervised Multi-Object Tracking For Autonomous Driving From
Consistency Across Timescales [53.55369862746357]
Self-supervised multi-object trackers have tremendous potential as they enable learning from raw domain-specific data.
However, their re-identification accuracy still falls short compared to their supervised counterparts.
We propose a training objective that enables self-supervised learning of re-identification features from multiple sequential frames.
arXiv Detail & Related papers (2023-04-25T20:47:29Z) - Self-Supervised Representation Learning from Temporal Ordering of
Automated Driving Sequences [49.91741677556553]
We propose TempO, a temporal ordering pretext task for pre-training region-level feature representations for perception tasks.
We embed each frame by an unordered set of proposal feature vectors, a representation that is natural for object detection or tracking systems.
Extensive evaluations on the BDD100K, nuImages, and MOT17 datasets show that our TempO pre-training approach outperforms single-frame self-supervised learning methods.
arXiv Detail & Related papers (2023-02-17T18:18:27Z) - A Learning Based Framework for Handling Uncertain Lead Times in
Multi-Product Inventory Management [8.889304968879163]
Most existing literature on supply chain and inventory management consider demand processes with zero or constant lead times.
Motivated by the recently introduced delay-resolved deep Q-learning (DRDQN) algorithm, this paper develops a reinforcement learning based paradigm for handling uncertainty in lead times.
arXiv Detail & Related papers (2022-03-02T05:50:04Z) - Leveraging Time Irreversibility with Order-Contrastive Pre-training [3.1848820580333737]
We explore an "order-contrastive" method for self-supervised pre-training on longitudinal data.
We prove a finite-sample guarantee for the downstream error of a representation learned with order-contrastive pre-training.
Our results indicate that pre-training methods designed for particular classes of distributions and downstream tasks can improve the performance of self-supervised learning.
arXiv Detail & Related papers (2021-11-04T02:56:52Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - Induction and Exploitation of Subgoal Automata for Reinforcement
Learning [75.55324974788475]
We present ISA, an approach for learning and exploiting subgoals in episodic reinforcement learning (RL) tasks.
ISA interleaves reinforcement learning with the induction of a subgoal automaton, an automaton whose edges are labeled by the task's subgoals.
A subgoal automaton also consists of two special states: a state indicating the successful completion of the task, and a state indicating that the task has finished without succeeding.
arXiv Detail & Related papers (2020-09-08T16:42:55Z) - Bayesian active learning for production, a systematic study and a
reusable library [85.32971950095742]
In this paper, we analyse the main drawbacks of current active learning techniques.
We do a systematic study on the effects of the most common issues of real-world datasets on the deep active learning process.
We derive two techniques that can speed up the active learning loop such as partial uncertainty sampling and larger query size.
arXiv Detail & Related papers (2020-06-17T14:51:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.