VABO: Violation-Aware Bayesian Optimization for Closed-Loop Control
Performance Optimization with Unmodeled Constraints
- URL: http://arxiv.org/abs/2110.07479v1
- Date: Thu, 14 Oct 2021 15:51:03 GMT
- Title: VABO: Violation-Aware Bayesian Optimization for Closed-Loop Control
Performance Optimization with Unmodeled Constraints
- Authors: Wenjie Xu, Colin N Jones, Bratislav Svetozarevic, Christopher R.
Laughman, Ankush Chakrabarty
- Abstract summary: We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics.
We propose a violation-aware BO algorithm (VABO) that optimize closed-loop performance while simultaneously learning constraint-feasible solutions.
We demonstrate the effectiveness of our proposed VABO method for energy minimization of industrial vapor compression systems.
- Score: 2.205124036454768
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the problem of performance optimization of closed-loop control
systems with unmodeled dynamics. Bayesian optimization (BO) has been
demonstrated effective for improving closed-loop performance by automatically
tuning controller gains or reference setpoints in a model-free manner. However,
BO methods have rarely been tested on dynamical systems with unmodeled
constraints. In this paper, we propose a violation-aware BO algorithm (VABO)
that optimizes closed-loop performance while simultaneously learning
constraint-feasible solutions. Unlike classical constrained BO methods which
allow an unlimited constraint violations, or safe BO algorithms that are
conservative and try to operate with near-zero violations, we allow budgeted
constraint violations to improve constraint learning and accelerate
optimization. We demonstrate the effectiveness of our proposed VABO method for
energy minimization of industrial vapor compression systems.
Related papers
- Sample-efficient Bayesian Optimisation Using Known Invariances [56.34916328814857]
We show that vanilla and constrained BO algorithms are inefficient when optimising invariant objectives.
We derive a bound on the maximum information gain of these invariant kernels.
We use our method to design a current drive system for a nuclear fusion reactor, finding a high-performance solution.
arXiv Detail & Related papers (2024-10-22T12:51:46Z) - Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution [51.83951489847344]
In robotics applications, smooth control signals are commonly preferred to reduce system wear and energy efficiency.
In this work, we aim to bridge this performance gap by growing discrete action spaces from coarse to fine control resolution.
Our work indicates that an adaptive control resolution in combination with value decomposition yields simple critic-only algorithms that yield surprisingly strong performance on continuous control tasks.
arXiv Detail & Related papers (2024-04-05T17:58:37Z) - Model-based Causal Bayesian Optimization [74.78486244786083]
We introduce the first algorithm for Causal Bayesian Optimization with Multiplicative Weights (CBO-MW)
We derive regret bounds for CBO-MW that naturally depend on graph-related quantities.
Our experiments include a realistic demonstration of how CBO-MW can be used to learn users' demand patterns in a shared mobility system.
arXiv Detail & Related papers (2023-07-31T13:02:36Z) - Primal-Dual Contextual Bayesian Optimization for Control System Online
Optimization with Time-Average Constraints [21.38692458445459]
This paper studies the problem of online performance optimization of constrained closed-loop control systems.
A primal-dual contextual Bayesian optimization algorithm is proposed that achieves sublinear cumulative regret with respect to the dynamic optimal solution.
arXiv Detail & Related papers (2023-04-12T18:37:52Z) - Violation-Aware Contextual Bayesian Optimization for Controller
Performance Optimization with Unmodeled Constraints [1.8730951928453339]
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics.
We propose a violation-aware contextual BO algorithm (VACBO) that optimize closed-loop performance while simultaneously learning constraint-feasible solutions.
We demonstrate the effectiveness of our proposed VACBO method for energy minimization of industrial vapor compression systems under time-varying ambient temperature and humidity.
arXiv Detail & Related papers (2023-01-28T05:48:40Z) - CONFIG: Constrained Efficient Global Optimization for Closed-Loop
Control System Optimization with Unmodeled Constraints [11.523746174066702]
The OPT algorithm is applied to optimize the closed-loop control performance of an unknown system with unmodeled constraints.
Results show that our algorithm can achieve performance competitive with the popular CEI (Constrained Expected Improvement) algorithm, which has no known optimality guarantee.
arXiv Detail & Related papers (2022-11-21T19:44:00Z) - Model-based Causal Bayesian Optimization [78.120734120667]
We propose model-based causal Bayesian optimization (MCBO)
MCBO learns a full system model instead of only modeling intervention-reward pairs.
Unlike in standard Bayesian optimization, our acquisition function cannot be evaluated in closed form.
arXiv Detail & Related papers (2022-11-18T14:28:21Z) - Tuning Particle Accelerators with Safety Constraints using Bayesian
Optimization [73.94660141019764]
tuning machine parameters of particle accelerators is a repetitive and time-consuming task.
We propose and evaluate a step size-limited variant of safe Bayesian optimization.
arXiv Detail & Related papers (2022-03-26T02:21:03Z) - Safe and Efficient Model-free Adaptive Control via Bayesian Optimization [39.962395119933596]
We propose a purely data-driven, model-free approach for adaptive control.
tuning low-level controllers based solely on system data raises concerns on the underlying algorithm safety and computational performance.
We numerically demonstrate for several types of disturbances that our approach is sample efficient, outperforms constrained Bayesian optimization in terms of safety, and achieves the performance optima computed by grid evaluation.
arXiv Detail & Related papers (2021-01-19T19:15:00Z) - Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot
Locomotion [78.46388769788405]
We introduce guided constrained policy optimization (GCPO), an RL framework based upon our implementation of constrained policy optimization (CPPO)
We show that guided constrained RL offers faster convergence close to the desired optimum resulting in an optimal, yet physically feasible, robotic control behavior without the need for precise reward function tuning.
arXiv Detail & Related papers (2020-02-22T10:15:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.