Related papers: Competitive Control

Competitive Control

URL: http://arxiv.org/abs/2107.13657v2
Date: Fri, 30 Jul 2021 03:06:03 GMT
Title: Competitive Control
Authors: Gautam Goel and Babak Hassibi
Abstract summary: We focus on designing an online controller which competes against a clairvoyant offline optimal controller. A natural performance metric in this setting is competitive ratio, which is the ratio between the cost incurred by the online controller and the cost incurred by the offline optimal controller.
Score: 52.28457815067461
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We consider control from the perspective of competitive analysis. Unlike much prior work on learning-based control, which focuses on minimizing regret against the best controller selected in hindsight from some specific class, we focus on designing an online controller which competes against a clairvoyant offline optimal controller. A natural performance metric in this setting is competitive ratio, which is the ratio between the cost incurred by the online controller and the cost incurred by the offline optimal controller. Using operator-theoretic techniques from robust control, we derive a computationally efficient state-space description of the the controller with optimal competitive ratio in both finite-horizon and infinite-horizon settings. We extend competitive control to nonlinear systems using Model Predictive Control (MPC) and present numerical experiments which show that our competitive controller can significantly outperform standard $H_2$ and $H_{\infty}$ controllers in the MPC setting.

Related papers

Which price to pay? Auto-tuning building MPC controller for optimal economic cost [7.400001848945602]
Model predictive control (MPC) controller is considered for temperature management in buildings. We propose an efficient performance-oriented building MPC controller tuning method based on a cutting-edge efficient constrained Bayesian optimization algorithm. The results indicate that with an optimized simple MPC, the monthly electricity cost of a household can be reduced by up to 26.90% compared with the cost when controlled by a basic rule-based controller.
arXiv Detail & Related papers (2025-01-18T19:52:27Z)
Optimal Competitive-Ratio Control [40.89951305613357]
We show that the optimal competitive ratio formula can be computed as the maximal eigenvalue of a simple matrix. We conduct an extensive numerical study to verify this analytical solution, and demonstrate that the optimal competitive-ratio controller outperforms other controllers on several large scale practical systems.
arXiv Detail & Related papers (2022-06-03T19:01:07Z)
Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function [5.601217969637838]
exploration/exploitation trade-off is an inherent challenge in data-driven and adaptive control. We propose the use of a finitehorizon oracle controller with perfect knowledge of all system parameters as a reference for optimal control actions. We develop learning-based policies that we prove achieve low regret with respect to this oracle finite-horizon controller.
arXiv Detail & Related papers (2021-08-04T22:43:51Z)
Regret-optimal Estimation and Control [52.28457815067461]
We show that the regret-optimal estimator and regret-optimal controller can be derived in state-space form. We propose regret-optimal analogs of Model-Predictive Control (MPC) and the Extended KalmanFilter (EKF) for systems with nonlinear dynamics.
arXiv Detail & Related papers (2021-06-22T23:14:21Z)
Regret-Optimal LQR Control [37.99652162611661]
We find a causal controller that minimizes the worst-case regret over all bounded energy disturbances. We derive explicit formulas for the optimal regret and for the regret-optimal controller for the state-space setting. The regret-optimal controller presents itself as a viable option for control systems design.
arXiv Detail & Related papers (2021-05-04T01:51:00Z)
Stable Online Control of Linear Time-Varying Systems [49.41696101740271]
COCO-LQ is an efficient online control algorithm that guarantees input-to-state stability for a large class of LTV systems. We empirically demonstrate the performance of COCO-LQ in both synthetic experiments and a power system frequency control example.
arXiv Detail & Related papers (2021-04-29T06:18:49Z)
Regret-optimal measurement-feedback control [39.76359052907755]
We consider measurement-feedback control in linear dynamical systems from the perspective of regret. We show that in the measurement-feedback setting, unlike in the full information setting, there is no single offline controller which outperforms every other offline controller on every disturbance. We show that the corresponding regret-optimal online controller can be found via a novel reduction to the classical Nehari problem and present a tight data-dependent bound on its regret.
arXiv Detail & Related papers (2020-11-24T01:36:48Z)
Regret-optimal control in dynamic environments [39.76359052907755]
We focus on the problem of designing an online controller which minimizes regret against the best dynamic sequence of control actions selected in hindsight. We derive the state-space structure of the regret-optimal controller via a novel reduction to $H_infty$ control. We present numerical experiments which show that our regret-optimal controller interpolates between the performance of the $H_infty$-optimal controllers across and adversarial environments.
arXiv Detail & Related papers (2020-10-20T17:32:17Z)
Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion [95.1825179206694]
We present a framework that synthesizes robust controllers for a quadruped robot. A high-level controller learns to choose from a set of primitives in response to changes in the environment. A low-level controller that utilizes an established control method to robustly execute the primitives.
arXiv Detail & Related papers (2020-09-21T16:49:26Z)
Improper Learning for Non-Stochastic Control [78.65807250350755]
We consider the problem of controlling a possibly unknown linear dynamical system with adversarial perturbations, adversarially chosen convex loss functions, and partially observed states. Applying online descent to this parametrization yields a new controller which attains sublinear regret vs. a large class of closed-loop policies. Our bounds are the first in the non-stochastic control setting that compete with emphall stabilizing linear dynamical controllers.
arXiv Detail & Related papers (2020-01-25T02:12:48Z)

This list is automatically generated from the titles and abstracts of the papers in this site.