Controlling Rayleigh-B\'enard convection via Reinforcement Learning
- URL: http://arxiv.org/abs/2003.14358v1
- Date: Tue, 31 Mar 2020 16:39:25 GMT
- Title: Controlling Rayleigh-B\'enard convection via Reinforcement Learning
- Authors: Gerben Beintema, Alessandro Corbetta, Luca Biferale, Federico Toschi
- Abstract summary: The identification of effective control strategies to suppress or enhance the convective heat exchange under fixed external thermal gradients is an outstanding fundamental and technological issue.
In this work, we explore a novel approach, based on a state-of-the-art Reinforcement Learning (RL) algorithm.
We show that our RL-based control is able to stabilize the conductive regime and bring the onset of convection up to a Rayleigh number.
- Score: 62.997667081978825
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Thermal convection is ubiquitous in nature as well as in many industrial
applications. The identification of effective control strategies to, e.g.,
suppress or enhance the convective heat exchange under fixed external thermal
gradients is an outstanding fundamental and technological issue. In this work,
we explore a novel approach, based on a state-of-the-art Reinforcement Learning
(RL) algorithm, which is capable of significantly reducing the heat transport
in a two-dimensional Rayleigh-B\'enard system by applying small temperature
fluctuations to the lower boundary of the system. By using numerical
simulations, we show that our RL-based control is able to stabilize the
conductive regime and bring the onset of convection up to a Rayleigh number
$Ra_c \approx 3 \cdot 10^4$, whereas in the uncontrolled case it holds
$Ra_{c}=1708$. Additionally, for $Ra > 3 \cdot 10^4$, our approach outperforms
other state-of-the-art control algorithms reducing the heat flux by a factor of
about $2.5$. In the last part of the manuscript, we address theoretical limits
connected to controlling an unstable and chaotic dynamics as the one considered
here. We show that controllability is hindered by observability and/or
capabilities of actuating actions, which can be quantified in terms of
characteristic time delays. When these delays become comparable with the
Lyapunov time of the system, control becomes impossible.
Related papers
- Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning [49.48615590763914]
We propose a black-box attack algorithm named LCBT, which uses the Monte Carlo tree search method for efficient action searching and manipulation.
We conduct our proposed attack methods on three aggressive algorithms: DDPG, PPO, and TD3 in continuous settings, which show a promising attack performance.
arXiv Detail & Related papers (2024-11-20T08:20:29Z) - Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-BĂ©nard convection [0.7864304771129751]
Multi-agent RL (MARL) has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance.
We present for the first time, an implementation of MARL-based control of three-dimensional Rayleigh-B'enard convection.
arXiv Detail & Related papers (2024-07-31T12:41:20Z) - Dicke superradiant enhancement of the heat current in circuit QED [0.0]
Collective effects, such as Dicke superradiant emission, can enhance the performance of a quantum device.
We study the heat current flowing between a cold and a hot bath through an ensemble of $N$ qubits, which are collectively coupled to the thermal baths.
arXiv Detail & Related papers (2024-01-30T22:06:37Z) - Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks [4.1860949813005375]
This paper focuses on using a demand response (DR) algorithm to limit the energy consumption of a residential building's heating system.
One such RL method is Monte Carlo Tree Search (MCTS), which has achieved impressive success in playing board games (go, chess)
arXiv Detail & Related papers (2023-12-06T09:06:14Z) - Genetically-inspired convective heat transfer enhancement in a turbulent
boundary layer [0.0]
The convective heat transfer in a turbulent boundary layer (TBL) on a flat plate is enhanced using an artificial intelligence approach.
The actuator is a set of six slot jets in crossflow aligned with the freestream.
The control laws are optimised with respect to the unperturbed TBL and to the actuation with a steady jet.
arXiv Detail & Related papers (2023-04-25T07:28:32Z) - Direct data-driven forecast of local turbulent heat flux in
Rayleigh-B\'{e}nard convection [0.0]
Two-dimensional turbulent Rayleigh-B'enard convection flow at Prandtl number $rm Pr=7$ and Rayleigh number $rm Ra=107$.
Two recurrent neural networks are applied for the temporal advancement of flow data in the reduced latent data space.
Convolutional autoencoder with 12 hidden layers is able to reduce the dimensionality of the turbulence data to about 0.2 % of their original size.
arXiv Detail & Related papers (2022-02-26T12:39:19Z) - Finite-time System Identification and Adaptive Control in Autoregressive
Exogenous Systems [79.67879934935661]
We study the problem of system identification and adaptive control of unknown ARX systems.
We provide finite-time learning guarantees for the ARX systems under both open-loop and closed-loop data collection.
arXiv Detail & Related papers (2021-08-26T18:00:00Z) - Regret-optimal Estimation and Control [52.28457815067461]
We show that the regret-optimal estimator and regret-optimal controller can be derived in state-space form.
We propose regret-optimal analogs of Model-Predictive Control (MPC) and the Extended KalmanFilter (EKF) for systems with nonlinear dynamics.
arXiv Detail & Related papers (2021-06-22T23:14:21Z) - Adaptive Control and Regret Minimization in Linear Quadratic Gaussian
(LQG) Setting [91.43582419264763]
We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty.
LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model.
arXiv Detail & Related papers (2020-03-12T19:56:38Z) - NeurOpt: Neural network based optimization for building energy
management and climate control [58.06411999767069]
We propose a data-driven control algorithm based on neural networks to reduce this cost of model identification.
We validate our learning and control algorithms on a two-story building with ten independently controlled zones, located in Italy.
arXiv Detail & Related papers (2020-01-22T00:51:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.